Artificial Intelligence, Machine Learning, Computer Vision, and Medical Imaging

I work at the intersection of computer vision and multimodal machine learning, with a focus on research that is reproducible, evidence grounded, and rigorously evaluated. I care a lot about understanding failure modes, hallucinations, shortcut learning, and brittle generalization, especially in high-stakes settings like medical imaging.
I’ve worked on gaze-supervised learning for chest X-rays, using human attention signals to better align diagnosis and support more faithful report generation. I’ve also contributed to long-document and long-context evaluation as part of a broader goal: making model limitations visible and measurable through stronger benchmarks and analysis.
Alongside research, I bring solid software engineering experience. I build end-to-end systems and research tooling, often with React and Tailwind on the frontend, Go on the backend, plus Docker and AWS for practical deployment.
I’m preparing to pursue a PhD in AI/ML. My goal is to advance reliable computer vision and multimodal methods that are both scientifically grounded and genuinely useful.
Relevant Coursework: Artificial Intelligence (CSE422), Neural Networks (CSE425), Algorithms (CSE221), Data Structures (CSE220), Discrete Mathematics (CSE230), Computer Graphics (CSE423).
June 2024 – Present
Fullstack Developer
July 2023 – May 2024
Software Engineer
Jan 2023 – June 2023
Backend Engineer
July 2022 – Dec 2022
Mobile Developer Intern
Built mobile UI components and integrated Firebase for real-time user tracking.
Jan 2022 – June 2022
DevOps Intern
Automated CI/CD pipelines and managed containerized deployments using Docker and Linux servers.
Sep 2021 – Dec 2021
UI/UX Design Asst.
Designed web and mobile graphics using Photoshop, Illustrator, and Dreamweaver.
Tanjim Islam Riju, Shuchismita Anwar, Saman Sarker Joy, Farig Sadeque, Swakkhar Shatabda
arXiv:2508.13068
Focused on leveraging gaze data and multimodal contrastive learning to improve medical AI systems. Developed frameworks integrating vision-language models for diagnosis and report generation.
BRAC University
BRAC University
BRAC University