Hi! This is Aref Mahjoubfar. 👋

A physician–researcher specializing in Artificial Intelligence, developing Medical AI applications and automation

This webpage aims to help you know more about me—my interests, research, projects, and insights.

I'm currently working on:

Trustworthy AI

Uncertainty quantification utilizing token-level log-probabilities for sequences in a variety of tasks. Leveraging SHAP explanation models to calibrate LLM confidence scores. Rule-based (logically-grounded) actions in terms of confidence scores and explanations.

My Journey

My fascination with technology began in childhood, when I started designing custom websites from scratch. As I grew older, I explored application development with C#, further deepening my interest in software and automation.

During my medical education, I teamed up with a group of friends (EN.SAF 😘 — Fairness in Persian) to explore machine learning with a focus on computer vision in medical practice. This led to a thesis project that involved diagnosing Acute Lymphoblastic and Myeloblastic Leukemia (ALL and AML) through the analysis of blood and bone marrow slide images, with a dataset we collected over the span of one year. Over time, I honed my skills in Python 🐍 and worked extensively with machine learning libraries like PyTorch and TensorFlow. More recently, I've expanded into Natural Language Processing and working with Large Language Models, actively designing models to push the boundaries of Medical AI.

My Vision

Life moves on, and I'm genuinely excited about the future and the next wave of technological innovations (This is what immortality is for 😉). I firmly believe in open science and the importance of combining our efforts, step by step, to lay the foundations for tomorrow's breakthroughs. This is how I can experience more in less time:)

Tools I've been working with:

Featured projects

Leukemia Slide Classifier

Classification of ALL, AML and benign cases utilizing BMA & PBS images.

Trained and evaluated lightweight CNNs on a year-collected dataset of bone marrow aspiration and peripheral blood smear images. Emphasis on reproducible pipelines and clinically relevant evaluation metrics.

Uncertainty in LLMs

Narrative review + experiments on uncertainty quantification for clinical LLM tasks.

Research on calibrating and quantifying uncertainty in large language models used for clinical extraction and decision support — combining log-prob analyses, SHAP explainers, and calibration metrics.

Tooling & Reproducibility

Scripts & notebooks for reproducible ML research (PyTorch/TensorFlow).

Standardized preprocessing, finetuning pipelines, and evaluation scripts aimed to make experiments repeatable and shareable. Focus on clear notebooks and open-science-friendly licenses.

Connect

ORCID · Google Scholar · LinkedIn · Email

ORCID Google Scholar LinkedIn Email Github