I am a principal investigator at Aithyra in Vienna, Austria. Aithyra is a new research institute at the intersection of machine learning and life sciences led by Michael Bronstein and funded by the Boehringer Ingelheim Foundation. If you’re interested in PhD, Postdoc, or Visiting researcher positions please feel free to reach out via email!
Previously, I was briefly an assistant professor at Duke University. Before that I did my postdoc with Yoshua Bengio working on efficient machine learning algorithms with applications to cell and molecular biology at Mila in Montreal. I completed my PhD in the computer science department at Yale University in 2021 where I was advised by Smita Krishnaswamy. My dissertation can be found here. My research interests are in generative modeling, deep learning, and optimal transport. I’m working on applying ideas from generative modeling, causal discovery, optimal transport, and graph signal processing to understand how cells develop and respond to changing conditions. I’m also interested in generative models for protein design and cofounded Dreamfold to work on these problems.
I grew up in Seattle and graduated from Tufts University in 2017 with a BS and MS in computer science. Outside of work, I love sailing and running. I am the 2019 junior North American champion in the 505 class, and I recently ran my first 50 mile race the Vermont 50!
Postdoc
Mila & University of Montreal
PhD in Computer Science, 2021
Yale University
MPhil in Computer Science, 2020
Yale University
MS in Computer Science, 2017
Tufts University
BS in Computer Science, 2017
Tufts University

Diffusion language models have emerged as a powerful alternative to autoregressive models, enabling fast inference through flexible and parallel generation paths. This flexibility is enabled by new sampling strategies, or planners, that iteratively choose where to denoise along the sequence rather than sampling uniformly at random. However, by modifying reverse paths, planners introduce a mismatch between the uniformly random denoising paths used during training and the planning-based paths used at inference. In this work, we systematically investigate this mismatch and theoretically show that the standard discrete diffusion training evidence lower bound (ELBO) does not accurately describe a denoiser under non-uniform planning. To bridge this gap, we derive a new Planned Evidence Lower Bound (P-ELBO) that directly incorporates planner-based reverse dynamics into the training objective. Building on this, we propose Planner Aware Path Learning (PAPL), a simple and effective modification of the standard masked discrete diffusion loss that aligns training and inference under planned denoisers. Empirically, PAPL delivers consistent improvements across domains, including a 40% relative gain in protein sequence modeling, up to a 4x improvement in MAUVE for text generation, and a 23% relative gain in HumanEval pass@10 for code generation.
PDF Cite DOI code reproducible code Tutorial Cancer Research Spotlight Yale News News-Medical
PDF Cite Code Poster Slides Video Arxiv ICML Page Tweetorial Workshop Poster
PDF Cite Code Slides Video Yale Daily Article Best Paper Award
PDF Cite Code Project Slides Video Arxiv ICML Page Introductory Talk