Alex Tong

Principal Investigator

Aithyra

About Me

I am a principal investigator at Aithyra in Vienna, Austria. Aithyra is a new research institute at the intersection of machine learning and life sciences led by Michael Bronstein and funded by the Boehringer Ingelheim Foundation. If you’re interested in PhD, Postdoc, or Visiting researcher positions please feel free to reach out via email!

Previously, I was briefly an assistant professor at Duke University. Before that I did my postdoc with Yoshua Bengio working on efficient machine learning algorithms with applications to cell and molecular biology at Mila in Montreal. I completed my PhD in the computer science department at Yale University in 2021 where I was advised by Smita Krishnaswamy. My dissertation can be found here. My research interests are in generative modeling, deep learning, and optimal transport. I’m working on applying ideas from generative modeling, causal discovery, optimal transport, and graph signal processing to understand how cells develop and respond to changing conditions. I’m also interested in generative models for protein design and cofounded Dreamfold to work on these problems.

I grew up in Seattle and graduated from Tufts University in 2017 with a BS and MS in computer science. Outside of work, I love sailing and running. I am the 2019 junior North American champion in the 505 class, and I recently ran my first 50 mile race the Vermont 50!

Recent News

The Tong Group website is now live at https://tonggroup.org/.
Co-organizing the Non-Autoregressive Language Models workshop at COLM 2026 in San Francisco (Oct 9). Submissions are due June 23, 2026 (AoE) via OpenReview — please consider submitting!
Two papers accepted to ICML 2026. Checkout our work on autoregressive Boltzmann generators ARBG (Danyal) (spotlight) and topological guidance for macrocycle generation MacroGuide (Alicja Maksymiuk).
Six papers (2 Oral 4 Poster) accepted to ICLR 2026. Checkout new work on discrete diffusion model training PAPL (Fred and Zack), crystal structure prediction with OXtal (Emily and Andrei), efficient Boltzmann generators with FALCON and RegFlow (Danyal), branched generative modeling in BranchSBM (Sophia), and Topological FM (Kacper).
Three papers accepted to NeurIPS 2025. Checkout our work on non-gradient dynamics with Curly-FM (Katarina and Lazar), transferable amortized sampling with Prose (Charlie and Majdi), and annealing diffusion models for Boltzmann sampling with PITA (Tara and Yoon) (spotlight).
Joined Aithyra as a principal investigator. If you’re interested in PhD, Postdoc, or Visiting positions please feel free to reach out via email!
Congrats to Danyal Rehman and collaborators for winning the Best Paper Award at the Gen Bio workshop at ICML 2025 for our work on forward only regression training of normalizing flows FORT.
Join us at the second edition of the Frontiers in Probabilistic Inference: Sampling meets Learning workshop at NeurIPS 2025 in San Diego.

Older News (2024)

Started as an assistant professor at Duke University.
Two papers accepted to ICML 2025. Checkout our work on Feynman-Kac Correctors to guide diffusion models (spotlight) and Scaling Boltzmann Generators as well as newer workshop papers such as FORT, PITA and other works coming soon.
Congrats to Fred Zhangzhi Peng and collaborators for winning an outstanding paper award at the DELTA workshop at ICLR 2025 for our work on improved sampling from masked diffusion models in P2.
Four papers accepted to ICLR 2025. Checkout our work on steering masked and continuous diffusion and our work on flow matching in cells with greater realism in CFGen and transferrability in Meta FM.
Join us at our workshop Frontiers in Probabilistic Inference: Sampling meets Learning at ICLR 2025 in Singapore.
Presenting a tutorial on Geometric Generative Models with Heli Ben-Hamu and Joey Bose at the LoG Conference 2024.
Visiting the group of Michael Bronstein at Oxford winter 2024.
Three papers accepted to NeurIPS 2024! Awesome work led by Xi (Nicole) Zhang, Yuan Pu, Guillaume Huguet, James Vuckovic, and Kacper Kapuśniak.

Interests

Generative Modeling
Flow Models
Optimal Transport
Protein Design
Single Cell Dynamics

Education

Postdoc
Mila & University of Montreal
PhD in Computer Science, 2021
Yale University
MPhil in Computer Science, 2020
Yale University
MS in Computer Science, 2017
Tufts University
BS in Computer Science, 2017
Tufts University

Featured Publications

Danyal Rehman, Tara Akhound-Sadegh, Artem Gazizov, Yoshua Bengio, Alex Tong

April, 2026 ICLR 2026 (Oral) Archival

FALCON: Few-step Accurate Likelihoods for Continuous Flows

Scalable sampling of molecular states in thermodynamic equilibrium is a long-standing challenge in statistical physics. Boltzmann Generators tackle this problem by pairing a generative model, capable of exact likelihood computation, with importance sampling to obtain consistent samples under the target distribution. Current Boltzmann Generators primarily use continuous normalizing flows (CNFs) trained with flow matching for efficient training of powerful models. However, likelihood calculation for these models is extremely costly, requiring thousands of function evaluations per sample, severely limiting their adoption. In this work, we propose Few-step Accurate Likelihoods for Continuous Flows (FALCON), a method which allows for few-step sampling with a likelihood accurate enough for importance sampling applications by introducing a hybrid training objective that encourages invertibility. We show FALCON outperforms state-of-the-art normalizing flow models for molecular Boltzmann sampling and is two orders of magnitude faster than the equivalently performing CNF model.

Jarrid Rector-Brooks, Théophile Lambert, Marta Skreta, Daniel Roth, Yueming Long, Zi-Qi Li, Xi Zhang, Miruna Cretu, Francesca-Zhoufan Li, Tanvi Ganapathy, Emily Jin, Avishek Joey Bose, Jason Yang, Kirill Neklyudov, Yoshua Bengio, Alex Tong, Frances H. Arnold, Cheng-Hao Liu

April, 2026 arXiv Preprint

General Multimodal Protein Design Enables DNA-Encoding of Chemistry

Fred Zhangzhi Peng*, Zachary Bezemek*, Jarrid Rector-Brooks, Shuibai Zhang, Anru R. Zhang, Michael Bronstein, Avishek Joey Bose*, Alex Tong*

April, 2026 ICLR 2026 (Oral) Archival

Planner Aware Path Learning in Diffusion Language Models Training

Diffusion language models have emerged as a powerful alternative to autoregressive models, enabling fast inference through flexible and parallel generation paths. This flexibility is enabled by new sampling strategies, or planners, that iteratively choose where to denoise along the sequence rather than sampling uniformly at random. However, by modifying reverse paths, planners introduce a mismatch between the uniformly random denoising paths used during training and the planning-based paths used at inference. In this work, we systematically investigate this mismatch and theoretically show that the standard discrete diffusion training evidence lower bound (ELBO) does not accurately describe a denoiser under non-uniform planning. To bridge this gap, we derive a new Planned Evidence Lower Bound (P-ELBO) that directly incorporates planner-based reverse dynamics into the training objective. Building on this, we propose Planner Aware Path Learning (PAPL), a simple and effective modification of the standard masked discrete diffusion loss that aligns training and inference under planned denoisers. Empirically, PAPL delivers consistent improvements across domains, including a 40% relative gain in protein sequence modeling, up to a 4x improvement in MAUVE for text generation, and a 23% relative gain in HumanEval pass@10 for code generation.

Publications

Danyal Rehman, Charlie B. Tan, Yoshua Bengio, Joey Bose, Alex Tong (2026). Autoregressive Boltzmann Generators. ICML (spotlight).

PDF Cite Code arXiv

Alicja Maksymiuk, Alexandre Duplessis, Michael Bronstein, Alex Tong, Fernanda Duarte, Ismail Ceylan (2026). MacroGuide: Topological Guidance for Macrocycle Generation. ICML.

PDF Cite Code arXiv

Alex Morehead, Lazar Atanackovic, Akshata Hegde, Yanli Wang, Frimpong Boadu, Joel Selvaraj, Alex Tong, Aditi S. Krishnapriyan, Jianlin Cheng (2026). Flow matching for generative modelling in bioinformatics and computational biology. Nature Machine Intelligence, 1-18.

PDF Cite OpenReview

Sophia Tang, Yinuo Zhang, Alex Tong, Pranam Chatterjee (2026). Branched Schrödinger Bridge Matching. ICLR 2026.