About Me

Starting July 2025, I will be joining Duke University as an assistant professor in the departments of computer science, cell biology, and biostatistics and bioinformatics. I’m looking to recruit phd students for Fall 2025! If you are a prospective student interested in working with me, please apply through the Computer Science PhD program (Deadline Dec. 16) or the Computational Biology and Bioinformatics PhD program (Deadline Dec. 2) and mention me.

I am a postdoc with Yoshua Bengio working on efficient machine learning algorithms with applications to cell and molecular biology at Mila in Montreal.

I completed my PhD in the computer science department at Yale University in 2021 where I was advised by Smita Krishnaswamy. My dissertation can be found here. My research interests are in generative modeling, deep learning, and optimal transport. I’m working on applying ideas from generative modeling, causal discovery, optimal transport, and graph signal processing to understand how cells develop and respond to changing conditions. I’m also interested in generative models for protein design and cofounded Dreamfold to work on these problems.

I grew up in Seattle and graduated from Tufts University in 2017 with a BS and MS in computer science. Outside of work, I love sailing and running. I am the 2019 junior North American champion in the 505 class, and I recently ran my first 50 mile race the Vermont 50!

Recent News

Interests
  • Generative Modeling
  • Optimal Transport
  • Single Cell Dynamics
  • Protein Design
  • Graph Signal Processing
  • Causal Discovery
Education
  • PhD in Computer Science, 2021

    Yale University

  • MPhil in Computer Science, 2020

    Yale University

  • MS in Computer Science, 2017

    Tufts University

  • BS in Computer Science, 2017

    Tufts University

Preprints

(2024). Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction. Preprint.

Cite arXiv

(2024). Defining and Benchmarking Open Problems in Single-Cell Analysis. Preprint.

Cite DOI URL

(2024). Generating Multi-Modal and Multi-Attribute Single-Cell Counts with CFGen. Preprint.

Cite arXiv

(2023). Causal Inference in Gene Regulatory Networks with GFlowNet: Towards Scalability in Large Systems. arXiv.

PDF Cite

(2023). Learning Transcriptional and Regulatory Dynamics Driving Cancer Cell Plasticity Using Neural ODE-Based Optimal Transport. BioRxiv.

PDF Cite DOI

Publications

(2024). Trajectory Flow Matching with Applications to Clinical Time Series Modeling. In NeurIPS (Spotlight).

PDF Cite Code

(2024). Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation. In NeurIPS.

PDF Cite

(2024). Metric Flow Matching for Smooth Interpolations on the Data Manifold. In NeurIPS.

PDF Cite Code Logg Video

(2024). Iterated Denoising Energy Matching for Sampling from Boltzmann Densities. In ICML.

PDF Cite Code

(2024). A Computational Framework for Solving Wasserstein Lagrangian Flows. In ICML.

PDF Cite Code

(2024). SE(3)-Stochastic Flow Matching for Protein Backbone Generation. In ICLR (Spotlight).

PDF Cite Code MoML Video Logg Video VantAI Video

(2024). Simulation-Free Schrodinger Bridges via Score and Flow Matching. In AISTATS.
Also presented at Frontiers4LCD Workshop @ ICML 2023.

PDF Cite Code

(2024). Learnable Filters for Geometric Scattering Modules. In IEEE Transactions on Signal Processing.

PDF Cite arXiv

(2023). Trellis tree-based analysis reveals stromal regulation of patient-derived organoid drug responses. Cell.

PDF Cite DOI code reproducible code Tutorial Cancer Research Spotlight Yale News News-Medical

(2023). A Heat Diffusion Perspective on Geodesic Preserving Dimensionality Reduction. In NeurIPS.

PDF Cite Code

(2023). DynGFN: Bayesian Dynamic Causal Discovery Using Generative Flow Networks. In NeurIPS.
Also presented at Frontiers4LCD Workshop @ NeurIPS 2022.

PDF Cite Code

(2023). Understanding Graph Neural Networks with Generalized Geometric Scattering Transforms. In SIMODS.

PDF Cite Code

(2023). Geodesic Sinkhorn for Fast and Accurate Optimal Transport on Manifolds. In IEEE MLSP.

PDF Cite

(2023). Neural FIM for Learning Fisher Information Metrics from Point Cloud Data. In ICML.

PDF Cite

(2023). Time-Inhomogeneous Diffusion Geometry and Topology. In SIMODS.

PDF Cite SIMODS

(2023). Single-Cell Analysis Reveals Inflammatory Interactions Driving Macular Degeneration. Nature Communications.

PDF Cite DOI

(2023). Improving and Generalizing Flow-Based Generative Models with Minibatch Optimal Transport. In TMLR
Also presented at Frontiers4LCD Workshop @ ICML 2023.

PDF Cite Code Video

(2023). Graph Fourier MMD for signals on data graphs. In SAMPTA.

PDF Cite

(2022). Manifold Interpolating Optimal-Transport Flows for Trajectory Inference. In NeurIPS.

PDF Cite Code NeurIPS Arxiv

(2022). Embedding Signals on Knowledge Graphs with Unbalanced Diffusion Earth Mover's Distance. In ICASSP.

PDF Cite Code Arxiv

(2022). Immune Cells and Their Inflammatory Mediators Modify Beta Cells and Cause Checkpoint Inhibitor-Induced Diabetes. JCI Insight.

PDF Cite DOI

(2022). Topological Analysis of Single-Cell Hierarchy Reveals Inflammatory Glial Landscape of Macular Degeneration. Investigative Ophthalmology & Visual Science.

Cite

(2022). Multiscale PHATE identifies multimodal signatures of COVID-19. Nature Biotechnology.

PDF Cite DOI

(2021). MURAL: An Unsupervised Random Forest-Based Embedding for Electronic Health Record Data. In IEEE Big Data.

Cite Code Arxiv Tweetorial

(2021). A sandbox for prediction and integration of DNA, RNA, and protein data in single cells. In NeurIPS Datasets and Benchmarks.

PDF Cite Code Competition Dataset

(2021). Fixing Bias in Reconstruction-based Anomaly Detection with Lipschitz Discriminators. In Journal of Signal Processing Systems.

PDF Cite Code Slides Video

(2021). Data-Driven Learning of Geometric Scattering Networks. In IEEE MLSP.
Also presented at ML4M Workshop @ NeurIPS 2020.

PDF Cite Code Slides Video ArXiv Tweetorial

(2021). Multimodal data visualization and denoising with integrated diffusion. In IEEE MLSP.

PDF Cite Arxiv

(2021). Diffusion Earth Mover's Distance and Distribution Embeddings. In ICML.
Also presented at LMRL Workshop @ NeurIPS 2020.

PDF Cite Code Poster Slides Video Arxiv ICML Page Tweetorial Workshop Poster

(2021). POT: Python Optimal Transport. In JMLR.

PDF Cite Code

(2021). Quantifying the effect of experimental perturbations in single-cell RNA-sequencing data using graph signal processing. In Nature Biotechnology.

Cite Code DOI BioRxiv

(2020). Uncovering the Folding Landscape of RNA Secondary Structure with Deep Graph Embeddings. In IEEE Big Data.
Also at GRLB Workshop @ ICML 2020.

PDF Cite Code

(2020). TrajectoryNet: A Dynamic Optimal Transport Network for Modeling Cellular Dynamics. In ICML.
Also at LMRL Workshop @ NeurIPS 2019.

PDF Cite Code Project Slides Video Arxiv ICML Page Introductory Talk

(2020). Interpretable Neuron Structuring with Graph Spectral Regularization. In IDA
Also presented at RLGM Workshop @ ICLR 2019.

PDF Cite Code Slides Video DOI

(2019). Finding Archetypal Spaces Using Neural Networks. In IEEE Big Data.

PDF Cite Code DOI Arxiv

(2018). Allocate-On-Use Space Complexity of Shared-Memory Algorithms. In DISC.

PDF Cite DOI