Workshop
Machine Learning for Genomics Explorations (MLGenX)
Ehsan Hajiramezanali · Arman Hasanzadeh · Tommaso Biancalani · Eric Nguyen · Ying Jin · Maria Brbic · Aviv Regev · Fabian Theis
Lehar 1
Sat 11 May, midnight PDT
The critical bottleneck in drug discovery remains our limited understanding of the biological mechanisms underlying disease. As a result, we often do not know why patients develop specific diseases, and many drug candidates fail in clinical trials. Advances in genomics platforms and the growth of diverse omics datasets have sparked growing interest in studying these mechanisms. In parallel, machine learning has played a pivotal role in improving success rates in language processing, image analysis, and molecular design. The boundary between the two fields is becoming increasingly blurred, particularly with the emergence of modern foundation models that sit at the intersection of data-driven approaches, self-supervised techniques, and genomic exploration. This workshop aims to clarify the relationship between genomics, target identification, and fundamental machine learning methods. Strengthening the connection between machine learning and target identification via genomics should open new possibilities for interdisciplinary research in these areas.
Schedule
Sat 12:00 a.m. - 12:15 a.m.
|
Opening Remarks
SlidesLive Video |
Organizers 🔗 |
Sat 12:15 a.m. - 12:50 a.m.
|
Functional Causal Bayesian Optimization and DiscoGen for Learning Optimal Interventions and Inferring Gene Regulatory Networks ( Invited Talk I ) > link SlidesLive Video | Silvia Chiappa 🔗 |
Sat 12:50 a.m. - 1:00 a.m.
|
Coffee Break
|
🔗 |
Sat 1:00 a.m. - 1:35 a.m.
|
Leveraging (natural) language models for biology ( Invited Talk II ) > link SlidesLive Video | James Y Zou 🔗 |
Sat 1:40 a.m. - 2:00 a.m.
|
DNA-DIFFUSION: Leveraging generative models for controlling chromatin accessibility and gene expression via synthetic regulatory elements ( Oral Paper I ) > link SlidesLive Video | Luca Pinello 🔗 |
Sat 2:05 a.m. - 2:25 a.m.
|
Dirichlet Flow Matching with Applications to DNA Sequence Design ( Oral Paper II ) > link SlidesLive Video | Gabriele Corso 🔗 |
Sat 2:25 a.m. - 2:40 a.m.
|
Break and Poster Setup
|
🔗 |
Sat 2:39 a.m. - 4:40 a.m.
|
Poster Session and Lunch (provided)
|
🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Optimizing Genetically-Driven Synaptogenesis ( Poster ) > link | Tommaso Boccato · Matteo Ferrante · Nicola Toschi 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Contrastive Poincaré Maps for single-cell data analysis ( Poster ) > link | Nithya Bhasker · Hattie Chung · Louis Boucherie · Vladislav Kim · Stefanie Speidel · Melanie Weber 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Whole Genome Transformers for Gene Interaction Effects in Microbiome Habitat Prediction ( Poster ) > link | Li · Sandeep Cranganore · Nicholas Youngblut · Niki Kilbertus 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Joint Embedding of Transcriptomes and Text Enables Interactive Single-Cell RNA-seq Data Exploration via Natural Language ( Poster ) > link | Moritz Schaefer · Peter Peneder · Daniel Malzl · Anna Hakobyan · Varun Sharma · Thomas Krausgruber · Jörg Menche · Eleni Tomazou · Christoph Bock 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
cellFlow: a generative flow-based model for single-cell count data ( Poster ) > link | Alessandro Palma · Till Richter · Hanyi Zhang · Andrea Dittadi · Fabian Theis 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Multi-Modal Contrastive Learning for Proteins by Combining Domain-Informed Views ( Poster ) > link | Haotian Xu · Yuning You · Yang Shen 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Scalable Amortized GPLVMs for Single Cell Transcriptomics Data ( Poster ) > link | Sarah Zhao · Aditya Ravuri · Vidhi R Lalchand · Neil Lawrence 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
AcceleratedLiNGAM: Learning causal DAGs at the speed of GPUs ( Poster ) > link | Victor Akinwande · J Kolter 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Cell-Type Prediction in Spatial Transcriptomics Data using Graph Neural Networks ( Poster ) > link | Moritz Lampert · Christopher Blöcker · Ingo Scholtes · Dominic Grün 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Active learning to discover pairwise genetic interactions via representation learning ( Poster ) > link | Moksh Jain · Alisandra Denton · Shawn Whitfield · Aniket Rajiv Didolkar · Berton Earnshaw · Jason Hartford 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
EvoSBDD: Latent Evolution for Accurate and Efficient Structure-Based Drug Design ( Poster ) > link | Danny Reidenbach 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
sc-OTGM: Single-Cell Perturbation Modeling by Solving Optimal Mass Transport on the Manifold of Gaussian Mixtures ( Poster ) > link | Andac Demir · Elizaveta Solovyeva · Jamie Boylan · Mei Xiao · Fabrizio Serluca · Sebastian Hoersch · Murthy Devarakonda · Bulent Kiziltan 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Evaluating predictive patterns of antigen specific B cells by single cell transcriptome and antibody repertoire sequencing ( Poster ) > link | Lena Erlach · Raphael Kuhn · Andreas Agrafiotis · Danielle Shlesinger · Alexander Yermanos · Sai Reddy 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Robust Symbolic Regression for Network Trajectory Inference ( Poster ) > link | Ramzi Dakhmouche · Ivan Lunati · Hossein Gorji 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments ( Poster ) > link | Yusuf Roohani · Jian Vora · Qian Huang · Percy Liang · Jure Leskovec 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Sample, estimate, aggregate: A recipe for causal discovery foundation models ( Poster ) > link | Menghua (Rachel) Wu · Yujia Bao · Regina Barzilay · Tommi Jaakkola 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Drug Discovery with Dynamic Goal-aware Fragments ( Poster ) > link | Seul Lee · Seanie Lee · Kenji Kawaguchi · Sung Ju Hwang 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
DNA-DIFFUSION: Leveraging generative models for controlling chromatin accessibility and gene expression via synthetic regulatory elements ( Poster ) > link | Luca Pinello 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Biologically Interpretable VAE with Supervision for Transcriptomics Data Under Ordinal Perturbations ( Poster ) > link | Seyednami Niyakan · Xihaier Luo · Byung-Jun Yoon · Xiaoning Qian 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Dirichlet Flow Matching with Applications to DNA Sequence Design ( Poster ) > link | Hannes Stärk · Bowen Jing · Chenyu Wang · Gabriele Corso · Bonnie Berger · Regina Barzilay · Tommi Jaakkola 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Enhancing generative perturbation models with LLM-informed gene embeddings ( Poster ) > link | Kaspar Märtens · Rory Donovan-Maiye · Jesper Ferkinghoff-Borg 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Interpretable and Generalizable Graph Learning via Subgraph Multilinear Extension ( Poster ) > link | Yongqiang Chen · Yatao Bian · Bo Han · James Cheng 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Fine-tuning Protein Language Models with Deep Mutational Scanning improves Variant Effect Prediction ( Poster ) > link | Aleix Lafita · Ferran Gonzalez · Mahmoud Hossam · Paul Smyth · Jacob Deasy · Ari Allyn-Feuer · Daniel Seaton · Stephen Young 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
DARKIN: A zero-shot classification benchmark and an evaluation of protein language models ( Poster ) > link | Emine Ayşe Sunar · Zeynep Işık · Mert Pekey · Ramazan Gokberk Cinbis · Oznur Tastan 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Expanding Genomic Discovery: Causally-Inspired Neural Networks for Predicting Therapeutic Targets ( Poster ) > link | Guadalupe Gonzalez · Isuru Herath · Kirill Veselkov · Michael Bronstein · Marinka Zitnik 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Season combinatorial intervention predictions with Salt & Peper ( Poster ) > link | Thomas Gaudelet · Alice Del Vecchio · Eli Carrami · Juliana Cudini · Chantriolnt-Andreas Kapourani · Caroline Uhler · Lindsay Edwards 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
DNA language models identify variants predictive across the human phenome ( Poster ) > link | Benjamin Wild · Julius Upmeier zu Belzen · Luis Herrmann · Paul Kittner · Roland Eils 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
IST-editing: Infinite spatial transcriptomic editing in a generated gigapixel mouse pup ( Poster ) > link | Jiqing Wu · Ingrid Berg · Viktor Koelzer 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Pairing interacting protein sequences using masked language modeling ( Poster ) > link | Damiano Sgarbossa · Umberto Lupo · Anne-Florence Bitbol 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Multi-ContrastiveVAE disentangles perturbation effects in single cell images from optical pooled screens ( Poster ) > link | Jerry Wang · Romain Lopez · Jan-Christian Huetter · Takamasa Kudo · Heming Yao · Philipp Hanslovsky · Burkhard Hoeckendorf · Rahul Mohan · David Richmond · Aviv Regev 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
ResTran: A GNN Alternative to Learn A Graph with Features ( Poster ) > link | Shota Saito · Takanori Maehara · Mark Herbster 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
A mechanistically interpretable neural-network architecture for discovery of regulatory genomics ( Poster ) > link | Alex M Tseng · Gökcen Eraslan · Nathaniel Diamant · Tommaso Biancalani · Gabriele Scalia 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Deep Learning and Direct Sequencing of Labeled RNA Captures Transcriptome Dynamics ( Poster ) > link | Vlastimil Martinek · Jessica Martin · Cedric Belair · Matthew Payea · Sulochan Malla · Panagiotis Alexiou · Manolis Maragkakis 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Advancing DNA Language Models: The Genomics Long-Range Benchmark ( Poster ) > link | Chia Hsiang Kao · Evan Trop · McKinley Polen · Yair Schiff · Bernardo Almeida · Aaron Gokaslan · Thomas Pierrot · Volodymyr Kuleshov 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Recurrent memory augmentation of GENA-LM improves performance on long DNA sequence tasks ( Poster ) > link | Yuri Kuratov · Aleksei Shmelev · Veniamin Fishman · Olga Kardymon · Mikhail Burtsev 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Integration of Graph Neural Network and Neural-ODEs for Tumor Dynamics Prediction ( Poster ) | Omid Bazgir · Zichen Wang · Ji Won Park · Marc Hafner · James Lu 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
NICHEVI: A Probabilistic Framework to Embed Cellular Interaction in Spatial Transcriptomics ( Poster ) > link | Nathan Levy · Florian Ingelfinger · Can Ergen-Behr · Boaz Nadler 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
scvi-hub: A flexible framework for reference enabled single-cell data analysis ( Poster ) > link | Can Ergen 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Learning Drug Perturbations via Conditional Map Estimators ( Poster ) > link | Benedek Harsanyi · Marianna Rapsomaniki · Jannis Born 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Unveiling Zero Shot Prediction for Gene Attributes Through Interpretable AI ( Poster ) > link | Ala Jararweh · Oladimeji Macaulay · David Arredondo · Olufunmilola Oyebamiji · Luis Tafoya · Kushal Virupakshappa · Avinash Sahu 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Are Genomic Language Models All You Need? Exploring Genomic Language Models on Protein Downstream Tasks ( Poster ) > link | Sam Boshar · Evan Trop · Bernardo Almeida · Thomas Pierrot 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Disentanglement via Mechanism Sparsity by Replaying Realizations of the Past ( Poster ) > link | Soroor Hediyeh-zadeh · Tom Fischer · Fabian Theis 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Propensity Score Alignment of Unpaired Multimodal Data ( Poster ) > link | Johnny Xi · Jason Hartford 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Protein Representation Learning by Capturing Protein Sequence-Structure-Function Relationship ( Poster ) > link | Eunji Ko · Seul Lee · Minseon Kim · Dongki Kim · Sung Ju Hwang 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Evaluating Spatial Encoding Strategies for Cell Type Annotation with Spatial Omics Data ( Poster ) > link | Merel Kuijs · Alma Andersson · Ehsan Hajiramezanali · Tommaso Biancalani · Aicha BenTaieb 🔗 |
Sat 2:40 a.m. - 4:40 a.m.
|
Multi-Resolution Graph Diffusion ( Poster ) > link | Mahdi Karami · Igor Krawczuk · Volkan Cevher 🔗 |
Sat 4:40 a.m. - 5:40 a.m.
|
Panel Discussion
SlidesLive Video |
Kyunghyun Cho · Lindsay Edwards · Nicola Richmond · Michael Bronstein 🔗 |
Sat 5:45 a.m. - 6:20 a.m.
|
Efficiently detecting interactions from high dimensional observations of pairwise perturbations ( Invited Talk III ) > link SlidesLive Video | Jason Hartford 🔗 |
Sat 6:20 a.m. - 6:35 a.m.
|
Coffee Break
|
🔗 |
Sat 6:35 a.m. - 6:55 a.m.
|
Season combinatorial intervention predictions with Salt & Peper ( Oral Paper III ) > link SlidesLive Video | Thomas Gaudelet 🔗 |
Sat 6:55 a.m. - 7:15 a.m.
|
A mechanistically interpretable neural-network architecture for discovery of regulatory genomics ( Oral Paper IV ) > link SlidesLive Video | Alex M Tseng 🔗 |
Sat 7:20 a.m. - 7:55 a.m.
|
Evo: Long-context modeling from molecular to genome scale ( Invited Talk IV ) > link SlidesLive Video | Brian Hie 🔗 |
Sat 7:55 a.m. - 8:00 a.m.
|
Closing Remarks
|
Organizers 🔗 |