Poster
in
Workshop: AI for Nucleic Acids (AI4NA)
RNAlign: Alignment of tumor and cell line transcriptomes using conditional VAEs
Jacob Alvarez · Kiran Krishnamachari · Anders Skanderup
Preclinical cancer models such as cancer cell lines (CL) are central to cancer research but can poorly represent tumor samples due to fundamental differences like stromal cell contamination or in-vitro adaptation. This hinders the translation of new biomarkers or therapeutics into the clinical setting, leading to false leads, failed clinical trials, and the need for expensive multiomics pipelines to reconcile data sets. In this work, we build on conditional variational auto-encoders (CVAE) to enable the direct comparison or selection of the most representative CL for cancer research. We introduce RNAlign (pronounced RNA-align), a CVAE framework with novel regularization techniques, to enable pan-cancer alignment of tumor and CL gene expression profiles. The resulting learned transformation achieves state-of-the-art removal of the most significant differences between the model types, while preserving biologically important subtype information. This framework is extendable to other tumor models such as organoids and can be directly integrated into existing workflows to guide clinical precision medicine.