Adversarial Deconfounding Autoencoder for Learning Robust Gene Expression Embeddings

BioRxiv : the Preprint Server for Biology
A. B. DincerSu-In Lee

Abstract

Motivation: Increasing number of gene expression profiles has enabled the use of complex models, such as deep unsupervised neural networks, to extract a latent space from these profiles. However, expression profiles, especially when collected in large numbers, inherently contain variations introduced by technical artifacts (e.g., batch effects) and uninteresting biological variables (e.g., age) in addition to the true signals of interest. These sources of variations, called confounders, produce embeddings that fail to transfer to different domains, i.e., an embedding learned from one dataset with a specific confounder distribution does not generalize to different distributions. To remedy this problem, we attempt to disentangle confounders from true signals to generate biologically informative embeddings. Results: In this paper, we introduce the AD-AE (Adversarial Deconfounding AutoEncoder) approach to deconfounding gene expression latent spaces. The AD-AE model consists of two neural networks: (i) an autoencoder to generate an embedding that can reconstruct original measurements, and (ii) an adversary trained to predict the confounder from that embedding. We jointly train the networks to generate embeddings that can encode as m...Continue Reading

Related Concepts

Hic1
Genomics
Chromosomes
Cell Type
Chromosomes, Mammalian
HIST1H1C
Plasma Cells
Laboratory mice
Single cell proteins
Research Study

Related Feeds

BioRxiv & MedRxiv Preprints

BioRxiv and MedRxiv are the preprint servers for biology and health sciences respectively, operated by Cold Spring Harbor Laboratory. Here are the latest preprint articles (which are not peer-reviewed) from BioRxiv and MedRxiv.

Related Papers

Nature Methods
Vijay RamaniJay Shendure
Nature Reviews. Microbiology
Alan Walker, Julian Parkhill
Environmental Microbiology
Howard Ochman
Nature Biotechnology
Clyde A Hutchison, J Craig Venter
Briefings in Functional Genomics
Martin Hemberg
© 2020 Meta ULC. All rights reserved