scTree: Discovering Cellular Hierarchies in the Presence of Batch Effects in scRNA-seq Data
Jul 26, 2024

Moritz Vandenhirtz*
Florian Barkmann*
Laura Manduchi
Julia E Vogt
Valentina Boeva
Abstract
We propose scTree, a novel method for single-cell Tree Variational Autoencoders that extends a hierarchical clustering approach to single-cell RNA sequencing (scRNA-seq) data. scTree corrects for batch effects while simultaneously learning a tree-structured data representation. This VAE-based method allows for a more in-depth understanding of complex cellular landscapes, independent of the biasing effects of batches. We show empirically on seven datasets that scTree discovers the underlying clusters of the data and the hierarchical relations between them, and that it outperforms established baseline methods across these datasets. Additionally, we analyze the learned hierarchy to understand its biological relevance, thus underpinning the importance of integrating batch correction directly into the clustering procedure.
Type
Publication
Oral @ Efficient Foundation Models for Biological Discovery Workshop at ICML 2024