The gene expression profile of a tissue averages the expression profiles of all cells in this tissue. Digital tissue deconvolution addresses the following inverse problem: given the expression profile y of a tissue, what is the cellular composition c of that tissue? If X is a matrix whose columns are reference profiles of individual cell types, the composition c can be computed by minimizing L(y-Xc) for a given loss function L. Current methods use predefined all-purpose loss functions. They successfully quantify the dominating cells of a tissue, while often falling short in detecting small cell populations. In this study we use training data to learn the loss function L along with the composition c. This allows us to adapt to applicationspecific requirements such as focusing on small cell populations or distinguishing phenotypically similar cell populations. Our method quantifies large cell fractions as accurately as existing methods and significantly improves the detection of small cell populations and the distinction of similar cell types.
Digital tissue deconvolution (DTD) estimates the cellular composition of a tissue from its bulk gene-expression profile. For this, DTD approximates the bulk as a mixture of cell-specific expression profiles. Different tissues have different cellular compositions, with cells in different activation states, and embedded in different environments. Consequently, DTD can profit from tailoring the deconvolution model to a specific tissue context. Loss-function learning adapts DTD to a specific tissue context, such as the deconvolution of blood, or a specific type of tumor tissue. We provide software for loss-function learning, for its validation and visualization, and for applying the DTD models to new data.
Motivation: The inference of cellular compositions from bulk and spatial transcriptomics data increasingly complements data analyses. Multiple computational approaches were suggested and recently, machine learning techniques were developed to systematically improve estimates. Such approaches allow to infer additional, less abundant cell types. However, they rely on training data which do not capture the full biological diversity encountered in transcriptomics analyses; data can contain cellular contributions not seen in the training data and as such, analyses can be biased or blurred. Thus, computational approaches have to deal with unknown, hidden contributions. Moreover, most methods are based on cellular prototypes which serve as a reference; e.g., a generic T-cell profile is used to infer the proportion of T-cells. It is well known that cells adapt their molecular phenotype to the environment and as such, pre-specified cell prototypes can distort the inference of cellular compositions. Results: We propose Adaptive Digital Tissue Deconvolution (ADTD) to estimate cellular proportions of pre-selected cell types together with possibly unknown and hidden background contributions. Moreover, ADTD adapts the prototypic reference profiles to the molecular environment of the cells, which allows one to resolve cell-type specific regulation from bulk transcriptomics data. The performance of ADTD was verified in simulation studies, and in an application to breast cancer data we demonstrate how ADTD can be used to gain insights into molecular differences between breast cancer sub-types. Availability and implementation: A python implementation of ADTD and a tutorial are available at https://doi.org/10.5281/zenodo.7548362
Background: The gene expression profile of a tissue averages the expression profiles of all cells in this tissue. Digital tissue deconvolution (DTD) addresses the following inverse problem: Given the expression profile y of a tissue, what is the cellular composition c of that tissue? If X is a matrix whose columns are reference profiles of individual cell types, the composition c can be computed by minimizing L(y − Xc) for a given loss function L. Current methods use predefined all-purpose loss functions. They successfully quantify the dominating cells of a tissue, while often falling short in detecting small cell populations.Results: Here we learn the loss function L along with the composition c. This allows us to adapt to application-specific requirements such as focusing on small cell populations or distinguishing phenotypically similar cell populations. Our method quantifies large cell fractions as accurately as existing methods and significantly improves the detection of small cell populations and the distinction of similar cell types. * michael.altenbuchinger@ukr.de arXiv:1801.08447v1 [q-bio.QM] 25 Jan 2018 2 Methods NotationsLet X ∈ R p×q be a matrix with cellular reference profiles X ·,j in its columns, where the dot stands for all row indices. X ij is the reference expression value of gene i in cells of type j, p the number of genes, and q the number of cell types in X, respectively. We further introduce a matrix Y ∈ R p×n with bulk profiles of n cell mixtures Y ·,k in its columns and a matrix C ∈ R q×n with the cellular compositions of the mixtures C ·,k as columns.
Senescent cells are characterized by an arrest in proliferation. In addition to replicative senescence resulting from telomere exhaustion, sub-lethal genotoxic stress resulting from DNA damage, oncogene activation, mitochondrial dysfunction or reactive metabolites also elicits a senescence phenotype. Senescence is a controlled programme affecting a wide variety of biological processes with some core hallmarks of senescence as well as tissue specific changes. This study presents an integrative multi-omic analysis of proteomic and RNA-seq from proliferating and senescent osteosarcoma cells. This study demonstrates senescence induction in a widely used cell line which can be used as a model system for characterising cancer cell responses to sub-lethal doses of chemotherapeutic agents, and makes available both RNA-seq and proteomic data from proliferating and senescent cells in open access repositories to aid reuse by the community.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.