“…To bridge the gap, numerous computational methods have been proposed to estimate individual cell type abundance from bulk RNA data of heterogeneous tissues ( Supplementary Table S1). With the bulk gene expression values as input, the abundance of each cell type from the mixed sample can be quantified by aggregating the expressions of the marker genes into an abundance score (MCP-counter 9 ), or by measuring the enrichment level of the marker genes using statistical analysis (xCell 10 ), or by deconvolution algorithms that adopt computational methods, such as least squares (quanTiseq 11 , EPIC 12 ), support vector regression (SVR) (CIBERSORT 13 , CIBERSORTx 14 ), or non-negative matrix factorization (NMF) 15 , to derive an optimal dissection of the original sample based on a set of pre-identified cell type-specific expression signatures. Obviously, regardless of the actual computational methods being used, the adoption of any of these methods as a reliable clinical routine for cell type proportion estimation requires that its underlying assumptions to be held over a large variety of cell types, tissues, and RNA sequencing conditions, which is challenging in practice.…”