Sitan Yang scite author profile

Sitan Yang

2 Publications

15 Citation Statements Received

76 Citation Statements Given


Publications


Multiclass cancer classification based on gene expression comparison

Yang¹, Naiman²

2014


As the complexity and heterogeneity of cancer are being increasingly appreciated through genomic analyses, microarray-based cancer classification comprising multiple discriminatory molecular markers is an emerging trend. Such multiclass classification problems pose new methodological and computational challenges for developing novel and effective statistical approaches. In this paper, we introduce a new approach for classifying multiple disease states associated with cancer based on gene expression profiles. Our method focuses on detecting small sets of genes in which the relative comparison of their expression values leads to class discrimination. For an m-class problem, the classification rule typically depends on a small number of m-gene sets, which provide transparent decision boundaries and allow for potential biological interpretations. We first test our approach on seven common gene expression datasets and compare it with popular classification methods, including support vector machines and random forests. We then consider an extremely large leukemia cohort to further assess its effectiveness. In both experiments, our method yields results comparable to or even better than those of benchmark classifiers. In addition, we demonstrate that our approach can integrate pathway analysis of gene expression to provide accurate and biologically meaningful classification.
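To make the idea of a comparison-based rule concrete, here is a minimal sketch under stated assumptions; it is not the authors' implementation. It assumes one simple reading of "relative comparison": for an m-class problem, an ordered set of m genes votes for class k when the k-th gene in the set has the highest expression. The function names, the exhaustive search, and the toy data are all hypothetical and chosen only for illustration.

```python
from itertools import permutations


def predict_by_comparison(sample, gene_set):
    """Return class index k when gene_set[k] has the highest expression.

    Assumed rule for illustration only: the decision depends solely on
    which gene in the set is largest, not on absolute expression levels.
    """
    values = [sample[g] for g in gene_set]
    return max(range(len(values)), key=values.__getitem__)


def score_gene_set(samples, labels, gene_set):
    """Fraction of samples the comparison rule classifies correctly."""
    correct = sum(
        predict_by_comparison(s, gene_set) == y for s, y in zip(samples, labels)
    )
    return correct / len(samples)


def best_gene_set(samples, labels, n_classes):
    """Exhaustively search ordered m-gene sets (feasible only for toy data)."""
    n_genes = len(samples[0])
    return max(
        permutations(range(n_genes), n_classes),
        key=lambda gs: score_gene_set(samples, labels, gs),
    )


# Invented toy data: 6 samples, 4 genes, 3 classes.
samples = [
    [9.0, 2.0, 1.0, 5.0],
    [8.0, 3.0, 2.0, 5.0],
    [1.0, 7.0, 2.0, 5.0],
    [2.0, 9.0, 3.0, 5.0],
    [1.0, 2.0, 8.0, 5.0],
    [3.0, 1.0, 6.0, 5.0],
]
labels = [0, 0, 1, 1, 2, 2]

chosen = best_gene_set(samples, labels, n_classes=3)
accuracy = score_gene_set(samples, labels, chosen)
```

Because the rule only compares values within a sample, it is unaffected by any monotone rescaling applied to that sample, which is one reason such rules are easy to interpret. The exhaustive search above would not scale to real expression data with thousands of genes.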


MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation

Yang¹, Eisenach², Madeka³

2022

Preprint


Multi-horizon probabilistic time series forecasting has wide applicability to real-world tasks such as demand forecasting. Recent work in neural time-series forecasting mainly focuses on the use of Seq2Seq architectures (Sutskever et al., 2014). For example, MQTransformer (Eisenach et al., 2020), an improvement on MQCNN (Wen et al., 2017), has shown state-of-the-art performance in probabilistic demand forecasting. In this paper, we consider incorporating cross-entity information to enhance model performance by adding a cross-entity attention mechanism along with a retrieval mechanism to select which entities to attend over. We demonstrate how our new neural architecture, MQRetNN, leverages the encoded contexts from a pretrained baseline model on the entire population to improve forecasting accuracy. Using MQCNN as the baseline model (due to computational constraints, we do not use MQTransformer), we first show on a small demand forecasting dataset that it is possible to achieve a ~3% improvement in test loss by adding a cross-entity attention mechanism where each entity attends to all others in the population. We then evaluate the model with our proposed retrieval methods (as a means of approximating attention over a large population) on a large-scale demand forecasting application with over 2 million products and observe a ~1% performance gain over the MQCNN baseline.
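The combination of retrieval and cross-entity attention can be sketched as follows. This is a minimal illustration under assumptions, not the MQRetNN architecture: it assumes each entity already has a fixed-length encoded context vector, uses dot-product similarity for retrieval, and applies plain scaled dot-product attention with no learned projections. All function names and the toy vectors are hypothetical.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def retrieve_top_k(query, contexts, k, exclude=None):
    """Indices of the k contexts most similar to the query (dot product).

    Assumed retrieval rule for illustration; `exclude` skips the query's
    own index so an entity does not retrieve itself.
    """
    candidates = [i for i in range(len(contexts)) if i != exclude]
    candidates.sort(key=lambda i: dot(query, contexts[i]), reverse=True)
    return candidates[:k]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entity_attention(query, contexts, indices):
    """Attention-weighted average of the selected entities' contexts."""
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, contexts[i]) / scale for i in indices])
    return [
        sum(w * contexts[i][d] for w, i in zip(weights, indices))
        for d in range(len(query))
    ]


# Invented toy contexts for 4 entities, each a 2-dimensional vector.
contexts = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [-1.0, 0.0]]

# Entity 0 attends only to its 2 retrieved neighbours, not all entities.
neighbours = retrieve_top_k(contexts[0], contexts, k=2, exclude=0)
mixed = cross_entity_attention(contexts[0], contexts, neighbours)
```

Attending over k retrieved entities instead of the whole population reduces the per-entity attention cost from the population size to k, which is the motivation the abstract gives for retrieval at the scale of millions of products.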



