Alona Levy-Jurgenson scite author profile

Digital analysis of pathology whole-slide images is fast becoming a game changer in cancer diagnosis and treatment. Specifically, deep learning methods have shown great potential to support pathology analysis, with recent studies identifying molecular traits that were not previously recognized in pathology H&E whole-slide images. Simultaneous to these developments, it is becoming increasingly evident that tumor heterogeneity is an important determinant of cancer prognosis and susceptibility to treatment, and should therefore play a role in the evolving practices of matching treatment protocols to patients. State of the art diagnostic procedures, however, do not provide automated methods for characterizing and/or quantifying tumor heterogeneity, certainly not in a spatial context. Further, existing methods for analyzing pathology whole-slide images from bulk measurements require many training samples and complex pipelines. Our work addresses these two challenges. First, we train deep learning models to spatially resolve bulk mRNA and miRNA expression levels on pathology whole-slide images (WSIs). Our models reach up to 0.95 AUC on held-out test sets from two cancer cohorts using a simple training pipeline and a small number of training samples. Using the inferred gene expression levels, we further develop a method to spatially characterize tumor heterogeneity. Specifically, we produce tumor molecular cartographies and heterogeneity maps of WSIs and formulate a heterogeneity index (HTI) that quantifies the level of heterogeneity within these maps. Applying our methods to breast and lung cancer slides, we show a significant statistical link between heterogeneity and survival. Our methods potentially open a new and accessible approach to investigating tumor heterogeneity and other spatial molecular properties and their link to clinical characteristics, including treatment susceptibility and survival.

CRISPECTOR provides accurate estimation of genome editing translocation and off-target activity from comparative NGS data

Amit, Iancu, Levy-Jurgenson, et al. (2021). Nat Commun.

Controlling off-target editing activity is one of the central challenges in making CRISPR technology accurate and applicable in medical practice. Current algorithms for analyzing off-target activity do not provide statistical quantification, are not sufficiently sensitive in separating signal from noise in experiments with low editing rates, and do not address the detection of translocations. Here we present CRISPECTOR, a software tool that supports the detection and quantification of on- and off-target genome-editing activity from NGS data using paired treatment/control CRISPR experiments. In particular, CRISPECTOR facilitates the statistical analysis of NGS data from multiplex-PCR comparative experiments to detect and quantify adverse translocation events. We validate the observed results and show independent evidence of the occurrence of translocations in human cell lines, after genome editing. Our methodology is based on a statistical model comparison approach leading to better false-negative rates in sites with weak yet significant off-target activity.

Spatial Transcriptomics Inferred from Pathology Whole-Slide Images Links Tumor Heterogeneity to Survival in Breast and Lung Cancer

Levy-Jurgenson, Tekpli, Kristensen, et al. (2020). Preprint.

Digital analysis of pathology whole-slide images is fast becoming a game changer in cancer diagnosis and treatment. Specifically, deep learning methods have shown great potential to support pathology analysis, with recent studies identifying molecular traits that were not previously recognized on pathology H&E whole-slide images. Simultaneous to these developments, it is becoming increasingly evident that tumor heterogeneity is an important determinant of cancer prognosis and susceptibility to treatment, and should therefore play a role in the evolving practices of matching treatment protocols to patients. State-of-the-art diagnostic procedures, however, do not provide scalable methods for characterizing and/or quantifying tumor heterogeneity, certainly not in a spatial context. In this paper, we present a scalable approach that accurately and automatically spatially resolves mRNA and miRNA expression levels on pathology whole-slide images. This is the first demonstration of this type of inference from H&E images. We use this method to produce tumor molecular cartographies and to characterize certain aspects of tumor spatial transcriptomics. Specifically, we develop a heterogeneity index (HTI), derived from the molecular cartographies. Applying our methods to breast and lung cancer slides, we show a significant statistical link between heterogeneity and survival. Our results highlight the value of automated analysis of pathology whole-slide images. Our methods potentially open a new approach to investigating tumor heterogeneity and other spatial molecular properties and their link to clinical characteristics, including treatment susceptibility and survival.

Predicting Methylation from Sequence and Gene Expression Using Deep Learning with Attention

Levy-Jurgenson, Tekpli, Kristensen, et al. (2018). Preprint.

DNA methylation has been extensively linked to alterations in gene expression, playing a key role in the manifestation of multiple diseases, most notably cancer. For this reason, researchers have long been measuring DNA methylation in living organisms. The relationship between methylation and expression, and between methylation in different genomic regions, is of great theoretical interest from a molecular biology perspective. Therefore, several models have been suggested to support the prediction of methylation status in samples. These models, however, have two main limitations: (a) they heavily rely on partially measured methylation levels as input, somewhat defeating the object as one is required to collect measurements from the sample of interest before applying the model; and (b) they are largely based on human-mediated feature engineering, thus preventing the model from unveiling its own representations. To address these limitations we used deep learning, with an attention mechanism, to produce a general model that predicts DNA methylation for a given sample in any CpG position based solely on the sample's gene expression profile and the sequence surrounding the CpG. We show that our model is capable of generalizing to a completely separate test set of CpG positions and subjects. Depending on gene-CpG proximity conditions, our model can attain a Spearman correlation of up to 0.8 and MAE of 0.14 for thousands of CpG sites in the test data. We also identify and analyze several motifs and genes that our model suggests may be linked to methylation activity, such as Nodal and Hand1. Moreover, our approach, and most notably the use of attention mechanisms, offers a novel framework with which to extract valuable insights from gene expression data when combined with sequence information. The code and trained models are available at: https://github.com/YakhiniGroup/Methylation

Predicting Methylation from Sequence and Gene Expression Using Deep Learning with Attention

Levy-Jurgenson, Tekpli, Kristensen, et al. (2019).

Assessing heterogeneity in spatial data using the HTA index with applications to spatial transcriptomics and imaging

Levy-Jurgenson, Tekpli, Yakhini (2021).

Motivation: Tumour heterogeneity is being increasingly recognised as an important characteristic of cancer and as a determinant of prognosis and treatment outcome. Emerging spatial transcriptomics data hold the potential to further our understanding of tumour heterogeneity and its implications. However, existing statistical tools are not sufficiently powerful to capture heterogeneity in the complex setting of spatial molecular biology.

Results: We provide a statistical solution, the HeTerogeneity Average index (HTA), specifically designed to handle the multivariate nature of spatial transcriptomics. We prove that HTA has an approximately normal distribution, therefore lending itself to efficient statistical assessment and inference. We first demonstrate that HTA accurately reflects the level of heterogeneity in simulated data. We then use HTA to analyse heterogeneity in two cancer spatial transcriptomics datasets: spatial RNA sequencing by 10x Genomics and spatial transcriptomics inferred from H&E. Finally, we demonstrate that HTA also applies to 3D spatial data using brain MRI. In spatial RNA sequencing we use a known combination of molecular traits to assert that HTA aligns with the expected outcome for this combination. We also show that HTA captures immune-cell infiltration at multiple resolutions. In digital pathology we show how HTA can be used in survival analysis and demonstrate that high levels of heterogeneity may be linked to poor survival. In brain MRI we show that HTA differentiates between normal ageing, Alzheimer’s disease and two tumours. HTA also extends beyond molecular biology and medical imaging, and can be applied to many domains, including GIS.

Availability: Python package and source code are available at: https://github.com/alonalj/hta

Supplementary information: Supplementary data are available at Bioinformatics online.

Erratum to: Assessing heterogeneity in spatial data using the HTA index with applications to spatial transcriptomics and imaging

Levy-Jurgenson, Tekpli, Yakhini (2021).

Assessing heterogeneity in spatial data using the HTA index with applications to spatial transcriptomics and imaging

Levy-Jurgenson, Tekpli, Yakhini (2021). Preprint.

Tumour heterogeneity is being increasingly recognised as an important characteristic of cancer and as a determinant of prognosis and treatment outcome. Emerging spatial transcriptomics data hold the potential to further our understanding of tumour heterogeneity and its implications. However, existing statistical tools are not sufficiently powerful to capture heterogeneity in the complex setting of spatial molecular biology. We provide a statistical solution, the HeTerogeneity Average index (HTA), specifically designed to handle the multivariate nature of spatial transcriptomics. We prove that HTA has an approximately normal distribution, therefore lending itself to efficient statistical assessment and inference. We first demonstrate that HTA accurately reflects the level of heterogeneity in simulated data. We then use HTA to analyse heterogeneity in two cancer spatial transcriptomics datasets: spatial RNA sequencing by 10x Genomics and spatial transcriptomics inferred from H&E. Finally, we demonstrate that HTA also applies to 3D spatial data using brain MRI. In spatial RNA sequencing we use a known combination of molecular traits to assert that HTA aligns with the expected outcome for this combination. We also show that HTA captures immune-cell infiltration at multiple resolutions. In digital pathology we show how HTA can be used in survival analysis and demonstrate that high levels of heterogeneity may be linked to poor survival. In brain MRI we show that HTA differentiates between normal ageing, Alzheimer's disease and two tumours. HTA also extends beyond molecular biology and medical imaging, and can be applied to many domains, including GIS. Source code and Python package are available at: https://alonalj.github.io/HTA.html
