2022
DOI: 10.48550/arxiv.2206.02353
Preprint
Beyond Just Vision: A Review on Self-Supervised Representation Learning on Multimodal and Temporal Data

Abstract: Recently, Self-Supervised Representation Learning (SSRL) has attracted much attention in the fields of computer vision, speech, natural language processing (NLP), and, more recently, other modalities, including time series from sensors. The popularity of self-supervised learning is driven by the fact that traditional models typically require a huge amount of well-annotated data for training. Acquiring annotated data can be a difficult and costly process. Self-supervised methods have been introduced to i…

Cited by 6 publications (6 citation statements)
References 80 publications
“…In the field of healthcare and physiological applications, SSL faces additional challenges due to the heterogeneous nature of data acquired from different sensors with varying characteristics, sampling rates, and resolutions (Deldari et al, 2022a). The dynamic nature of real-world situations further complicates the aggregation and compression of multimodal sensor data into a coherent global embedding suitable for downstream tasks.…”
Section: ICML Workshop on Machine…
confidence: 99%
“…This section focuses on approaches that are compatible with biosignals and that were encountered during the survey. Various classification schemes have been proposed for pretext tasks, depending on the domain of application [33]. Here, methodologies will be grouped into the following categories: predictive pretext tasks, generative pretext tasks, and contrastive learning pretext tasks.…”
Section: A. Pretext Tasks
confidence: 99%
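The contrastive category named in the excerpt above can be illustrated with a minimal InfoNCE-style loss over pairs of augmented views, the objective popularized by SimCLR-like methods. This is a generic sketch, not the implementation used in any of the cited papers; the function name, batch layout, and temperature value are illustrative assumptions.

```python
import numpy as np

def info_nce_loss(z_a, z_b, temperature=0.5):
    """InfoNCE contrastive loss over a batch of embedding pairs.

    z_a, z_b: (batch, dim) arrays; row i of each is a different
    augmented "view" of the same underlying sample, so (z_a[i], z_b[i])
    is the positive pair and all other rows serve as negatives.
    """
    # L2-normalise so the dot product is cosine similarity.
    z_a = z_a / np.linalg.norm(z_a, axis=1, keepdims=True)
    z_b = z_b / np.linalg.norm(z_b, axis=1, keepdims=True)
    # Similarity of every view in z_a against every view in z_b.
    logits = z_a @ z_b.T / temperature  # shape: (batch, batch)
    # The matching pair sits on the diagonal; cross-entropy with
    # target i for row i pulls positives together and pushes the
    # in-batch negatives apart.
    log_softmax = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_softmax))
```

A quick sanity check of the design: when the two views are identical the diagonal similarities dominate and the loss is small, whereas unrelated views give a noticeably larger loss.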
“…We have demonstrated the feasibility of predicting impaired MFR using a relatively simple CNN with only 72,929 trainable parameters. We expect that CNN model performance may be further improved by additional machine learning technologies such as larger CNN architectures (41), self-supervised model pre-training (62), and transformer-based frameworks (40,63). A further possible extension would be to train the CNN using only rest ECG waveforms or reduced leads which could enable use of ambulatory ECG monitoring data.…”
Section: Future Studies
confidence: 99%