2019
DOI: 10.1109/tcyb.2018.2816984
|View full text |Cite
|
Sign up to set email alerts
|

AdaSampling for Positive-Unlabeled and Label Noise Learning With Bioinformatics Applications

Abstract: Class labels are required for supervised learning but may be corrupted or missing in various applications. In binary classification, for example, when only a subset of positive instances is labeled whereas the remaining are unlabeled, positive-unlabeled (PU) learning is required to model from both positive and unlabeled data. Similarly, when class labels are corrupted by mislabeled instances, methods are needed for learning in the presence of class label noise (LN). Here we propose adaptive sampling (AdaSampli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
36
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 46 publications
(43 citation statements)
references
References 61 publications
0
36
0
Order By: Relevance
“…To this end, PhosR implements a multi-step kinase-substrate scoring method where first the likelihood of a kinase to regulate a phosphosite is scored by combining both kinase recognition motifs and the dynamic phosphorylation profiles of sites. The combined scores across all kinases are then integrated using an adaptive sampling-based positiveunlabelled learning method (Yang et al, 2018) to prioritise the kinase most likely to regulate a phosphosite (see Methods). The application of the proposed scoring method to the myotube phosphoproteome uncovers potential kinase-substrate pairs ( Figure 3A; row dendrogram) and global relationships between kinases ( Figure 3A; column dendrogram).…”
Section: Global Kinase-substrate Relationship Scoring Of Phosphositesmentioning
confidence: 99%
See 2 more Smart Citations
“…To this end, PhosR implements a multi-step kinase-substrate scoring method where first the likelihood of a kinase to regulate a phosphosite is scored by combining both kinase recognition motifs and the dynamic phosphorylation profiles of sites. The combined scores across all kinases are then integrated using an adaptive sampling-based positiveunlabelled learning method (Yang et al, 2018) to prioritise the kinase most likely to regulate a phosphosite (see Methods). The application of the proposed scoring method to the myotube phosphoproteome uncovers potential kinase-substrate pairs ( Figure 3A; row dendrogram) and global relationships between kinases ( Figure 3A; column dendrogram).…”
Section: Global Kinase-substrate Relationship Scoring Of Phosphositesmentioning
confidence: 99%
“…To identify potential kinases that could be responsible for the phosphorylation change of a phosphorylation site, we implement a multi-step framework that contains two major components including (i) a kinaseSubstrateScore function which scores a given phosphosite using kinase recognition motif and phosphoproteomic dynamics, and (ii) a kinaseSubstratePred function which synthesise the scores generated from (i) for predicting kinase-substrate relationships using an adaptive sampling-based positiveunlabelled learning method (Yang et al, 2018). The kinase-substrate scoring function combines both kinase recognition motif (i.e.…”
Section: Kinase-substrate Prioritisation Of Phosphositesmentioning
confidence: 99%
See 1 more Smart Citation
“…Positive-unlabeled (PU) learning (de Campos et al, 2018;Sansone et al, 2018;Yang et al, 2018) is applied to various situations. In PU learning, a supervised learning-based method is designed to learn a classification model from a positive sample set and an unlabeled dataset from an unknown class.…”
Section: Screening Credible Negative Samplesmentioning
confidence: 99%
“…In PU learning, a supervised learning-based method is designed to learn a classification model from a positive sample set and an unlabeled dataset from an unknown class. Yang et al (2018) designed an adaptive sampling framework with class label noise based on PU learning and introduced two new bioinformatic applications: identifying kinase-substrates and identifying transcription factor target genes. Therefore, PU learning may be one strong way to solve the problem of lacking negative LPIs.…”
Section: Screening Credible Negative Samplesmentioning
confidence: 99%