2011 10th International Conference on Machine Learning and Applications and Workshops 2011
DOI: 10.1109/icmla.2011.154
|View full text |Cite
|
Sign up to set email alerts
|

Using Published Medical Results and Non-homogenous Data in Rule Learning

Abstract: Many factors limit researchers from accessing studies' original data sets. As a result, much medical and healthcare research is based off of systematic reviews and meta-analysis of published results. However, when research involves the use of aggregated data from multiple studies, traditional machine learning-based means of analysis cannot be used. This paper describes diversity of data and results available in published manuscripts, and relates them to a rule learning method that can be applied to build class… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 12 publications
0
7
0
Order By: Relevance
“…Instead, only aggregated summaries describing groups of patients are available. Such aggregated summaries are also one of the most common forms in which results are presented in medical publications [20].…”
Section: B Aggregated Vs Individual Valuesmentioning
confidence: 99%
See 1 more Smart Citation
“…Instead, only aggregated summaries describing groups of patients are available. Such aggregated summaries are also one of the most common forms in which results are presented in medical publications [20].…”
Section: B Aggregated Vs Individual Valuesmentioning
confidence: 99%
“…Aggregated data are the most common way of describing cohorts of patients [17] [19] [20]. Aggregated values of attributes are given in the form of pairs (μ A , σ A ), where A is a measured attribute, and μ A and σ A denote its mean and standard deviation measured over a group for which the aggregation was done.…”
Section: B Aggregated Vs Individual Valuesmentioning
confidence: 99%
“…For example, this problem appears when predicting the proportion of votes for a given candidate (de Freitas & Kück, 2005); correctly predicting how each individual votes is not required, only which candidate will win. Variants of this problem also appear in many other domains, including in consumer marketing (Chen et al, 2006), medicine and other health domains (Hernández-González et al, 2013;Wojtusiak et al, 2011), image processing (de Freitas & Kück, 2005), physical processes (Musicant et al, 2007), fraud detection (Rüping, 2010), manufacturing (Stolpe & Morik, 2011), and voting networks (Fish et al, 2016). This problem may also arise when attempting to correct for differences between training and testing distributions (Du Plessis & Sugiyama, 2014;Saerens et al, 2002).…”
Section: Introductionmentioning
confidence: 99%
“…Sometimes this is due to limits on the measurement process (Hernández-González et al, 2013;de Freitas & Kück, 2005;Musicant et al, 2007;Stolpe & Morik, 2011). At other times, before data sets are released, labels are purposely detached from their instances in order to maintain privacy (Chen et al, 2006;Rüping, 2010;Wojtusiak et al, 2011). Instead, only the proportion of labels are given for a group of sample instances.…”
Section: Introductionmentioning
confidence: 99%
“…Sometimes this is due to limits on the measurement process [7,4,11,17]. At other times, before datasets are released, labels are purposely detached from their instances in order to maintain privacy [3,15,19]. Instead, only the proportion of labels are given for a group of sample instances.…”
Section: Introductionmentioning
confidence: 99%