2018
DOI: 10.1007/s10664-017-9587-0
|View full text |Cite
|
Sign up to set email alerts
|

Finding better active learners for faster literature reviews

Abstract: Literature reviews can be time-consuming and tedious to complete. By cataloging and refactoring three state-of-the-art active learning techniques from evidence-based medicine and legal electronic discovery, this paper finds and implements FASTREAD, a faster technique for studying a large corpus of documents, combining and parametrizing the most efficient active learning algorithms. This paper assesses FASTREAD using datasets generated from existing SE literature reviews (Hall, Wahono, Radjenović, Kitchenham et… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
95
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 65 publications
(105 citation statements)
references
References 53 publications
2
95
0
Order By: Relevance
“…All those aforementioned tactics are built into EMBLEM [114], [116], the active learner used for this work. When reading commits, EMBLEM initially uses uncertainty sampling to fast build a classification model (for bug-fixing or non bug-fixing commit message), then switches to certainty sampling to greedily find bug-fixing commits.…”
Section: Frameworkmentioning
confidence: 99%
See 3 more Smart Citations
“…All those aforementioned tactics are built into EMBLEM [114], [116], the active learner used for this work. When reading commits, EMBLEM initially uses uncertainty sampling to fast build a classification model (for bug-fixing or non bug-fixing commit message), then switches to certainty sampling to greedily find bug-fixing commits.…”
Section: Frameworkmentioning
confidence: 99%
“…values {N 1 = 4000, N 2 = 1, N 3 = 30, N 4 = 95%}). Those decisions where made by Yu et al [114] after exploring 32 different kinds of active learners. They report that, using the above requirements, EMBLEM found more relevant items faster than the previously reported state-of-the-art in incremental text mining retrieval [22], [102].…”
Section: Frameworkmentioning
confidence: 99%
See 2 more Smart Citations
“…Given the information available in automated UI testing, we extract three types of features: • Text feature: the same text mining feature extraction used in the total recall approaches [56,57] Using the foregoing types of features, the proposed framework is described in Algorithm 1 with engineering choices of N 1 , N 2 . N 1 is the batch size of the process.…”
Section: Terminatormentioning
confidence: 99%