2020
DOI: 10.1007/s41019-020-00144-y
|View full text |Cite
|
Sign up to set email alerts
|

Parrot: A Progressive Analysis System on Large Text Collections

Abstract: The size of textual data continues to grow along with the need for timely and cost-effective analysis, while the growth of computation power cannot keep up with the growth of data. The delays when processing huge textual data can negatively impact user activity and insight. This calls for a paradigm shift from blocking fashion to progressive processing. In this paper, we propose a sample-based progressive processing model that focuses on term frequency calculation on text. The model is based on an incremental … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 27 publications
0
2
0
Order By: Relevance
“…In contrast, the machine learning imputation approaches are to train parametric models in machine learning [23], [24] to estimate the missing values, including decision tree models like XGBoost imputation [25], MissFI (MissForest imputation) [11], and Baran [26], and regression models like MICE (multivariate imputation by chained equations) [8] and imputation via individual model [27]. These imputation methods employ the batch gradient descent techniques [28] to calculate the model gradient over the entire dataset.…”
Section: A Existing Imputation Methodsmentioning
confidence: 99%
“…In contrast, the machine learning imputation approaches are to train parametric models in machine learning [23], [24] to estimate the missing values, including decision tree models like XGBoost imputation [25], MissFI (MissForest imputation) [11], and Baran [26], and regression models like MICE (multivariate imputation by chained equations) [8] and imputation via individual model [27]. These imputation methods employ the batch gradient descent techniques [28] to calculate the model gradient over the entire dataset.…”
Section: A Existing Imputation Methodsmentioning
confidence: 99%
“…In contrast, the parametric imputation solution is to train a parametric model [44,47] (i.e., machine learning one or deep learning one) to learn the data distribution from the observed data. This solution employs the gradient descent techniques to estimate missing values.…”
Section: Background 21 Existing Imputation Methodsmentioning
confidence: 99%