2002
DOI: 10.1007/3-540-36127-8_20
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Text Summarization Using a Machine Learning Approach

Abstract: Abstract. In this paper we address the automatic summarization task. Recent research works on extractive-summary generation employ some heuristics, but few works indicate how to select the relevant features. We will present a summarization procedure based on the application of trainable Machine Learning algorithms which employs a set of features extracted directly from the original text. These features are of two kinds: statistical -based on the frequency of some elements in the text; and linguistic -extracted… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
98
0
5

Year Published

2004
2004
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 148 publications
(103 citation statements)
references
References 12 publications
0
98
0
5
Order By: Relevance
“…Machine Learning (ML) approach can be applied if we have a set of training document and their corresponding summary extracts (Neto et al, 2002). The objective of machine learning can be closely related to a classification problem, i.e., to learn from a training model in order to determine the appropriate class where an element belongs to.…”
Section: Machine Learning Approachmentioning
confidence: 99%
See 1 more Smart Citation
“…Machine Learning (ML) approach can be applied if we have a set of training document and their corresponding summary extracts (Neto et al, 2002). The objective of machine learning can be closely related to a classification problem, i.e., to learn from a training model in order to determine the appropriate class where an element belongs to.…”
Section: Machine Learning Approachmentioning
confidence: 99%
“…Aone et al (1999) include feature like tf-idf using noun words and named entities, where they used the corpus consisting of news documents for their experiments. Another extensive investigation using the similar framework was carried by Neto et al (2002). The authors employ a large variety of features, including both statistical and linguistic features.…”
Section: A Naive Bayesmentioning
confidence: 99%
“…The Classification System was proposed by Larocca Neto et al [10] and uses a ML approach to determine relevant segments to extract from source texts. Actually, it is based on a Naïve Bayes classifier.…”
Section: The Classsumm Systemmentioning
confidence: 99%
“…ClassSumm was evaluated on a TIPSTER corpus of 100 news stories for training, and two test procedures, namely, one that has used 100 automatic summaries and another that has used 30 manual extracts [10], in which it outperforms the "from-top" -those from the beginning of the source text, and random order baselines.…”
Section: The Classsumm Systemmentioning
confidence: 99%
“…Only a few have tried using machine learning to accomplish this difficult task (Lin 1999;Aone et al 1999;Neto et al 2002). Most research falls into combining statistical methods with linguistic analysis.…”
Section: Introductionmentioning
confidence: 99%