2012
DOI: 10.1007/978-3-642-28885-2_9

Improving Portuguese Term Extraction

Cited by 4 publications
(13 citation statements)
References 7 publications
“…For this purpose, the corpus undergoes a pre-processing step, which usually involves the identification of tokens, removal of stopwords, and the representation of the texts in tables. In these tables, each row represents a document (d_i) and each column represents an n-gram of document (n_j), where cell d_i n_j may be filled with some measure, for instance, the absolute frequency of n-gram n_j in document d_i.…”
Section: The Statistical Approach | Citation type: mentioning
confidence: 99%
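
The table described in the statement above (documents as rows, n-grams as columns, absolute frequencies as cell values) can be sketched as follows. This is a minimal illustration, not the citing paper's implementation: the tokenizer, the toy stopword list `STOPWORDS`, and the function names are all assumptions introduced here.

```python
import re
from collections import Counter

# Hypothetical toy stopword list; a real pipeline would use a full
# Portuguese (or English) list.
STOPWORDS = {"the", "of", "a", "in", "and"}

def tokenize(text):
    """Lowercase the text and return its runs of letters (Unicode-aware)."""
    return re.findall(r"[^\W\d_]+", text.lower())

def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list, joined by spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def build_table(docs, n=1):
    """Return (columns, rows): rows[i][j] is the absolute frequency
    of n-gram columns[j] in document docs[i]."""
    counts = []
    for doc in docs:
        tokens = [t for t in tokenize(doc) if t not in STOPWORDS]
        counts.append(Counter(ngrams(tokens, n)))
    columns = sorted(set().union(*counts))
    rows = [[c[col] for col in columns] for c in counts]
    return columns, rows
```

Calling `build_table(docs, n=2)` would produce the same layout with bigrams as columns. Stopwords are removed before n-grams are formed here, which is one possible ordering; the statement does not say which one the authors use.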
“…Among the definitions available in the literature, we highlight the definition of Witten et al. [35] since it avoids that the tf-idf value drops to 0 if a candidate occurs in all documents of a corpus, as observed in Equation 10.…”
Section: (A) Log Likelihood Ratio (LL) | Citation type: mentioning
confidence: 99%
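
The effect mentioned in the statement above can be illustrated with a short sketch. With the plain inverse document frequency log(N / df), a candidate that occurs in all N documents has df = N, so the idf, and with it the tf-idf value, is 0. Neither Equation 10 nor the definition from [35] is reproduced in the statement, so `idf_smoothed` below is only one common smoothing, chosen here as an assumption, and is not claimed to be the cited definition. All function names are hypothetical.

```python
import math

def idf_plain(n_docs, doc_freq):
    """Plain idf: log(N / df). Equals 0 when df == N."""
    return math.log(n_docs / doc_freq)

def idf_smoothed(n_docs, doc_freq):
    """Assumed smoothing: log(1 + N / df). Stays positive when df == N.
    Illustrative only; not necessarily the definition in [35]."""
    return math.log(1 + n_docs / doc_freq)

def tf_idf(tf, n_docs, doc_freq, idf=idf_plain):
    """Term frequency weighted by the chosen idf function."""
    return tf * idf(n_docs, doc_freq)
```

For a candidate with frequency 5 that appears in all 4 documents of a corpus, `tf_idf(5, 4, 4)` is 0 under the plain definition, while `tf_idf(5, 4, 4, idf=idf_smoothed)` stays positive.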