2011 Sixth International Conference on Digital Information Management 2011
DOI: 10.1109/icdim.2011.6093315
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of stop word lists in text retrieval using Latent Semantic Indexing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0
1

Year Published

2013
2013
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(11 citation statements)
references
References 10 publications
0
10
0
1
Order By: Relevance
“…Examples are articles (e.g., a, an, the), prepositions (e.g., at, by, in, to, from, with) and conjunctions (e.g., and, but, as, because) [13]. As mentioned by Zaman et al [14], removal of stop words plays an important role in different text processing domains and it can be used to get language relations [15]. This was attested by Shin et al [16], as they mentioned that stop word removal improves the quality of the indexed terms.…”
Section: Review Of Related Literaturesmentioning
confidence: 99%
“…Examples are articles (e.g., a, an, the), prepositions (e.g., at, by, in, to, from, with) and conjunctions (e.g., and, but, as, because) [13]. As mentioned by Zaman et al [14], removal of stop words plays an important role in different text processing domains and it can be used to get language relations [15]. This was attested by Shin et al [16], as they mentioned that stop word removal improves the quality of the indexed terms.…”
Section: Review Of Related Literaturesmentioning
confidence: 99%
“…Table (1) shows in detail the statistical distribution of words in the data sets ( L is the average length of the document and L the standard deviation of document length). We used the Glasgow list [8] as a stop words list in our experiments. This list is widely used as English standard stop word; it covers a large number (351 stop words).…”
Section: Pretreatmentmentioning
confidence: 99%
“…Filtering stopwords is an established technique in Information Retrieval (IR). In many domains stopwords reduce noise and increase the efficiency of an inverted index [6,21]. In this paper we examine the use of stopwords, namely filtering out terms from the scoring decision, in neural re-ranking models.…”
Section: Introductionmentioning
confidence: 99%
“…Term importance and stopwords have a long history in IR: from the collection occurrence based Inverse Document Frequency (IDF) [17], the theory of retrievability [1,13], to studies on the impact of stopwords in retrieval and text processing [6,21], and visual inspection tools of neural re-ranking models [10]. Term importance in neural models has been studied in the context of query attention [7,19] and IDF weighting [11], or BERT-based importance measures for indexing [4].…”
Section: Introductionmentioning
confidence: 99%