2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07) 2007
DOI: 10.1109/hicss.2007.213
|View full text |Cite
|
Sign up to set email alerts
|

Essential Dimensions of Latent Semantic Indexing (LSI)

Abstract: Latent Semantic Indexing (LSI) is commonly used to match queries to documents in information retrieval information. We then test this model by developing a modified version of LSI that captures this information, Essential Dimensions of LSI (EDLSI). EDLSI significantly improves retrieval performance on corpora that previously did not benefit from LSI, and offers improved runtime performance when compared with traditional LSI.Traditional LSI requires the use of a dimensionality reduction parameter which must be … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0
1

Year Published

2008
2008
2015
2015

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 38 publications
(23 citation statements)
references
References 24 publications
0
22
0
1
Order By: Relevance
“…Prosesnya dengan menghitung kemiripan dua buah vektor, yaitu antara vektor dari corpus dan vektor dari query (Kontostathis 2007 …”
Section: Vector Space Model (Vsm)unclassified
“…Prosesnya dengan menghitung kemiripan dua buah vektor, yaitu antara vektor dari corpus dan vektor dari query (Kontostathis 2007 …”
Section: Vector Space Model (Vsm)unclassified
“…The query intersects the TDM at the first and last documents, and as mentioned before the TDM is diagonal and therefore the S matrix is also diagonal, and so removing any elements from the diagonal values results in removing the same diagonal values in the original TDM. Thus, when the SVD is applied at the range of k−values (1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14), only the first document is returned, because at this range of values the last document, which is document number 15, is ignored. It is important to note that, applying the LSI at k − value 15, which means no dimension reduction occurs, returns the two documents, the first and the last documents in the TDM, which have a cosine value of 0.7071.…”
Section: Ii-c the Tdm Diagonalmentioning
confidence: 99%
“…Choosing an optimal dimensionality reduction parameter (k − value) is very important and remains elusive. Traditionally, the optimal k −value has been chosen by running a set of queries with known relevant document sets for multiple values of k [5]. The k − value that returns the best results is chosen as the optimal k−value for each collection.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, in [6], Hoenkamp shows how the technique underlying LSI is just one example of a unitary operator. And the use of the Haar wavelet transform (HWT), as an alternative that shares this unitary property at much reduced computational cost, has been suggested.One of the most recent works has emphasized on dimension reduction in the LSI system [7]. Other researchers have used LSI in field of image IR [8] [9].…”
Section: Tareq Jabermentioning
confidence: 99%