1999
DOI: 10.1137/s0036144598347035
|View full text |Cite
|
Sign up to set email alerts
|

Matrices, Vector Spaces, and Information Retrieval

Abstract: Abstract. The evolution of digital libraries and the Internet has dramatically transformed the processing, storage, and retrieval of information. Efforts to digitize text, images, video, and audio now consume a substantial portion of both academic and industrial activity. Even when there is no shortage of textual materials on a particular topic, procedures for indexing or extracting the knowledge or conceptual information contained in them can be lacking. Recently developed information retrieval technologies a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
312
0
22

Year Published

2004
2004
2018
2018

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 531 publications
(351 citation statements)
references
References 37 publications
1
312
0
22
Order By: Relevance
“…The Vector Space Model (VSM) is an algebraic model used for information filtering, information retrieval, indexing and relevance ranking (Berry, Drmac andElizabeth 1999, Polyvyanyy andKuropka 2007 …”
Section: Vector Space Model (Vsm)mentioning
confidence: 99%
“…The Vector Space Model (VSM) is an algebraic model used for information filtering, information retrieval, indexing and relevance ranking (Berry, Drmac andElizabeth 1999, Polyvyanyy andKuropka 2007 …”
Section: Vector Space Model (Vsm)mentioning
confidence: 99%
“…Latent semantic indexing (LSI) [3][4][5], also called latent semantic analysis (LSA) is an automatic indexing method based on the semantics of documents, which attempts to overcome the two main problems that have the traditional indexing schemes of lexical coincidences: polysemy and synonymy. The first has to do with a word that can have multiple meanings, and therefore, the words of a query may not coincide in meaning with those of the documents; the second means that several terms can have the same meaning and hence the words used in queries can match nonrelevant documents.…”
Section: Latent Semantic Indexingmentioning
confidence: 99%
“…For each vector The correlation is based on the distances among vectors. Seurat uses a cosine distance metric, which is a common similarity measure between binary vectors [9,10]. We define the distance …”
Section: Feature Vector Spacementioning
confidence: 99%