1997
DOI: 10.1177/016555159702300404
|View full text |Cite
|
Sign up to set email alerts
|

Database tomography for information retrieval

Abstract: Database tomography is an information extraction and analysis system which operates on textual databases. Its primary use to date has been to identify pervasive technical thrusts and themes, and the interrelationships among these themes and sub-themes, which are intrinsic to large textual databases. Its two main algorithmic components are multiword phrase frequency analysis and phrase proximity analysis. This paper shows how database tomography can be used to enhance information retrieval from large textual da… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
58
0
4

Year Published

1999
1999
2009
2009

Publication Types

Select...
7

Relationship

5
2

Authors

Journals

citations
Cited by 87 publications
(63 citation statements)
references
References 18 publications
0
58
0
4
Order By: Relevance
“…The parent article to the present Appendix (Kostoff, 1997a) focused on the use of computational linguistics imbedded in an iterative relevance feedback procedure. The final product is a query that will retrieve documents with two aggregate characteristics; the maximum number of relevant documents will be retrieved, and the ratio of relevant to non-relevant documents will be very large; i.e., the signalto-noise ratio will be large.…”
Section: Ii-g Definition Of Quality In Information Retrieval Contextmentioning
confidence: 99%
See 3 more Smart Citations
“…The parent article to the present Appendix (Kostoff, 1997a) focused on the use of computational linguistics imbedded in an iterative relevance feedback procedure. The final product is a query that will retrieve documents with two aggregate characteristics; the maximum number of relevant documents will be retrieved, and the ratio of relevant to non-relevant documents will be very large; i.e., the signalto-noise ratio will be large.…”
Section: Ii-g Definition Of Quality In Information Retrieval Contextmentioning
confidence: 99%
“…Three iterative steps were required; each step required the technical expert(s) to read many hundreds of the retrieved records in order to identify those that were relevant and non-relevant . Then, computational linguistics analyses (Kostoff, 1997a) were performed on both the relevant and non-relevant records to identify phrase patterns and relationships characteristic of the relevant records and the non-relevant records. Substantial time and judgement were required to select the appropriate phrases unique to the relevant records and the non-relevant records, and then modify the query accordingly using the key phrases identified.…”
Section: Ii-g Definition Of Quality In Information Retrieval Contextmentioning
confidence: 99%
See 2 more Smart Citations
“…For the present study, the SCI database (including both the Science Citation Index and the Social Science Citation Index) was used. The approach used for query development was the first author's iterative relevance feedback concept of Simulated Nucleation [Kostoff et al, 1997].…”
Section: Database Generationmentioning
confidence: 99%