2003
DOI: 10.1007/978-3-540-45115-0_6
|View full text |Cite
|
Sign up to set email alerts
|

Reducing Information Variation in Text

Abstract: Abstract. We discuss the nature and the scope of linguistic (morphological, syntactic and semantic) variation of terms and its impact on two information retrieval tasks: term acquisition and automatic indexing. A review of natural language processing techniques existing in these two areas is done, along with an in-depth presentation of FASTR, a corpus processor for the recognition, normalization, and acquisition of multi-word terms.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0
2

Year Published

2005
2005
2018
2018

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(8 citation statements)
references
References 82 publications
0
6
0
2
Order By: Relevance
“…Certainly, this is not an easy task. On the one hand, snippets provide (a) localized contextual paragraphs that are highly related to the query, (b) these localized contextual paragraphs express ideas and concepts by means of different paraphrases, which consist primarily of morphological, semantical, orthographical and syntactical variations of these ideas and concepts (Savary and Jacquemin, 2000), which makes the identification of promising answer candidates easier. On the other hand, search engines insert intentional breaks in snippets, in order to show relations among words relevant to the query that are separated by a large span of text.…”
Section: How Are Inventor and Invention Related? What Is The Relationmentioning
confidence: 99%
“…Certainly, this is not an easy task. On the one hand, snippets provide (a) localized contextual paragraphs that are highly related to the query, (b) these localized contextual paragraphs express ideas and concepts by means of different paraphrases, which consist primarily of morphological, semantical, orthographical and syntactical variations of these ideas and concepts (Savary and Jacquemin, 2000), which makes the identification of promising answer candidates easier. On the other hand, search engines insert intentional breaks in snippets, in order to show relations among words relevant to the query that are separated by a large span of text.…”
Section: How Are Inventor and Invention Related? What Is The Relationmentioning
confidence: 99%
“…Conflations. We define a conflation [2] to be a syntactic paraphrase of a reference to a concept. [knife grinding] can appear in a document as the conflation "grind the knives" and [filter cleaning] can be conflated to "filters are cleaned".…”
Section: Noise Reductionmentioning
confidence: 99%
“…VMWEs have been the focus of much attention, both in linguistics and in natural language processing [11,1,3,15]. From a linguistic point of view, they present restricted variability patterns, licensing phenomena such as passivization, pronominalization of components, reordering, and free PP-movement depending on the VMWE category [14,10,17,8]. Moreover, verbs (and VMWEs) tend to have rich morphological inflection paradigms, and allow many (but not all) syntactic changes [6,5].…”
Section: Introductionmentioning
confidence: 99%