Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing 2017
DOI: 10.18653/v1/d17-1123
|View full text |Cite
|
Sign up to set email alerts
|

A Short Survey on Taxonomy Learning from Text Corpora: Issues, Resources and Recent Advances

Abstract: A taxonomy is a semantic hierarchy, consisting of concepts linked by is-a relations. While a large number of taxonomies have been constructed from human-compiled resources (e.g., Wikipedia), learning taxonomies from text corpora has received a growing interest and is essential for longtailed and domain-specific knowledge acquisition. In this paper, we overview recent advances on taxonomy construction from free texts, reorganizing relevant subtasks into a complete framework. We also overview resources for evalu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
41
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 54 publications
(41 citation statements)
references
References 79 publications
0
41
0
Order By: Relevance
“…We generate weak supervision for the hypernymy inference model from the corpus D of the input text-rich HIN. As a pioneering method, the Hearst pattern [13] has been shown to have decent precision [22,53,57]. We use this method to extract a list S =…”
Section: Weak Supervision Acquisitionmentioning
confidence: 99%
See 1 more Smart Citation
“…We generate weak supervision for the hypernymy inference model from the corpus D of the input text-rich HIN. As a pioneering method, the Hearst pattern [13] has been shown to have decent precision [22,53,57]. We use this method to extract a list S =…”
Section: Weak Supervision Acquisitionmentioning
confidence: 99%
“…Distributional Method for Hypernymy Discovery. Distributional methods constitute one major line of research for hypernymy discovery [50,53] and can be adapted to hypernymy discovery from network data. Early studies proposed symmetric distributional measures for hypernymy discovery that only capture relevance between terms [20].…”
Section: Case Study: Taxonomy Constructionmentioning
confidence: 99%
“…Traditionally, identifying hypernymic relations from text corpora has been addressed with two main approaches: pattern-based and distributional (Wang et al, 2017). Pattern-based (path-based) methods, which provide higher precision at the price of lower coverage, exploit the co-occurrence of a hyponym and its hypernym in a textual corpus (Hearst, 1992;Navigli and Velardi, 2010;Boella and Di Caro, 2013;Flati et al, 2016;Gupta et al, 2016;Pavlick and Pasca, 2017).…”
Section: Related Workmentioning
confidence: 99%
“…Evaluating the quality of an entire taxonomy is challenging due to the existence of multiple aspects that should be considered and the difficulty of obtaining gold standard [43]. Following [5,6,20], we use Ancestor -F 1 and Edдe-F 1 for taxonomy evaluation in this study.…”
Section: Evaluation Metricsmentioning
confidence: 99%
“…Existing methods mostly build taxonomies based on "is-A" relations (e.g., a "panda" is a "mammal" and a "manmal" is an "animal") [42,43,48] by first leveraging pattern-based or distributional methods to extract hypernym-hyponym term pairs and then organizing them into a tree-structured hierarchy. However, such hierarchies cannot satisfy many real-world needs due to its (1) inflexible semantics: many applications may need hierarchies carrying more flexible semantics such as "city-state-country" in a location taxonomy; and (2) limited applicability: the "universal" taxonomy so constructed is unlikely to fit diverse and user-specific application tasks.…”
Section: Introductionmentioning
confidence: 99%