Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2016
DOI: 10.18653/v1/p16-2097
|View full text |Cite
|
Sign up to set email alerts
|

Using Sequence Similarity Networks to Identify Partial Cognates in Multilingual Wordlists

Abstract: Increasing amounts of digital data in historical linguistics necessitate the development of automatic methods for the detection of cognate words across languages. Recently developed methods work well on language families with moderate time depths, but they are not capable of identifying cognate morphemes in words which are only partially related. Partial cognacy, however, is a frequently recurring phenomenon, especially in language families with productive derivational morphology. This paper presents a pilot a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
46
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 46 publications
(46 citation statements)
references
References 30 publications
0
46
0
Order By: Relevance
“…Our collection was taken from six major sources (Greenhill et al, 2008;Dunn, 2012;List, 2014b;List et al, 2016b;Mennecier et al, 2016) covers datasets ranging between 100 and 210 concepts translated into 5 to 100 languages from 13 different language families. Modifications introduced in the process of preparing the datasets included (a) the correction of errata (e.g.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Our collection was taken from six major sources (Greenhill et al, 2008;Dunn, 2012;List, 2014b;List et al, 2016b;Mennecier et al, 2016) covers datasets ranging between 100 and 210 concepts translated into 5 to 100 languages from 13 different language families. Modifications introduced in the process of preparing the datasets included (a) the correction of errata (e.g.…”
Section: Methodsmentioning
confidence: 99%
“…We also used some auxiliary features from (Jäger and Sofroniev, 2016), which are derived from string similarities. For the clustering subtask (2), we followed List et al (2016b) and List et al (2017) in using the Infomap algorithm (Rosvall and Bergstrom, 2008).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…List and Moran (2013) propose an approach based on sound class alignments and an average score clustering algorithm. List et al (2016) extend the approach to include partial cognates within word lists.…”
Section: Related Workmentioning
confidence: 99%
“…The word similarity matrix was converted into a word distance matrix using the following transformation: (1 + exp(x)) −1 where, x is the PMI score between two words. We use the InfoMap clustering algorithm (List et al, 2016) for the purpose of identifying cognate clusters.…”
Section: Phylogenetic Approaches 321 Automatic Cognate Detectionmentioning
confidence: 99%