2017
DOI: 10.1371/journal.pone.0170046
|View full text |Cite
|
Sign up to set email alerts
|

The Potential of Automatic Word Comparison for Historical Linguistics

Abstract: The amount of data from languages spoken all over the world is rapidly increasing. Traditional manual methods in historical linguistics need to face the challenges brought by this influx of data. Automatic approaches to word comparison could provide invaluable help to pre-analyze data which can be later enhanced by experts. In this way, computational approaches can take care of the repetitive and schematic tasks leaving experts to concentrate on answering interesting questions. Here we test the potential of au… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
91
0
2

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
3
2

Relationship

5
4

Authors

Journals

citations
Cited by 74 publications
(94 citation statements)
references
References 32 publications
1
91
0
2
Order By: Relevance
“…We also used some auxiliary features from (Jäger and Sofroniev, 2016), which are derived from string similarities. For the clustering subtask (2), we followed List et al (2016b) and List et al (2017) in using the Infomap algorithm (Rosvall and Bergstrom, 2008).…”
Section: Methodsmentioning
confidence: 99%
“…We also used some auxiliary features from (Jäger and Sofroniev, 2016), which are derived from string similarities. For the clustering subtask (2), we followed List et al (2016b) and List et al (2017) in using the Infomap algorithm (Rosvall and Bergstrom, 2008).…”
Section: Methodsmentioning
confidence: 99%
“…Unfortunately, available tools are not up to the task. Computational methods for automatic cognate detection, which could be used to pre-parse the data for the linguists, usually assume that words are morphologically simple (Steiner et al 2011;List et al 2017) and automatic partial cognate detection is still in its infancy .…”
Section: Partial Cognate Annotationmentioning
confidence: 99%
“…Markov Clustering is very popular in biology and was shown to outperform the popular Affinity Propagation algorithm (Frey and Dueck 2007) in the task of homolog detection in biology (Vlasblom and Wodak 2009). As a third method, we follow List et al (2016b) in testing Infomap (Rosvall and Bergstrom 2008), a method that was originally designed to detect communities in complex networks. Communities are groups that share more links with each other than outside the group (Newman and Girvan 2004).…”
Section: Network Partitioningmentioning
confidence: 99%
“…Infomap uses random walks to find the best partition of a network into communities. Infomap is not a classical partitioning algorithm, and we do not know of any studies which tested its suitability for the task of homolog detection in evolutionary biology, but according to List et al (2016b), Infomap shows a better performance than UPGMA in automatic cognate detection.…”
Section: Network Partitioningmentioning
confidence: 99%