Challenging the Myth of Monolingual Corpora 2017
DOI: 10.1163/9789004276697_010
|View full text |Cite
|
Sign up to set email alerts
|

Semi-automatic Discovery of Multilingual Elements in English Historical Corpora: Methods and Challenges

Abstract: One of the main obstacles standing in the way of large-scale studies of multilingual practices has been the difficulty of discovering secondary, i.e., foreign or minority language words by any other than manual means (see Pahta 2004;Nurmi and Pahta 2012). Although current computational methods for identifying the primary language of a monolingual text are robust (see, e.g., Alex, Dubey and Keller 2007), their accuracy diminishes significantly when the task is to identify short chunks of words and phrases in ot… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2020
2020
2020
2020

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 9 publications
0
0
0
Order By: Relevance