Studies in the History of the English Language VII 2016
DOI: 10.1515/9783110494235-007
|View full text |Cite
|
Sign up to set email alerts
|

The effect of representativeness and size in historical corpora: An empirical study of changes in lexical frequency

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 0 publications
0
0
0
Order By: Relevance
“…In corpus design, representativeness of a language or community-language practice (however bounded) is a complex yet essential consideration in ensuring the integrity of analysis (Biber, 1993;Davies & Chapman, 2016;Sinclair, 2004). Similarly, as learners interacting with corpora occupies the core of any DDL activity, the question of what data to use in DDL is a primary concern.…”
Section: Selecting Corpora For Corpus-based Language Teachingmentioning
confidence: 99%
“…In corpus design, representativeness of a language or community-language practice (however bounded) is a complex yet essential consideration in ensuring the integrity of analysis (Biber, 1993;Davies & Chapman, 2016;Sinclair, 2004). Similarly, as learners interacting with corpora occupies the core of any DDL activity, the question of what data to use in DDL is a primary concern.…”
Section: Selecting Corpora For Corpus-based Language Teachingmentioning
confidence: 99%
“…The search engine is based on eight million English books, i.e., approximately 20% of the entire Google Books collection, which comprises 40 million scanned books calculated at more than two and a half trillion words. 35 Five million of those books allowed the American linguist Mark Davies to create a user interface to Google Books covering two corpora: Google Books American Corpus (155 billion words) and Google Books British Corpus (34 billion words) (see, e.g., Davies & Chapman 2016). The data provided below are derived from 32 This is impossible in the case of Google Books, which provides access to a restricted number of websites.…”
Section: Diachronic Analysis Of Variant Spellingsmentioning
confidence: 99%