“…The issues related to normalization and annotation are equally applicable to the use of corpora in historical linguistics, sociolinguistics, dialectology, and, in a somewhat different way, language typology. In historical linguistics, token normalization (Azawi, Afzal, & Breuel; Bollmann, Dipper, & Petran; Bollmann, Petran, & Dipper; Jurish), sentence segmentation (Petran), and extensions of POS tagsets (Dipper et al.) are actively discussed, which should support fruitful cross‐disciplinary insight for the analysis of learner corpora.…”