We present a cross-constructional approach to the history of the genitive alternation and the dative alternation in Late Modern English (AD 1650 to AD 1999), drawing on richly annotated datasets and modern statistical modeling techniques. We identify cross-constructional similarities in the development of the genitive and the dative alternation over time (mainly with regard to the loosening of the animacy constraint), a development which parallels distributional changes in animacy categories in the corpus material. Theoretically, we transfer the notion of 'probabilistic grammar' to historical data and claim that the corpus models presented reflect past speakers' knowledge about the distribution of genitive and dative variants. The historical data also helps to determine what is constant (and timeless) in the effect of selected factors such as animacy or length, and what is variant.* We express our thanks to the Freiburg Institute for Advanced Studies (FRIAS) for funding the project "Predicting syntax in space and time". We are also grateful to the audiences of the March 2010 workshop on "Probabilistic syntax: Phonetics, diachrony and synchrony" (hosted by the FRIAS) and the November 2010 workshop on "The development of syntactic alternations" (Stanford University) for constructive feedback. We acknowledge helpful comments and suggestions by Ludovic De Cuypere, Beth Levin and Joanna Nykiel, and owe gratitude to Stephanie Shih for advice on coding final sibilancy, and to Katharina Ehret, whose assistance has contributed significantly to the development of our research. Lastly, three anonymous referees provided us with extremely helpful and constructive comments and suggestions. The usual disclaimers apply. The third author is grateful to the University of Paderborn for providing her access to their on-line library resources while working on this article.
This article explores measures, operationalisations and effects of rhythm and weight as two constraints on the variation between the s-genitive and the of-genitive. We base the analysis on interchangeable genitives in the news and letters sections of ARCHER (A Representative Corpus of Historical English Registers), which covers the period between 1650 and 1999. Thus, we are ultimately concerned with the applicability of two factors that have their roots in speech (rhythm: phonology; weight: online processing) to an 'unconventional', written data set with a historical dimension. As for weight, we focus on the comparison of simple single-constituent and more complex multi-constituent measurements. Our notion of rhythm centres on the ideally even distribution of stressed and unstressed syllables. We find that in our data set, both rhythm and weight show theoretically unexpected quadratic effects: rhythmically better-behaved s-genitives are not necessarily preferred over of-genitives, and short constituents exhibit odd weight effects. In conclusion, we argue that while rhythm is only a minor player in our data set, the quadratic quirks it exhibits should inspire further study. Weight, on the other hand, is a crucial factor which, however, likewise comes with measurement and modelling complications.
On the methodological plane, the study contributes to a growing body of literature on the probabilistic nature of the genitive alternation (see, e.g.
This paper is concerned with sketching future directions for corpusbased dialectology. We advocate a holistic approach to the study of geographically conditioned linguistic variability, and we present a suitable methodology, 'corpusbased dialectometry', in exactly this spirit. Specifically, we argue that in order to live up to the potential of the corpus-based method, practitioners need to (i) abandon their exclusive focus on individual linguistic features in favor of the study of feature aggregates, (ii) draw on computationally advanced multivariate analysis techniques (such as multidimensional scaling, cluster analysis, and principal component analysis), and (iii) aid interpretation of empirical results by marshalling state-of-the-art data visualization techniques. To exemplify this line of analysis, we present a case study which explores joint frequency variability of 57 morphosyntax features in 34 dialects all over Great Britain. KEYWORDS: corpus-based dialectology; holistic approach; corpus-based dialectometry; feature aggregates; multivariate analysis; visualization techniques.RESUMO: Este artigo debruça-se sobre o esboço propositivo de futuras direções para a dialetologia baseada em corpus. Defendemos uma abordagem holística para o estudo da variabilidade linguística geograficamente condicionada, e apresentamos uma metodologia adequada para tal -a dialetometria baseada em corpus. Mais especificamente, defendemos que para que se obtenham todos os resultados esperados da metodologia de corpus, pesquisadores devem: (i) abandonar seu foco exclusivo em traços linguísticos individuais em favor do estudo dos agregados de traços, (ii) amparar-se em métodos computacionais avançados de técnicas de análise multivariada (tais como escalagem multidimensional, análise de clusters, e análise de componente principal), e (iii) auxiliar a interpretação de resultados empíricos através da utilização do estado da arte em técnicas de visualização. A fim de exemplificarmos essa linha de análise, apresentamos um estudo de caso que explora a variabilidade da frequência agregada de 57 traços morfossintáticos de 34 dialetos da Grã-Bretanha. PALAVRAS-CHAVE: dialetologia baseada em corpus; abordagem holística; dialetometria baseada em corpus; agregados de traços; análise multivariada; técnicas de visualização.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.