We advance theory formation in cognitive sociolinguistics by exploring the extent to which language users’ probabilistic grammar varies regionally. For this purpose, we investigate the effects of constraints that influence the choice between the two syntactic variants in the well-known dative alternation (
The choice of genitive construction in English is conditioned by numerous semantic, syntactic and phonological factors. The present study explores the influence of these factors across different modalities (speech vs writing) and genres (e.g. press, fiction, etc.), and models the mediating effect of language-external variables on internal cognitive and linguistic factors within the context of a probabilistic grammar of genitive choice. The discussion revolves around debates concerning the driving force(s) behind recent changes in newspaper genitives, concluding that the trend reflects a push toward more economical modes of expression in reportage texts. Curiously, analysis finds few significant interactions with low-level processing-related factors, e.g. possessor frequency and lexical density – a surprising result in light of recent research. However, analysis further reveals significant inter-genre variability among several other crucial factors including possessor animacy and final sibilancy, which are significantly reduced in journalistic prose. These latter findings offer indirect evidence in favor of economization, and offer insight into the connections between external stylistic concerns, specific linguistic practices and internal probabilistic weights associated with specific grammatical constructions.
This study explores variability in particle placement across nine varieties of English around the globe, utilizing data from the International Corpus of English and the Global Corpus of Web-based English. We introduce a quantitative approach for comparative sociolinguistics that integrates linguistic distance metrics and predictive modeling, and use these methods to examine the development of regional patterns in grammatical constraints on particle placement in World Englishes. We find a high degree of uniformity among the conditioning factors influencing particle placement in native varieties (e.g., British, Canadian, and New Zealand English), while English as a second language varieties (e.g., Indian and Singaporean English) exhibit a high degree of dissimilarity with the native varieties and with each other. We attribute the greater heterogeneity among second language varieties to the interaction between general L2 acquisition processes and the varying sociolinguistic contexts of the individual regions. We argue that the similarities in constraint effects represent compelling evidence for the existence of a shared variable grammar and variation among grammatical systems is more appropriately analyzed and interpreted as a continuum rather than multiple distinct grammars.
We sketch a project that marries probabilistic grammar research to scholarship on World Englishes, thus synthesizing two previously rather disjoint lines of research into one unifying project with a coherent focus. This synthesis is hoped to advance usage-based theoretical linguistics by adopting a large-scale comparative and sociolinguistically responsible perspective on grammatical variation. To highlight the descriptive and theoretical benefits of the approach, we present case studies of three syntactic alternations (the particle placement, genitive, and dative alternations) in four varieties of English (British, Canadian, Indian, and Singapore), as represented in the International Corpus of English. We report that the varieties studied share a core probabilistic grammar which is, however, subject to indigenization at various degrees of subtlety, depending on the abstractness of the syntactic patterns studied.
Where a licence is displayed above, please note the terms and conditions of the licence govern your use of this document. When citing, please reference the published version. Take down policy While the University of Birmingham exercises care and attention in making items available there are rare occasions when an item has been uploaded in error or has been deemed to be commercially or otherwise sensitive.
This paper introduces a new resource designed to facilitate the quantitative investigation of syntactic variation in spoken language from a comparative perspective. The datasets comprise homogeneously annotated collections of "interchangeable" (i.e. competing) genitive and dative variants in four varieties of English: American English, British English, Canadian English, and New Zealand English. To showcase the empirical potential of the data source, we present a suggestive analysis that investigates the extent to which the probabilistic grammar of genitive and dative variant choice differs across varieties. The statistical analysis reveals that while there are a number of subtle probabilistic contrasts between the regional varieties under study, there is overall a striking degree of cross-varietal homogeneity. We conclude by outlining directions for future research.
Inspired by work in comparative sociolinguistics and quantitative dialectometry, we sketch a corpus-based method (Variation-Based Distance & Similarity Modeling-VADIS for short) to rigorously quantify the similarity between varieties and dialects as a function of the correspondence of the ways in which language users choose between different ways of saying the same thing. To showcase the potential of the method, we present a case study that investigates three syntactic alternations in some nine international varieties of English. Key findings include that (a) probabilistic grammars are remarkably similar and stable across the varieties under study; (b) in many cases we see a cluster of "native" (a.k.a. Inner Circle) varieties, such as British English, whereas "non-native" (a.k.a. Outer Circle) varieties, such as Indian English, are a more heterogeneous group; and (c) coherence across alternations is less than perfect.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.