We analyze a Big Data set of geo-tagged tweets for a year (Oct. 2013-Oct. 2014) to understand the regional linguistic variation in the U.S. Prior work on regional linguistic variations usually took a long time to collect data and focused on either rural or urban areas. Geo-tagged Twitter data offers an unprecedented database with rich linguistic representation of fine spatiotemporal resolution and continuity. From the one-year Twitter corpus, we extract lexical characteristics for twitter users by summarizing the frequencies of a set of lexical alternations that each user has used. We spatially aggregate and smooth each lexical characteristic to derive county-based linguistic variables, from which orthogonal dimensions are extracted using the principal component analysis (PCA). Finally a regionalization method is used to discover hierarchical dialect regions with using the PCA components. The regionalization results reveal interesting linguistic regional variations in the U.S. The discovered regions not only confirm past research findings in the literature but also provide new insights and a more detailed understanding of very recent linguistic patterns in the U.S.
Abstract. Demographic rates of historical populations have usually been calculated using only data from stayers alone. Can they be extrapolated to the population as a whole? Ruggles has recently pointed out, using both logic and a computer simulation, that stayers experience vital events earlier in life than movers due to migration censorship: those who experience them later in life have often migrated away from the community being studied. We show that stayers do indeed marry and die at younger ages than do movers, using a genealogical database on the American North (1620-1880). These differences are caused, however, both by migration censorship and by genuine differences between the two groups and the places they lived. Therefore changes over time among stayers are not good indicators of changes in the population as a whole because they are affected by changing migration rates. Thus no simple "correction factor" can be extrapolated to estimate the general population; neither stayers (nor movers) constitute a "baseline" or "normal" process: both must be considered together in order to gain an accurate picture of the population as a whole.Kasakoff, A. B. et Adams, J. W. L'influence de la migration sur les ,Sges aux faits d'6tat civil: une critique des reconstitutions de familles en d6mographie historique. European Journal of Population / Revue Europ6erme de D6mographie, 11: 199-241, 1995. R~sum~. Les taux utilis6s en d6mographie historique sont g6n6ralement calcul6s ~t l'aide de donn6es concernant les seuls s6dentaires. Peuvent-ils 8tre extrapol6s pour l'ensemble de la population? Ruggles avait r6cemment indiqu6, en utilisant ~ la fois des arguments logiques et des simulations informatiques, que les s6dentaires connaissent les faits d'6tat civil plus t6t dans leur vie que les migrants, du fait de la sortie d'observation par migration: ceux qui les cormaissent plus tard ont souvent 6migr6 de la communaut6 6tudi6e. Nous montrons ici que les s6dentalres se marient et meurent ~ des ages moins 61ev6s que les migrants, ~ l'aide de donn6es g6n6alogiques d'Am6rique du Nord (1620-1880). Cependant ces diff6rences sont caus6es ~ la fois par sortie d'observation du fait de l'6migration et par des diff6rences entre les deux groupes et entre les lieux off ils vivent. I1 en r6sulte que les changements an cours du temps observ6s parmi les s6dentaires ne sont pas de bons indicateurs des changements de la population dan s son ensemble, car ils sont affect6s par l'6volution des taux de migration. I1 n'y a donc aucun "facteur de correction" simple qui puisse ~tre extrapol6 pour estimer la population darts son ensemble; ni les s6dentaires (ni les migrants) ne constituent un processus "de base" ou "normal": les deux doivent ~tre consid6r6s sirnultan6ment pour obtenir une vue pr6cise de la population dans son ensemble.
This paper describes the migrations of the married men of six New England families from 1620 to 1850 using published genealogies as the data source. Their location at the time of vital events is used to investigate (1) per sistence rates; (2) distances between fathers, sons, and brothers during their life times ; and (3) the extent to which kin were found in the same community. Our data show more mobility than do town studies and reveal the importance of the nuclear family over more distant kin.
BEE KASAKOFF WE HAVE BEEN STUDYING the migrations of the descendants of nine men who came to Massachusetts before I650 and have compiled a computerized database that includes all the people born before I860 in the patrilines. Thus we have what the nine genealogists who studied these families thought was close to a complete list of family members alive in I850. Here we focus on our attempts to find these individuals on the I850 federal census. To facilitate our task, we made up a search list that contained all males alive in I850, but we omitted females known to have married by I850. The search list included both the last known John W. Adams and Alice Bee Kasakoff are professors of anthropology at the University of South Carolina. Their current research, which uses the database described in this article, focuses on changes in migration and family patterns as the American North filled (I620-1880). An earlier version of the article was presented at the November I989 meeting of the Social Science History Association in Washington, DC. The research has been supported by a fellowship from the National Endowment for the Humanities, administered by the Newberry Library; a separate fellowship from the Newberry Library; grants from the National Science Foundation, its Geography and Regional Sciences Program (SEs nos. 8016384 and 850I821), its Anthropology Program (BNS no. 8305214), and its EPSCOR Program, administered through the University of South Carolina; and a Faculty Research and Development Grant from the University of South Carolina. The authors are also grateful to the National Endowment for the Humanities for fellowships to attend a summer seminar in economic history with Stanley Engerman, and to the many students who have collected, coded, and keyed in the census data. Special mention is given to
On retrouve des différences entre les descendants mâles des fermiers en matière de migration, fertilité et richesse dans la tranche agricole du Nord des Etats-Unis entre 1775 et 1875. Elles varient d'après les dimensions des families. Dans les families nombreuses les parents récoltent les bénéfices de leur déplacement avec les enfants, vers des terres nouvelles. Les enfants nés dans des families extrêmement mobiles connurent la réussite d'après le rang à la naissance, les derniers-nés étant les moins lotis. Mais au gré de l'occupation des terres, l'investissement des parents devient l'élément le plus important pour déterminer la fortune des enfants. Dans les petites families on aperçoit un début du modèle moderne qui est le désavantage de l'enfant du rang intermédiare où il connaît moins de réussite que les aînés, les cadets et les enfants uniques.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.