This paper presents a study that sought to identify the dimensions of variation underlying a corpus of Internet texts, using Biber’s (1988) multi-dimensional (MD) analysis framework. The corpus was compiled following the method proposed by Biber (1993), according to which the size of each register subcorpus should be determined based on the linguistic variation across the texts. The corpus was tagged using the Biber Tagger and the features were counted and submitted to a factor analysis, which suggested three factors. The factors were interpreted as three dimensions of variation: involved, interactive discourse versus informational focus; expression of stance: interactional evidentiality; and expression of stance: interactional affect. The amount of register variation captured by the register distinctions on the dimensions ranged from 8.7% to 57.1%. Dimension 1 corroborate the oral/involved versus literate/informational distinction defined in previous MD studies of non-Internet registers, whereas Dimensions 2 and 3 highlight the important role played by stance in social media.
In this paper, I look at four different aspects of metaphor research from a corpus linguistic perspective, namely: (1) the lexicogrammar of metaphors, which refers to the patterning of linguistic metaphor revealed by corpus analysis; (2) metaphor probabilities, which is a facet of metaphor that emerges from frequency-based studies of metaphor; (3) dimensions of metaphor variation, or the search for systematic parameters of variation in metaphor use across different registers; and (4) automated metaphor retrieval, which relates to the development of software to help identify metaphors in corpora. I argue that these four aspects are interrelated, and that advances in one of them can drive changes in the others.
O presente trabalho visa a relatar o desenvolvimento de uma metodologia de identificação de metáforas em corpora eletrônicos. Como exemplo, foi tomado o gênero teleconferências de apresentação de resultados financeiros. A metodologia é do tipo "bottom-up" / "corpus-driven" e se baseia na identificação de palavras com frequência marcante (palavras-chave) e de seus padrões de co-ocorrência, seguido do cálculo de similaridade semântica entre essas palavras. Com isso, chega-se a um conjunto de palavras que são então interpretadas em seu co-texto, por meio de concordâncias.
He earned his PhD at the University of Chicago (1965) specializing in philosophy of science, BD and MA from the University of Chicago (1961) specializing in history of religions and logic, respectively, and BA from Western Reserve University (1957) specializing in history, philosophy, and religion.He has published 26 books and over 115 refereed articles and founded or co-founded 7 scholarly journals.
This paper presents the first entirely linguistic typology of contemporary American television, derived from a
multi-dimensional (MD) analysis of the USTV corpus. The USTV corpus comprises 930 texts from 191 different TV programs, classified into 31
different registers (including nine telecinematic ones: drama series, miniseries, movies, sitcoms, soap operas, general animation,
children’s animation, short-feature animation, and children’s and teens’ shows). The linguistic typology we present in this study is based
on the linguistic characteristics present in the individual programs, with no a priori textual categorizations. A cluster
analysis grouped the individual programs into clusters that shared similar dimensional profiles. The resulting typology comprises nine
different text types – namely Presentation of information, Opinion and discussion, Analysis and debate, Description, Interactive recount,
Engaging demonstration, Playful discourse, Simplified interaction, and Simulated conversation. The paper discusses and illustrates each text
type and considers how telecinematic discourse relates to each of them.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.