Corpus Linguistics

Biber, Douglas; Conrad, Susan; Reppen, Randi

doi:10.1017/cbo9780511804489

Cited by 999 publications

(187 citation statements)

References 0 publications

Supporting

Mentioning

168

Contrasting

Unclassified

Order By: Relevance

“…Will which is supposed to be given the most emphasis in a pedagogic corpus reaches second while can that is ranked third in three major corpora has been overused by standing as the most frequent modal used in the textbook. Indeed, can is well overrepresented throughout Form 1 to 5 textbooks because although it is among the top four used modal auxiliaries, it is well below will and would in terms of frequency occurrence (Leech et al 2009;Biber et al 1998). It is interesting to see that although based on KBSM curriculum modals must, will, may, might and should are the ones that are stipulated to be taught in Form 1, Form 4 and Form 5 textbook, still modal can is used more than any other modals.…”

Section: Summary and Discussionmentioning

confidence: 99%

“…Finally, shall as the lowest frequent modal is lopsided throughout Malaysian textbooks. Although shall has been reported by Biber et al (1998) and Leech et al (2009) to be obsolete in current English, according to Mindt (1995) and Romer (2004a) the prediction meaning of shall (31%) is among one of the most widely used meanings in spoken British English. In the ESL environment, students need to be exposed to the language as much as possible to gain sufficient input and exposure.…”

Section: Summary and Discussionmentioning

confidence: 99%

“…a learner has used" (p. 16). The spoken mini-corpus, however, was compiled because a) there were no ready-made computerized collections of spoken part of Malaysian English textbooks available and it would have been a rather time-consuming to go over each and every dialogue or speech bubble to look for nine modal auxiliary verbs in five textbooks and b) based on the findings of empirical studies on modal auxiliary verbs, different varieties of English and different genres of text-types (spoken vs. written English) plays an important role in the distribution of modal auxiliary verbs (Coates, 1983cited in Kennedy, 1998Biber, Conrad & Reppen, 1998;Mindt, 1995). Altogether, this corpus of spoken-type texts from textbooks has a size of a bit more than 50,000 tokens.…”

Section: Population and Samplingmentioning

confidence: 99%

See 2 more Smart Citations

Non-empirically Based Teaching Materials Can be Positively Misleading: A Case of Modal Auxiliary Verbs in Malaysian English Language Textbooks

Khojasteh¹,

Kafipour²

2012

ELT

View full text Add to dashboard Cite

Using corpus approach, a growing number of researchers blamed textbooks for neglecting important information on the use of grammatical structures in natural English. Likewise, the prescribed Malaysian English textbooks used in schools are reportedly prepared through a process of material development that involves intuition. Hence, a corpus-based study with the population that was sourced from five Malaysian English language textbooks (Forms 1-5) was adopted to identify modal auxiliary verbs' order and ranking in both whole text-types and spoken text-type of these textbooks. The WordSmith Tools 4.0 was used almost entirely to support quantitative and qualitative data analysis in this research. This study has revealed that for almost all modal auxiliaries, there is a discrepancy between frequency order in the textbook corpus and natural English. The findings of this study also show that the currently used pedagogical language in Malaysian textbooks is mainly based on written rather than spoken English.

show abstract

Section: Summary and Discussionmentioning

confidence: 99%

Section: Summary and Discussionmentioning

confidence: 99%

Section: Population and Samplingmentioning

confidence: 99%

See 1 more Smart Citation

Non-empirically Based Teaching Materials Can be Positively Misleading: A Case of Modal Auxiliary Verbs in Malaysian English Language Textbooks

Khojasteh¹,

Kafipour²

2012

ELT

View full text Add to dashboard Cite

show abstract

“…Hunston (2003) holds that it is no exaggeration to say that corpora and the study of corpora have revolutionized the study of language, and of the applications of language over the last few decades, and that the improved accessibility of computers has changed corpus study from a subject for specialists to something that is open to all. According to Biber et al (1998), the essential characteristics that capture the corpus-based approach are the followings: (1) it is empirical, analyzing the actual patterns of use in natural texts; (2) it utilizes a large and principled collection of natural texts, known as a 'corpus', as the basis for analysis; (3) it makes extensive use of computers for analysis, using both automatic and interactive techniques; (4) it depends on both quantitative and qualitative analytical techniques.…”

Section: Literature Reviewmentioning

confidence: 99%

A Corpus-based Contrastive Study of Online News Reports on Economic Crisis ― A Critical Discourse Analysis Perspective

Hai-yan

2015

JLTR

View full text Add to dashboard Cite

show abstract

“…Dentro da abordagem da Linguística de Corpus (doravante LC) (Biber et al, 1998;Berber Sardinha, 2004) com relação à categoria dos pacotes lexicais, encontramos os seguintes trabalhos: (i) Hyland (2008), que examinou um corpus a partir de registros acadêmicos escritos de quatro diferentes disciplinas (Engenharia Elétrica, Biologia, Administração de Empresas e Linguística Aplicada) a fi m de extrair pacotes lexicais baseados numa análise contrastiva das suas formas e funções; (ii) Scott e Tribble (2006) …”

Section: Introductionunclassified

Processamento linguístico-computacional de pacotes lexicais: um estudo de corpus na área de Regulamentação Farmacêutica

Mazza¹

2015

Calidoscópio

View full text Add to dashboard Cite

RESUMO -Este trabalho tem por objetivo demonstrar um aplicativo computacional desenvolvido para a extração de pacotes lexicais de três palavras e apresentar por meio deste as unidades lexicais recorrentes entre documentos de especialidade. O método quantitativo aplicado, em princípio, explora um tipo de texto produzido pelas indústrias do setor farmacêutico, o qual está diretamente relacionado a assuntos regulatórios no âmbito das agências internacionais de vigilância sanitária. No entanto, os procedimentos de análise podem ser adotados para investigar outros aspectos linguísticos dentre a variedade de gêneros e tipos textuais, como também possibilita a identifi cação de termos. O estudo tem como principal enfoque a frequência de ocorrência dos padrões lexicais em corpus autêntico da língua em uso por meio de ferramentas linguístico-computacionais, em particular nas pesquisas voltadas ao estudo da linguagem em contextos empresariais, e busca multiplicar os trabalhos de Douglas Biber com base na combinação de palavras recorrentes em corpora específi cos. O referencial teórico--metodológico baseia-se na Linguística de Corpus, que é capaz de dialogar, especifi camente, com a Linguística Computacional e oferecer meios para o desenvolvimento do aplicativo e ao processamento dos pacotes lexicais. O corpus coletado reúne quinze exemplares do documento escrito na língua inglesa, totalizando cerca de 110 mil palavras, cuja delimitação contempla diferentes localidades do mundo, envolvendo vários autores. Os resultados desvelam a possibilidade de investigação nas divisões internas dos textos mediante o cruzamento entre documentos de uma mesma especialidade.Palavras-chave: pacotes lexicais, corpus de especialidade, ferramenta linguístico-computacional.ABSTRACT -The present paper aims to demonstrate a computational tool developed to extract three-word lexical bundles and show -by working through this -the automatic recognition of recurring lexical items among regulatory documents. In this quantitative analysis a specifi c document prepared by pharmaceutical industries (in which the matter is directed related to the public health protection agencies) is generally examined. Nonetheless, the quantitative data collection methods can also be used to search any other linguistics features within a variety of genres and specifi c type of documents and it allows the linguistics researcher to easily identify which terms fall under a domain of specifi c texts. The study focus their main concern on investigating lexical pattern frequency of language use, particularly across the current context of business, and it seeks to spread Douglas Biber works based on recurrent word combinations that makes use of tools and techniques developed in corpus-based linguistics. As the theoretical framework for this study we primarily draw upon Corpus Linguistics, a theory that is able to connect its concepts over the computational assumptions and design tools for end users and extract the lexical bundles as well. The collected corpus gathers documents i...

show abstract

Corpus Linguistics

Cited by 999 publications

References 0 publications

Non-empirically Based Teaching Materials Can be Positively Misleading: A Case of Modal Auxiliary Verbs in Malaysian English Language Textbooks

Non-empirically Based Teaching Materials Can be Positively Misleading: A Case of Modal Auxiliary Verbs in Malaysian English Language Textbooks

A Corpus-based Contrastive Study of Online News Reports on Economic Crisis ― A Critical Discourse Analysis Perspective

Processamento linguístico-computacional de pacotes lexicais: um estudo de corpus na área de Regulamentação Farmacêutica

Contact Info

Product

Resources

About