Abstract:This study applied systematic meta‐analytic procedures to summarize findings from experimental and quasi‐experimental investigations into the effectiveness of using the tools and techniques of corpus linguistics for second language learning or use, here referred to as data‐driven learning (DDL). Analysis of 64 separate studies representing 88 unique samples reporting sufficient data indicated that DDL approaches result in large overall effects for both control/experimental group comparisons (d = 0.95) and for … Show more
“…Data‐driven learning refers to direct applications of corpora in which learners themselves acquire hands‐on experience of using a corpus for learning purposes, often with guided tasks or materials (see Mizumoto & Chujo, , for different types of data‐driven learning). Its overall effectiveness (i.e., resulting in positive outcomes) for language learning has been reported in meta‐analyses (Boulton & Cobb, ; Mizumoto & Chujo, ). In L2 academic writing studies, corpus consultation as a reference source has been found effective, among other reported benefits (Yoon, ), in correcting writing errors (Gaskell & Cobb, ) and raising awareness about the writing conventions of a specific discourse community (Chang, ; Friginal, ), particularly by using a specialized research article corpus (Lee & Swales, ).…”
Section: Rationale For Developing a Support Tool For Research Articlementioning
With advances in information and computer technology, genre-based writing pedagogy has developed greatly in recent years. In order to further this growth in technology-enhanced genre writing pedagogy, the current study developed a data-driven and theory-based practical writing support tool for research articles (RAs). This web-based, innovative tool, powered by the combination of rhetorical moves and lexical bundles, has an auto-complete feature that suggests the most frequent lexical bundles in a move within an RA section. It was developed based on the proof-of-concept of the bundle-move connection approach. Preliminary user feedback was positive 2 overall, and it was found that the writing support tool brought about beneficial effects that genre writing pedagogy explicitly aims to achieve. In light of these findings, the pedagogical implications of the developed tool are discussed, with particular focus on the potential role that it could play in the teaching and learning of technology-enhanced genre writing.
“…Data‐driven learning refers to direct applications of corpora in which learners themselves acquire hands‐on experience of using a corpus for learning purposes, often with guided tasks or materials (see Mizumoto & Chujo, , for different types of data‐driven learning). Its overall effectiveness (i.e., resulting in positive outcomes) for language learning has been reported in meta‐analyses (Boulton & Cobb, ; Mizumoto & Chujo, ). In L2 academic writing studies, corpus consultation as a reference source has been found effective, among other reported benefits (Yoon, ), in correcting writing errors (Gaskell & Cobb, ) and raising awareness about the writing conventions of a specific discourse community (Chang, ; Friginal, ), particularly by using a specialized research article corpus (Lee & Swales, ).…”
Section: Rationale For Developing a Support Tool For Research Articlementioning
With advances in information and computer technology, genre-based writing pedagogy has developed greatly in recent years. In order to further this growth in technology-enhanced genre writing pedagogy, the current study developed a data-driven and theory-based practical writing support tool for research articles (RAs). This web-based, innovative tool, powered by the combination of rhetorical moves and lexical bundles, has an auto-complete feature that suggests the most frequent lexical bundles in a move within an RA section. It was developed based on the proof-of-concept of the bundle-move connection approach. Preliminary user feedback was positive 2 overall, and it was found that the writing support tool brought about beneficial effects that genre writing pedagogy explicitly aims to achieve. In light of these findings, the pedagogical implications of the developed tool are discussed, with particular focus on the potential role that it could play in the teaching and learning of technology-enhanced genre writing.
“…Nowadays, better software is available. Boulton and Cobb (, p. 382) argued that they did not use meta‐regression because that would “mainly [be] suited to continuous [predictor variables].” However, categorical predictors can easily be included in a regression model through dummy coding or other forms of contrast coding. After all, an analysis of variance model is also a regression model with categorical predictors (Field, ).…”
Section: Literature Reviewmentioning
confidence: 99%
“…The only difference is that, as in metaanalysis, the dependent variable is not the measurement originally used in the Goldschneider &DeKeyser, 2001, andLi, 2010). Instead, researchers often study predictor variables by splitting their data set by the levels of their predictor(s) and calculating separate effect sizes for all these subsets (e.g., Boulton & Cobb, 2017;Mackey & Goo, 2007;Montero Perez et al, 2013). Significance can be determined by considering whether the confidence intervals of the effect sizes of the subsets overlap (Mackey & Goo, 2007) or through Q tests (Montero Perez et al, 2013).…”
Section: Meta-analysis and Meta-regression In L2 Researchmentioning
We meta‐analyzed the effectiveness of incidental second language word learning from spoken input. Our sample contained 105 effect sizes from 32 primary studies employing meaning‐focused word‐learning activities with 1,964 participants with typical cognitive functioning. The random‐effects meta‐analysis yielded a mean effect size of g = 1.05, reflecting generally large vocabulary gains from spoken input in meaning‐focused activities. A meta‐regression with three substantive and two methodological predictors also revealed that adult participants outperformed children in terms of word learning and that interactive learning tasks were more effective than noninteractive ones. Furthermore, learning scores were higher when measured with recognition than with recall tests. Methodologically, the use of a no‐input control group seemed to protect against an overestimation of learning effects, evidenced by smaller effect sizes. Finally, whether a pretest–posttest design was used did not influence effect sizes. All data and the analysis script are publicly available.
Open Practices
This article has been awarded an Open Data badge. All data and the analysis script are publicly accessible via the Open Science Framework at https://osf.io/92vfw. Learn more about the Open Practices badges from the Center for Open Science: https://osf.io/tvyxz/wiki.
“…For our analysis, we used the corpus profiling program AntConc to make word lists based on frequency, and to measure coverage for OBEL and Baumann and Culligan's (1995) version of West's (1953, see www.lextutor.ca/freq/lists_ download) General Service List (GSL). To estimate coverage, we created word lists that exclude OBEL and GSL word families from the SEW texts.…”
Simple English Wikipedia is a user-contributed online encyclopedia intended for young readers and readers whose first language is not English. We compiled a corpus of the entirety of Simple English Wikipedia as of June 20th, 2017. We used lexical frequency profiling tools to investigate the vocabulary size needed to comprehend Simple English Wikipedia texts. We hypothesized that if the texts are indeed simple, learners should need to know far fewer than 8000 words. Our findings indicate that the texts are not as simple as the creators of the authoring guidelines intended. We suggest that authors of simplified texts be encouraged to provide plain language explanations of low-frequency technical terms either in-text or in glossary form. We will discuss implications for researching the pedagogical usefulness of the Simple English Wikipedia.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.