“…Whatever the controversy over the web may be, early this century the attention of some corpus linguists was drawn to the Internet and the search engine (Kilgarriff, 2001; Kilgarriff & Grefenstette, 2003; Lüdeling, Evert, & Baroni, 2005; Resnik, Elkiss, Lau, & Taylor, 2005; Resnik & Smith, 2003). Johns (2002) points to "the potential of the Web in defining and supporting a 'worldwide data-driven learning (DDL) community'". The web pages out there may amount to billions of English words, accessible to everyone with an Internet connection, and drawing on them saves the trouble of gathering authentic texts in a machine-readable format, which those who build a DIY corpus with WordSmith Tools 4.0 (corpus software developed by M. Scott, published by Oxford University Press, 2003) must go through. Strictly speaking, a search engine is not a corpus.…”