“…Historical text presents numerous challenges for contemporary natural language processing techniques. In particular, the absence of consistent orthographic conventions in historical text presents difficulties for any system requiring reference to a fixed lexicon accessed by orthographic form, such as document indexing systems (Sokirko, 2003; Cafarella and Cutting, 2004), part-of-speech taggers (DeRose, 1988; Brill, 1992; Schmid, 1994), simple word stemmers (Lovins, 1968; Porter, 1980), or more sophisticated morphological analyzers (Geyken and Hanneforth, 2006; Clematide, 2008).…”