2003
DOI: 10.1007/978-94-010-0201-1_1
|View full text |Cite
|
Sign up to set email alerts
|

The Penn Treebank: An Overview

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
148
0
3

Year Published

2006
2006
2023
2023

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 301 publications
(160 citation statements)
references
References 3 publications
0
148
0
3
Order By: Relevance
“…In English language, totally, eight POS [7] are available and are given as follows: Noun, pronoun, adjective, verb, adverb, article, participles, and auxiliaries. Prediction of the possible occurrence of the next word from an existing entry of a word can also make possible with the help of information about POS in a sentence.…”
Section: Pos Taggingmentioning
confidence: 99%
“…In English language, totally, eight POS [7] are available and are given as follows: Noun, pronoun, adjective, verb, adverb, article, participles, and auxiliaries. Prediction of the possible occurrence of the next word from an existing entry of a word can also make possible with the help of information about POS in a sentence.…”
Section: Pos Taggingmentioning
confidence: 99%
“…It involves recognition, removal of errors and inconsistency to improve the quality of the dataset prior to the process of analysis [9]. The tweets were cleaned of irrelevant data to improve their quality.…”
Section: Data Cleaningmentioning
confidence: 99%
“…ME and OE The Penn-Helsinki Parsed Corpus of Middle English (PPCME2) 1 and the York-TorontoHelsinki Parsed Corpus of Old English Prose (Taylor et al, 2003b, YCOE) use a variant of the PTB annotation schema (Taylor et al, 2003a). YCOE contains the full West Saxon Gospel, but PPCME2 contains only a small fragment of a Wycliffite gospel of John, the ME data is thus complemented with parts of Genesis (G) and Numbers (N).…”
Section: Languages and Corpus Datamentioning
confidence: 99%