2012
DOI: 10.1515/9783110303919

Productivity in Argument Selection

Citations: cited by 60 publications (45 citation statements)
References: 0 publications
“…This assertion neglects a fundamental property of the frequency distribution of words (Baayen, 2001; Baroni, 2008; Zanette & Montemurro, 2005; Zipf, 1935/1965), but also of word sequences (Bannard & Lieven, 2009; Baroni, 2008; Ha, Sicilia-Garcia, Ming, & Smith, 2002), in human languages: its Zipfian nature, which has been observed in each analysed natural language and for all the lengths of texts and corpora from a few thousand words up to several tens of millions. In any text or corpus, 'a few words occur with very high frequency while many words occur but rarely' (Zipf, 1935/1965), and this overrepresentation of rare items is larger for smaller texts and corpora (Baayen, 2001; McEnery & Gabrielatos, 2006; Zeldes, 2013; Zipf, 1935/1965). However, when the same normalized frequency threshold is used in corpora of different sizes, this overrepresentation of rare words and rare sequences in the smaller corpora is not taken into account and a disproportionately large number of word sequences is selected from them.…”
Section: Discussion (mentioning)
confidence: 99%
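
The point about normalized thresholds can be made concrete with a little arithmetic. The sketch below is illustrative only: the threshold of 10 occurrences per million words and the three corpus sizes are hypothetical values, not figures from the cited study. It converts a per-million threshold into the raw count it implies for each corpus.

```python
def raw_threshold(per_million: float, corpus_size: int) -> float:
    """Raw occurrence count implied by a per-million-words threshold."""
    return per_million * corpus_size / 1_000_000


# Hypothetical threshold and corpus sizes, chosen only for illustration.
PER_MILLION = 10
for size in (100_000, 1_000_000, 10_000_000):
    print(f"{size:>10} words: raw count >= {raw_threshold(PER_MILLION, size)}")
```

Under these assumed values, a single occurrence already meets the threshold in the 100,000-word corpus, while 100 occurrences are needed in the 10-million-word corpus. Because rare items are overrepresented in small corpora, the same normalized cutoff admits proportionally more sequences there.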
“…In recent years, the concept has made its way into syntax, notably since the advent of Construction Grammar (among others, Barðdal, 2008; Zeldes, 2012; Perek, 2016). The productivity of a (syntactic) construction consists in the number of different lexemes (i.e.…”
Section: Diversification at the Level of Lexical Types 17 (unclassified)
“…The idea behind using such counts to assess productivity can be understood intuitively if we consider that the attested vocabulary size of a certain process corresponds to how productive it has been up until now. Thus a process with more types has a higher 'realized productivity', in Baayen's terms, than one with fewer types (see also Barðdal 2008; Zeldes 2012). On the other hand, to assess how prone a process is to forming neologisms (regardless of whether it is used often or rarely), we may want to know what the proportion of neologisms is in its output: a process with mostly neologisms is very productive, whereas a repetitive process, with few neologisms, has little 'potential productivity', no matter how large its realized vocabulary so far.…”
Section: Synthetic Compounds and Productivity (mentioning)
confidence: 99%
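
The two measures described in this statement can be sketched on toy data. This is a minimal illustration, not the cited authors' procedure: it assumes that hapax legomena (types seen exactly once) stand in for neologisms, following Baayen's hapax-based estimate of potential productivity (hapaxes divided by tokens), and the word lists are invented.

```python
from collections import Counter


def productivity(tokens):
    """Return (realized, potential) productivity for one process.

    realized  = number of distinct types attested
    potential = hapax legomena / tokens (assumption: hapaxes
                approximate neologisms, as in Baayen's measure)
    """
    if not tokens:
        raise ValueError("need at least one token")
    counts = Counter(tokens)
    n_hapax = sum(1 for c in counts.values() if c == 1)
    return len(counts), n_hapax / len(tokens)


# Invented outputs of two hypothetical word-formation processes.
repetitive = ["teacher"] * 6 + ["writer"] * 3 + ["baker"]
creative = ["teacher", "teacher", "cloud-gazer", "tree-hugger"]

print(productivity(repetitive))  # 3 types, 1 hapax in 10 tokens
print(productivity(creative))    # 3 types, 2 hapaxes in 4 tokens
```

Both toy processes have the same realized productivity (three types), but the second has a higher share of once-only forms, which is the contrast the statement draws between realized and potential productivity.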