2012
DOI: 10.1007/978-3-642-30947-2_15

Informativeness of Inflective Noun Bigrams in Croatian

Citations: cited by 3 publications (4 citation statements)
References: 4 publications
“…PMI is sensitive to low-frequency intensities and lacks fixed bounds (Bouma, 2009; Jurić et al., 2012). Bouma therefore proposed normalized PMI (NPMI) as (Bouma, 2009): $\mathrm{NPMI}(\boldsymbol{i}) = \mathrm{PMI}(\boldsymbol{i}) / (-\log p(i))$, where normalization by $-\log p(i)$ gives lower weight to low-frequency intensity pairs.…”
Section: Experimental Methods (mentioning)
confidence: 99%
“…$p(i) = p(i_A)\,p(i_B)$. PMI is sensitive to low-frequency intensities and lacks fixed bounds (Bouma 2009, Jurić et al. 2012). Bouma therefore proposed normalized PMI (NPMI) as (Bouma 2009)…”
Section: Analysis Of Registration Accuracy (mentioning)
confidence: 99%
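The normalization quoted in the statements above can be illustrated with a minimal sketch. It assumes the standard pairwise form of Bouma's NPMI, where PMI for a pair (x, y) is log(p(x, y) / (p(x) p(y))) and the normalizer is −log p(x, y), with all probabilities estimated from raw counts. The function name `npmi_scores` and the toy observations are hypothetical and are not taken from any of the cited papers.

```python
import math
from collections import Counter


def npmi_scores(pairs):
    """Return {pair: (pmi, npmi)} for a list of observed (x, y) pairs.

    Probabilities are plain relative frequencies (an assumption of this
    sketch; the cited papers may estimate them differently).
    """
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(x for x, _ in pairs)
    right = Counter(y for _, y in pairs)

    scores = {}
    for (x, y), count in joint.items():
        p_xy = count / n
        p_x = left[x] / n
        p_y = right[y] / n
        pmi = math.log(p_xy / (p_x * p_y))
        denom = -math.log(p_xy)
        # When p_xy == 1 the normalizer is 0 and NPMI is undefined;
        # returning 0.0 here is an arbitrary choice made for this sketch.
        npmi = pmi / denom if denom > 0 else 0.0
        scores[(x, y)] = (pmi, npmi)
    return scores


if __name__ == "__main__":
    # Hypothetical toy observations, e.g. (first word, second word) of bigrams.
    observed = [("a", "b")] * 2 + [("c", "d")] * 2
    for pair, (pmi, npmi) in npmi_scores(observed).items():
        print(pair, round(pmi, 3), round(npmi, 3))
```

Unlike PMI, the normalized score stays within [−1, 1]: it is 1 when the two items only ever occur together and 0 when they are independent.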
“…Goldsmith's program Linguistica was used [21], and annotation standards adopted by Croatian linguists were applied [22]. We focused on common (non-name) words because names in Croatian follow the same inflectional paradigm, but with lower freedom in appearance and with some inflectional restrictions when multiword names are considered [23].…”
Section: Grammatical N-gram System (mentioning)
confidence: 99%
“…The same can be extended to n-grams, too. In [23] it was shown that rare 2-grams may have higher informativeness than their more frequent counterparts. If statistical [Ivan Perić worked with Renato Šoić.]…”
Section: B Possible Implications Of the Extreme Frequency Differences To Machine Translations (mentioning)
confidence: 99%