The Brazilian Portuguese Lexicon: An Instrument for Psycholinguistic Research

Estivalet, Gustavo Lopez; Meunier, Fanny

doi:10.1371/journal.pone.0144016

Cited by 13 publications

(8 citation statements)

References 44 publications

(202 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The calculator had two parts: the database and the search engine (algorithm based on Vitevitch & Luce, 2004). The database was built from a comprehensive Brazilian-Portuguese corpus (Estivalet & Meunier, 2015) in five steps.…”

Section: Stimulimentioning

confidence: 99%

“…To ensure that both sets were very similar to actual words from Brazilian-Portuguese, we calculated the mean number of insertions, deletions, and substitutions needed to transform PP+ and PP-words into their closest 20 phonetic neighbors (i.e., Levenshtein Distance; Yarkoni et al, 2008). We used the package vwr (Keuleers, 2013) for R (R Core Team, 2017) and the same Brazilian-Portuguese corpus in these calculations (Estivalet & Meunier, 2015; Table 1). The analysis indicated that both sets were very close to actual Brazilian-Portuguese words (1.26 for PP+, 1.52 for PP-; the smaller the number of operations, the closer the words were to Brazilian-Portuguese) and to each otherdifference of 0.26 mean operations.…”

Section: Stimulimentioning

confidence: 99%

See 1 more Smart Citation

When statistics collide: The use of transitional and phonotactic probability cues to word boundaries

2021

View full text Add to dashboard Cite

Statistical regularities in linguistic input, such as transitional probability and phonotactic probability, have been shown to promote speech segmentation. It remains unclear, however, whether or how the combination of transitional probabilities and subtle phonotactic probabilities influence segmentation. The present study provides a fine-grained investigation of the effects of such combined statistics. Adults (N = 81) were tested in one of two conditions. In the Anchor condition, they heard a continuous stream of words with small differences in phonotactic probabilities. In the Uniform condition, all words had comparable phonotactic probabilities. In both conditions, transitional probability was stronger in words than part-words. Only participants from the Anchor condition preferred words at test, indicating that the combination of transitional probabilities and subtle phonotactic probabilities may facilitate speech segmentation. We discuss the methodological implications of our findings which demonstrate that even small phonotactic variations should be accounted for when investigating statistical speech segmentation.

show abstract

Section: Stimulimentioning

confidence: 99%

Section: Stimulimentioning

confidence: 99%

When statistics collide: The use of transitional and phonotactic probability cues to word boundaries

2021

View full text Add to dashboard Cite

show abstract

“…Kettunen (2014) compares a number of European languages using different complexity metrics, particularly focusing on morphology, and finds that English (0.05) and EP (0.06) (and several other romance languages) are similar in their type–token ratios, as well as several other measures. Again, this analysis does not include BP, but other work provides type–token ratios for BP that are much lower (0.01) (Estivalet & Meunier, 2015), and given the large disparity (none of the languages included in the Kettunen study had values lower than 0.05), is likely to be explained by a different methodology for determining type–token ratio.…”

Section: Discussionmentioning

confidence: 99%

“…It is not clear why this might be, and while there has been significant work on the salience of geographic features for wayfinding (e.g., Caduff & Timpf, 2008; Quesnot & Roche, 2015; Raubal & Winter, 2002), we are not aware of any cross‐linguistic studies that address the issue. Vocabulary studies show that the most frequently used word class in English is nouns (Kang & Yu, 2011; Tardif, Gelman, & Xu, 1999), whereas in BP verbs are used most frequently (Estivalet & Meunier, 2015), which might make English speakers more attuned to objects in their environment, but more studies are required to determine the cause of this variation in relata and locata selection frequency.…”

Section: Discussionmentioning

confidence: 99%

A cross‐linguistic study of spatial location descriptions in New Zealand English and Brazilian Portuguese natural language

Fagundes

Stock

Delazari

2021

Transactions in GIS

View full text Add to dashboard Cite

Humans use spatial language on a daily basis, to describe locations, give directions, and ask for information about places. Better understanding of spatial language can assist in developing natural language interfaces and querying tools for GIS and web mapping. However, most previous studies focus on artificial, indoor situations. We conduct cross‐linguistic experiments to compare natural language relative location descriptions (e.g., the house beside the river) in New Zealand English (NZE) and Brazilian Portuguese (BP) using eight real outdoor locations to discover the differences that occur when people describe the same location in the two languages. Our results show that NZE uses a wider range of spatial relation terms (e.g., beside) and reference objects (e.g., river) than BP, that BP uses more projective spatial relation terms than NZE, which prefers directional terms, and that translation between spatial relation terms is context‐dependent.

show abstract

“…Such databases have been created for several languages and are available in the form of a web application or computer software. Among them are the English lexicon project (Balota et al, 2007), eDom (Armstrong, Tokowicz, & Plaut, 2012), N-Watch (Davis, 2005) and MRC database (Coltheart, 1981) for English; DlexDB for German (Heister et al, 2011); Lexique (New, Pallier, Brysbaert, and Ferrand, 2004) for French; EsPal (Duchon, Perea, Sebastián-Gallés, Martí, & Carreiras, 2013) and BuscaPalabras (Davis & Perea, 2005) for Spanish; EHME (Acha, Laka, Landa, & Salaburu, 2014) and E-Hitz (Perea et al, 2006) for Basque; GreekLex (Ktori, van Heuven, & Pitchford, 2008) and GreekLex2 (Kyparissiadis et al, 2017) for Modern Greek; Aralex (Boudelaa & Marslen-Wilson, 2010) for Modern Standard Arabic; the Malay Lexicon Project (Yap, Liow, Jalil, & Faizal, 2010) for Malay; KelemetriK (Erten, Bozsahin, & Zeyrek, 2014) for Turkish; the Brazilian Portuguese Lexicon (Estivalet & Meunier, 2015) for Brazilian Portuguese, etc. All these databases are equipped with effective search and filtering tools.…”

Section: Introducing the Stimulstat Databasementioning

confidence: 99%

StimulStat: A lexical database for Russian

2017

View full text Add to dashboard Cite

In this article, we present StimulStat - a lexical database for the Russian language in the form of a web application. The database contains more than 52,000 of the most frequent Russian lemmas and more than 1.7 million word forms derived from them. These lemmas and forms are characterized according to more than 70 properties that were demonstrated to be relevant for psycholinguistic research, including frequency, length, phonological and grammatical properties, orthographic and phonological neighborhood frequency and size, grammatical ambiguity, homonymy and polysemy. Some properties were retrieved from various dictionaries and are presented collectively in a searchable form for the first time, the others were computed specifically for the database. The database can be accessed freely at http://stimul.cognitivestudies.ru .

show abstract

The Brazilian Portuguese Lexicon: An Instrument for Psycholinguistic Research

Cited by 13 publications

References 44 publications

When statistics collide: The use of transitional and phonotactic probability cues to word boundaries

When statistics collide: The use of transitional and phonotactic probability cues to word boundaries

A cross‐linguistic study of spatial location descriptions in New Zealand English and Brazilian Portuguese natural language

StimulStat: A lexical database for Russian

Contact Info

Product

Resources

About