2020
DOI: 10.1590/1678-460x2020360209
|View full text |Cite
|
Sign up to set email alerts
|

O Corpus de Português Escrito em Periódicos - CoPEP

Abstract: RESUMO O presente estudo tem como objetivo descrever os desafios e soluções encontrados na compilação do Corpus de Português Escrito em Periódicos - CoPEP, que contém aproximadamente 40 milhões de palavras, é equilibrado entre as variedades português brasileiro e português europeu em número de palavras e cobre seis grandes áreas de conhecimento. Primeiramente, apresentaremos o contexto de criação do CoPEP, qual seja, a elaboração de um dicionário on-line de português para universitários, para o qual serviu com… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 4 publications
(5 reference statements)
0
8
0
Order By: Relevance
“…Another specialized Portuguese corpus, however, was employed occasionally to obtain additional evidence: the Corpus of Portuguese from Academic Journals (CoPEP; Kuhn & Ferreira, 2018). Unlike other specialized corpora available in Sketch Engine, the data processing type of this corpus allows keyword extraction, making it suitable to look for supplementary information.…”
Section: Other Structuresmentioning
confidence: 99%
See 4 more Smart Citations
“…Another specialized Portuguese corpus, however, was employed occasionally to obtain additional evidence: the Corpus of Portuguese from Academic Journals (CoPEP; Kuhn & Ferreira, 2018). Unlike other specialized corpora available in Sketch Engine, the data processing type of this corpus allows keyword extraction, making it suitable to look for supplementary information.…”
Section: Other Structuresmentioning
confidence: 99%
“…The keyword analysis of the first stage consisted of a series of five analyses in the following order: (1) enRAs against enTenTen15, (2) enRAs against enTenTen20, (3) ptRAs against ptTenTen11, (4) CoPEP (Kuhn & Ferreira, 2018) against ptTenTen11, and (5) jaRAs against jaTenTen 11 LUW.…”
Section: Cross-linguistic Variationmentioning
confidence: 99%
See 3 more Smart Citations