2021
DOI: 10.4000/jtei.4133
|View full text |Cite
|
Sign up to set email alerts
|

The Parla-CLARIN Recommendations for Encoding Corpora of Parliamentary Proceedings

Abstract: Parliamentary proceedings are a rich source of data that can be used by scholars in various humanities and social sciences disciplines. Unlike the sources of most other language corpora, parliamentary proceedings are not subject to copyright or personal privacy protections, and are typically available online, thus making them ideal for compilation into corpora and for open distribution. For these reasons many countries have already produced corpora of parliamentary proceedings, but each typically in their own … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 9 publications
0
1
0
Order By: Relevance
“…In addition, interdisciplinary workshops on working with parliamentary records and co-locaated with the LREC conference were organised (2018,2020,2022) under the guidance of CLARIN. Finally, the Parla-CLARIN recommendations for encoding parliamentary corpora (Erjavec and Pančur, 2021) 2 were proposed at the "CLARIN ParlaFormat Workshop" in 2019.…”
Section: Encoding the Parlamint Corporamentioning
confidence: 99%
“…In addition, interdisciplinary workshops on working with parliamentary records and co-locaated with the LREC conference were organised (2018,2020,2022) under the guidance of CLARIN. Finally, the Parla-CLARIN recommendations for encoding parliamentary corpora (Erjavec and Pančur, 2021) 2 were proposed at the "CLARIN ParlaFormat Workshop" in 2019.…”
Section: Encoding the Parlamint Corporamentioning
confidence: 99%
“…Finally, we transformed the speeches into two parallel data sets: (1) an RDF (Resource Description Framework) [25] format speech knowledge graph, forming linked data and (2) an XML corpus formed according to the Parla-CLARIN v0.2 specification [26]. More on this transformation can be read from [9].…”
Section: Post-correction and Transformation Into Linked Data And Parl...mentioning
confidence: 99%
“…The corpora are available in two modes: as data (Erjavec T. e., 2021)and in concordancing tools. At the moment version 2.1 are available but version 3.0 is expected at the end of June 2023 and version 3.1 -at the end of September 2023.…”
Section: Exploitation Of Parlamint Corporamentioning
confidence: 99%