Research Methods in Legal Translation and Interpreting 2019
DOI: 10.4324/9781351031226-2

Corpus methods in legal translation studies

citations
Cited by 14 publications
(3 citation statements)
references
References 0 publications
“…There exist publicly available annotated NER data sets for general English text, such as CoNLL-2003 (Sang and De Meulder, 2003), WNUT17 (Derczynski et al, 2017), and the Wikipedia gold standard corpus (Balasuriya et al, 2009), as well as for other languages (Neudecker, 2016; Sang and De Meulder, 2003; Santos et al, 2006; Ševčíková et al, 2007). For legal domain-specific data sets, non annotated legal text is abundant, as detailed in Pontrandolfo (2012). For example, the pre-training of Legal-BERT (Chalkidis et al, 2020) is performed on a corpus of non annotated documents consisting of legislation, court cases, and contracts from the UK, US, and the European Union.…”
Section: Related Work (mentioning)
confidence: 99%
“…However, the availability of legal resources may vary dramatically given the inherently confidential and private nature of some legal documents and the institutional confines within which they are created. There are several useful overviews of contemporary legal corpora [72,85,86,107]. Worth recommending is also SOULL (Sources of Language and Law), an open on-line platform, regularly updated to provide a wealth of information about existing data collections and corpora of legal language [94].…”
Section: Corpora In Legal Discourse (mentioning)
confidence: 99%
“…Other types of corpora have been usually described in terms of dichotomies: general vs. specialized, monolingual vs. bi-or-multi-lingual, comparable vs. parallel, diachronic vs. synchronic, etc. (see [85] for an overview of legal corpora in multilingual settings). The choice of a specific type of corpus depends on research goals but discourse analysis is inevitably comparative.…”
Section: Corpora In Legal Discourse (mentioning)
confidence: 99%