Findings of the Association for Computational Linguistics: EMNLP 2020 2020
DOI: 10.18653/v1/2020.findings-emnlp.380
|View full text |Cite
|
Sign up to set email alerts
|

Contract Discovery: Dataset and a Few-Shot Semantic Retrieval Challenge with Competitive Baselines

Abstract: We propose a new shared task of semantic retrieval from legal texts, in which a so-called contract discovery is to be performed-where legal clauses are extracted from documents, given a few examples of similar clauses from other legal acts. The task differs substantially from conventional NLI and shared tasks on legal information extraction (e.g., one has to identify text span instead of a single document, page, or paragraph). The specification of the proposed task is followed by an evaluation of multiple solu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(16 citation statements)
references
References 29 publications
0
16
0
Order By: Relevance
“…Recently, to alleviate the dependence on user's expertise and improve retrieval effectiveness, a significant amount of effort [12,22,34,40] has been devoted to the automatic classification of legal documents and queries. The extraction of key legal concepts can also facilitate the retrieval process [2,39].…”
Section: Legal Information Retrieval and Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, to alleviate the dependence on user's expertise and improve retrieval effectiveness, a significant amount of effort [12,22,34,40] has been devoted to the automatic classification of legal documents and queries. The extraction of key legal concepts can also facilitate the retrieval process [2,39].…”
Section: Legal Information Retrieval and Extractionmentioning
confidence: 99%
“…fact") in the contract. CEE can bring useful insights into contracts and it can facilitate downstream applications, such as relevant clause retrieval or risk assessment [2].…”
Section: Introductionmentioning
confidence: 99%
“…Prior NLP work on contract language is extensive, including summarization (Manor and Li, 2019;Keymanesh et al, 2020) information extraction and understanding (Anish et al, 2019;Borchmann et al, 2020;Agarwal et al, 2021), as well as corpus studies looking at intrinsic properties of contracts (Curtotti and McCreath, 2011;Simonson et al, 2019) or providing new annotations over contract language (Funaki et al, 2020).…”
Section: Prior Workmentioning
confidence: 99%
“…We avoid neural network methods for this task for a few reasons. Results from Borchmann et al (2020) suggest that pre-trained models trained on more general language struggle on the idiosyncrasies of contract language-at least those pretrained on non-legal data. Further, a resourceintensive slot detection system is not scalable in the desired environment.…”
Section: Introductionmentioning
confidence: 99%
“…Its first part, made up of 315 documents, was annotated by three annotators, except that only contexts with some similarity, pre-selected using methods based on semantic similarity (cf. [3]), were taken into account; this was to make the annotation faster and less-labor intensive. The second part, with 195 documents, was annotated entirely by hand.…”
Section: Nda Datasetmentioning
confidence: 99%