2022
DOI: 10.48550/arxiv.2209.06049
Preprint

Pre-training Transformers on Indian Legal Text

Abstract: Natural Language Processing in the legal domain has benefited hugely from the emergence of Transformer-based Pre-trained Language Models (PLMs) pre-trained on legal text. There exist PLMs trained over European and US legal text, most notably Legal-BERT. However, with the rapidly increasing volume of NLP applications on Indian legal documents, and the distinguishing characteristics of Indian legal text, it has become necessary to pre-train LMs over Indian legal text as well. In this work, we introduce transforme…

Cited by 2 publications (1 citation statement)
References 17 publications
“…We used pre-trained transformers trained on a general corpus, such as XLNet (Yang et al., 2019) and RoBERTa (Liu et al., 2019b), as well as transformers trained on a legal corpus: LegalBERT (Chalkidis et al., 2020), InLegalBERT and InCaseLawBERT (Paul et al., 2022). We also trained BERT-large on Indian judgment cases. Since these transformers cannot accommodate more than 512 tokens, we fed only the last 510 tokens of each document (two special tokens are reserved for [CLS] and [SEP]), as Malik et al. (2021) note that the most relevant information is generally present at the end of the documents.…”
Section: Legal Judgment Prediction (LJP)
Confidence: 99%
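A minimal sketch of the tail-truncation strategy the citing authors describe, assuming the Hugging Face transformers tokenizer API; the checkpoint name, helper function, and return format are illustrative assumptions, not the cited paper's code.

```python
from transformers import AutoTokenizer

# Assumed checkpoint name for illustration; swap in whichever PLM you use.
tokenizer = AutoTokenizer.from_pretrained("law-ai/InLegalBERT")

def tail_truncate(text: str, max_len: int = 512) -> dict:
    """Keep only the last (max_len - 2) tokens of a long judgment,
    reserving two slots for the [CLS] and [SEP] special tokens."""
    # Tokenize without special tokens first, then take the tail.
    ids = tokenizer.encode(text, add_special_tokens=False)
    tail = ids[-(max_len - 2):]  # last 510 tokens for a 512-token model
    input_ids = [tokenizer.cls_token_id] + tail + [tokenizer.sep_token_id]
    return {"input_ids": input_ids, "attention_mask": [1] * len(input_ids)}

enc = tail_truncate("... full judgment text ...")
assert len(enc["input_ids"]) <= 512
```

Taking the tail rather than the head follows the observation attributed to Malik et al. (2021) that the decisive content of Indian judgments tends to sit at the end of the document.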