Back to Main Conference 2022
LREC 2022main

Corpus for Automatic Structuring of Legal Documents

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/543m4q3q6ts6

Abstract

In populous countries, pending legal cases have been growing exponentially. There is a need for developing techniques for processing and organizing legal documents. In this paper, we introduce a new corpus for structuring legal documents. In particular, we introduce a corpus of legal judgment documents in English that are segmented into topical and coherent parts. Each of these parts is annotated with a label coming from a list of pre-defined Rhetorical Roles. We develop baseline models for automatically predicting rhetorical roles in a legal document based on the annotated corpus. Further, we show the application of rhetorical roles to improve performance on the tasks of summarization and legal judgment prediction. We release the corpus and baseline model code along with the paper.

Details

Paper ID
lrec2022-main-470
Pages
pp. 4420-4429
BibKey
kalamkar-etal-2022-corpus
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • PK

    Prathamesh Kalamkar

  • AT

    Aman Tiwari

  • AA

    Astha Agarwal

  • SK

    Saurabh Karn

  • SG

    Smita Gupta

  • VR

    Vivek Raghavan

  • AM

    Ashutosh Modi

Links