Back to Main Conference 2022
LREC 2022main

Using Sentence-level Classification Helps Entity Extraction from Material Science Literature

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/3eippvhm5qk6

Abstract

In the last few years, several attempts have been made on extracting information from material science research domain. Material Science research articles are a rich source of information about various entities related to material science such as names of the materials used for experiments, the computational software used along with its parameters, the method used in the experiments, etc. But the distribution of these entities is not uniform across different sections of research articles. Most of the sentences in the research articles do not contain any entity. In this work, we first use a sentence-level classifier to identify sentences containing at least one entity mention. Next, we apply the information extraction models only on the filtered sentences, to extract various entities of interest. Our experiments for named entity recognition in the material science research articles show that this additional sentence-level classification step helps to improve the F1 score by more than 4%.

Details

Paper ID
lrec2022-main-483
Pages
pp. 4540-4545
BibKey
mullick-etal-2022-using
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • AM

    Ankan Mullick

  • SP

    Shubhraneel Pal

  • TN

    Tapas Nayak

  • SL

    Seung-Cheol Lee

  • SB

    Satadeep Bhattacharjee

  • PG

    Pawan Goyal

Links