Back to Main Conference 2022
LREC 2022main

PhysNLU: A Language Resource for Evaluating Natural Language Understanding and Explanation Coherence in Physics

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/3hagswbrbmr3

Abstract

In order for language models to aid physics research, they must first encode representations of mathematical and natural language discourse which lead to coherent explanations, with correct ordering and relevance of statements. We present a collection of datasets developed to evaluate the performance of language models in this regard, which measure capabilities with respect to sentence ordering, position, section prediction, and discourse coherence. Analysis of the data reveals the classes of arguments and sub-disciplines which are most common in physics discourse, as well as the sentence-level frequency of equations and expressions. We present baselines that demonstrate how contemporary language models are challenged by coherence related tasks in physics, even when trained on mathematical natural language objectives.

Details

Paper ID
lrec2022-main-492
Pages
pp. 4611-4619
BibKey
meadows-etal-2022-physnlu
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • JM

    Jordan Meadows

  • ZZ

    Zili Zhou

  • AF

    André Freitas

Links