Back to DMR 2024
LREC-COLING 2024workshop

PropBank-Powered Data Creation: Utilizing Sense-Role Labelling to Generate Disaster Scenario Data

Proceedings of the Fifth International Workshop on Designing Meaning Representations @ LREC-COLING 2024

DOI:10.63317/56tapmd9cfmc

Abstract

For human-robot dialogue in a search-and-rescue scenario, a strong knowledge of the conditions and objects a robot will face is essential for effective interpretation of natural language instructions. In order to utilize the power of large language models without overwhelming the limited storage capacity of a robot, we propose PropBank-Powered Data Creation. PropBank-Powered Data Creation is an expert-in-the-loop data generation pipeline which creates training data for disaster-specific language models. We leverage semantic role labeling and Rich Event Ontology resources to efficiently develop seed sentences for fine-tuning a smaller, targeted model that could operate onboard a robot for disaster relief. We developed 32 sentence templates, which we used to make 2 seed datasets of 175 instructions for earthquake search and rescue and train derailment response. We further leverage our seed datasets as evaluation data to test our baseline fine-tuned models.

Details

Paper ID
lrec2024-ws-dmr-01
Pages
pp. 1-10
BibKey
shichman-etal-2024-propbank
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Fifth International Workshop on Designing Meaning Representations @ LREC-COLING 2024
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • MS

    Mollie Frances Shichman

  • CB

    Claire Bonial

  • TH

    Taylor A. Hudson

  • AB

    Austin Blodgett

  • FF

    Francis Ferraro

  • RR

    Rachel Rudinger

Links