Back to CL4HEALTH 2024
LREC-COLING 2024workshop

It’s Difficult to Be Neutral – Human and LLM-based Sentiment Annotation of Patient Comments

Proceedings of the First Workshop on Patient-Oriented Language Processing (CL4Health) @ LREC-COLING 2024

DOI:10.63317/2ce52hsv3srb

Abstract

Sentiment analysis is an important tool for aggregating patient voices, in order to provide targeted improvements in healthcare services. A prerequisite for this is the availability of in-domain data annotated for sentiment. This article documents an effort to add sentiment annotations to free-text comments in patient surveys collected by the Norwegian Institute of Public Health (NIPH). However, annotation can be a time-consuming and resource-intensive process, particularly when it requires domain expertise. We therefore also evaluate a possible alternative to human annotation, using large language models (LLMs) as annotators. We perform an extensive evaluation of the approach for two openly available pretrained LLMs for Norwegian, experimenting with different configurations of prompts and in-context learning, comparing their performance to human annotators. We find that even for zero-shot runs, models perform well above the baseline for binary sentiment, but still cannot compete with human annotators on the full dataset.

Details

Paper ID
lrec2024-ws-cl4health-02
Pages
pp. 8-19
BibKey
maehlum-etal-2024-difficult
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the First Workshop on Patient-Oriented Language Processing (CL4Health) @ LREC-COLING 2024
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • PM

    Petter Mæhlum

  • DS

    David Samuel

  • RN

    Rebecka Maria Norman

  • EJ

    Elma Jelin

  • ØB

    Øyvind Andresen Bjertnæs

  • Lilja Øvrelid

  • EV

    Erik Velldal

Links