Back to CL4HEALTH 2024
LREC-COLING 2024workshop

Unveiling Voices: Identification of Concerns in a Social Media Breast Cancer Cohort via Natural Language Processing

Proceedings of the First Workshop on Patient-Oriented Language Processing (CL4Health) @ LREC-COLING 2024

DOI:10.63317/5j6m6z5q867f

Abstract

We leveraged a dataset of ∼1.5 million Twitter (now X) posts to develop a framework for analyzing breast cancer (BC) patients’ concerns and possible reasons for treatment discontinuation. Our primary objectives were threefold: (1) to curate and collect data from a BC cohort; (2) to identify topics related to uncertainty/concerns in BC-related posts; and (3) to conduct a sentiment intensity analysis of posts to identify and analyze negatively polarized posts. RoBERTa outperformed other models with a micro-averaged F1 score of 0.894 and a macro-averaged F1 score of 0.853 for (1). For (2), we used GPT-4 and BERTopic, and qualitatively analyzed posts under relevant topics. For (3), sentiment intensity analysis of posts followed by qualitative analyses shed light on potential reasons behind treatment discontinuation. Our work demonstrates the utility of social media mining to discover BC patient concerns. Information derived from the cohort data may help design strategies in the future for increasing treatment compliance.

Details

Paper ID
lrec2024-ws-cl4health-32
Pages
pp. 264-270
BibKey
rajwal-etal-2024-unveiling
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the First Workshop on Patient-Oriented Language Processing (CL4Health) @ LREC-COLING 2024
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • SR

    Swati Rajwal

    Dept. of Biomedical Informatics, Emory University

  • AP

    Avinash Kumar Pandey

    Goizueta Business School, Emory University

  • ZH

    Zhishuo Han

    Goizueta Business School, Emory University

  • AS

    Abeed Sarker

    Dept. of Biomedical Informatics, Emory University

Links