LREC-COLING 2024, Main Conference

MentalHelp: A Multi-Task Dataset for Mental Health in Social Media

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/372jegsog7nt

Abstract

Early detection of mental health disorders is an essential step in treating and preventing mental health conditions. Computational approaches have been applied to users’ social media profiles in an attempt to identify various mental health conditions such as depression, PTSD, schizophrenia, and eating disorders. The interest in this topic has motivated the creation of various depression detection datasets. However, annotating such datasets is expensive and time-consuming, limiting their size and scope. To overcome this limitation, we present MentalHelp, a large-scale semi-supervised mental disorder detection dataset containing 14 million instances. The corpus was collected from Reddit and labeled in a semi-supervised way using an ensemble of three separate models: flan-T5, Disor-BERT, and Mental-BERT.

Details

Paper ID
lrec2024-main-0977
Pages
pp. 11196-11203
BibKey
raihan-etal-2024-mentalhelp
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20-25 May 2024

Authors

  • Nishat Raihan
  • Sadiya Sayara Chowdhury Puspo
  • Shafkat Farabi
  • Ana-Maria Bucur
  • Tharindu Ranasinghe
  • Marcos Zampieri
