Back to Main Conference 2022
LREC 2022main

The Bull and the Bear: Summarizing Stock Market Discussions

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/3v5fmvb6ccch

Abstract

Stock market investors debate and heavily discuss stock ideas, investing strategies, news and market movements on social media platforms. The discussions are significantly longer in length and require extensive domain expertise for understanding. In this paper, we curate such discussions and construct a first-of-its-kind of abstractive summarization dataset. Our curated dataset consists of 7888 Reddit posts and manually constructed summaries for 400 posts. We robustly evaluate the summaries and conduct experiments on SOTA summarization tools to showcase their limitations. We plan to make the dataset publicly available. The sample dataset is available here: https://dhyeyjani.github.io/RSMC

Details

Paper ID
lrec2022-main-746
Pages
pp. 6909-6913
BibKey
kumar-etal-2022-bull
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • AK

    Ayush Kumar

  • DJ

    Dhyey Jani

  • JS

    Jay Shah

  • DT

    Devanshu Thakar

  • VJ

    Varun Jain

  • MS

    Mayank Singh

Links