Back to FINNLP 2024
LREC-COLING 2024workshop

KRX Bench: Automating Financial Benchmark Creation via Large Language Models

Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing

DOI:10.63317/3h43ax9b5w5q

Abstract

In this work, we introduce KRX-Bench, an automated pipeline for creating financial benchmarks via GPT-4. To demonstrate the effectiveness of the pipeline, we create KRX-Bench-POC, a benchmark assessing the knowledge of LLMs in real-world companies. This dataset comprises 1,002 questions, each focusing on companies across the U.S., Japanese, and Korean stock markets. We make our pipeline and dataset publicly available and integrate the evaluation code into EleutherAI’s Language Model Evaluation Harness.

Details

Paper ID
lrec2024-ws-finnlp-02
Pages
pp. 10-20
BibKey
son-etal-2024-krx
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery from Unstructured Data in Financial Services, and the 4th Workshop on Economics and Natural Language Processing
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • GS

    Guijin Son

  • HJ

    Hyunjun Jeon

  • CH

    Chami Hwang

  • HJ

    Hanearl Jung

Links