Back to Main Conference 2024
LREC-COLING 2024main

Automatic Data Visualization Generation from Chinese Natural Language Questions

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/2vxnbckuix9g

Abstract

Data visualization has emerged as an effective tool for getting insights from massive datasets. Due to the hardness of manipulating the programming languages of data visualization, automatic data visualization generation from natural languages (Text-to-Vis) is becoming increasingly popular. Despite the plethora of research effort on the English Text-to-Vis, studies have yet to be conducted on data visualization generation from questions in Chinese. Motivated by this, we propose a Chinese Text-to-Vis dataset in the paper and demonstrate our first attempt to tackle this problem. Our model integrates multilingual BERT as the encoder, boosts the cross-lingual ability, and infuses the n-gram information into our word representation learning. Our experimental results show that our dataset is challenging and deserves further research.

Details

Paper ID
lrec2024-main-0169
Pages
pp. 1889-1898
BibKey
ge-etal-2024-automatic
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20 May 2024 25 May 2024

Authors

  • YG

    Yan Ge

  • VW

    Victor Junqiu Wei

  • YS

    Yuanfeng Song

  • JZ

    Jason Chen Zhang

  • RW

    Raymond Chi-Wing Wong

Links