Back to Main Conference 2026
LREC 2026main

AnswerCarefully: Creating a Dataset for LLM Safety in Japanese

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/289xxmwcworn

Abstract

In this paper we present JLLMSafety, a dataset for promoting the safety of Japanese LLM outputs. The dataset consists of 1,800 pairs of questions and reference answers, where the questions require special attention in answering. It covers a wide range of risk categories established in prior English-language datasets, but the data samples are original in that they are manually curated to reflect the socio-cultural context of LLM usage in Japan. We show that using this dataset for instruction to fine-tune a Japanese LLM led to improved output safety without compromising the utility of general responses. We also report the results of a safety evaluation of 12 Japanese LLMs using this dataset as a benchmark. Finally, we discuss the significance of creating regionally specific datasets of LLM safety, and describe the meta tags we added to the dataset to facilitate the creation of similar datasets in different languages and regions. The dataset is made available publicly for the sole purpose of improving LLM safety without any other usage restrictions.

Details

Paper ID
lrec2026-main-311
Pages
pp. 3921-3931
BibKey
suzuki-etal-2026-answercarefully
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • HS

    Hisami Suzuki

  • SK

    Satoru Katsumata

  • TK

    Takashi Kodama

  • TT

    Tetsuro Takahashi

  • KN

    Kouta Nakayama

  • SS

    Satoshi Sekine

Links