Back to NSLP 2026
LREC 2026workshop
Identifying Implicit Research Data References in Paper Citations
Proceedings of Natural Scientific Language Processing (NSLP) @ LREC 2026
Abstract
To encourage the public release of research data under open science, it is beneficial to establish mechanisms for evaluating research data based on metrics such as citation counts. In scholarly papers, authors sometimes cite papers that report the creation or release of research data instead of citing the research data themselves. In this paper, as a step toward computing citation counts of research data, we investigate the feasibility of identifying paper citations that refer to research data. We conducted an identification experiment using large language models and evaluated their performance.