Back to Main Conference 2026
LREC 2026main

Extracting Medical Image-Related Entities from Spanish Electronic Health Records Using NER Methods

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/4t6agzu5ygqr

Abstract

This paper presents a novel corpus in Spanish tailored for the extraction of medical image-related entities from radiological reports using Named Entity Recognition (NER) methods. The dataset was created by aggregating and refining multiple existing corpora, focusing on entities that can be visually interpreted in associated medical images. This resource aims to bridge the gap between natural language processing and computer vision in the biomedical domain. The study evaluates various NER methods, including encoder-only, encoder-decoder, and decoder-only architectures. It explores fine-tuning, zero-shot, and few-shot In-Context Learning (ICL) strategies to determine the most effective approach for entity extraction. The resulting dataset is publicly available.

Details

Paper ID
lrec2026-main-829
Pages
pp. 10569-10578
BibKey
platas-etal-2026-extracting
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • AP

    Alexander Platas

  • MM

    Marcos Merino

  • EZ

    Elena Zotova

  • MC

    Montse Cuadros

  • KL

    Karen López-Linares

  • MM

    Mikel Pérez de Mendiola

  • MG

    María Gálvez

  • CB

    Cristina Barba

  • AA

    Antón Asla

Links