Proceedings of the SIGUL 2026 Joint Workshop with ELE, EURALI, and DCLRL "Towards Inclusivity and Equality: Language Resources and Technologies for Under-Resourced and Endangered Languages
LREC 2026 Workshop
How Well Do Large Language Models Reason in Under-Resourced Languages? Evidence from Vietnamese
Tuan Anh Do, Jelke Bloem
Register Sensitivity in Scalar MT Evaluation: Evidence from Spanish–Basque Informal Discourse
Nora Aranberri
Corpus-Linguists’ Little Helpers? Evaluating LLMs for Linguistic Annotation: The Case of Sensationalist Headlines Corpus
Petra Bago, Virna Karlić
LLM as a Morphological Disambiguator for Belarusian: A Preliminary Study
Vladislav Poritski, Oksana Volchek, Ilia Afanasev
Keyboards for the Endangered Idu Mishmi Language
Akhilesh Kakolu Ramarao
SAINT: Multilingual Span-Level Interpretability for Sentiment Analysis
Seid Muhie Yimam, Tadesse Destaw Belay, Robert Geislinger, Shamsuddeen Hassan Muhammad, Adaeze Ngozi Ohuoba, Sukairaj Hafiz Imam, Abinew Ali Ayele, Martin Semmann, Chris Biemann, Serge Sharoff
AlbanianLLMSafety: A Safety Evaluation Dataset for Large Language Models in Albanian
Wajdi Zaghouani, Kholoud Khalil Aldous, Isra Fejzullaj
Urdu-CLEVR: A Novel Benchmark for Visual Reasoning in an Under-Resourced Linguistic Context
Sohail Ashraf, Adeel Zafar, Slawomir Nowaczyk, Ahthasham Sajid
A Database of Romance Clitics With Speech Samples
Abdelrahim Qaddoumi, Owen Rambow, Lori Repetti, Francisco Ordóñez
GreekCommonGen: A Benchmark for Evaluating Generative Commonsense Reasoning in Greek
Aristotelis Stamopoulos, Dimitrios Galanis
Transfer Learning for Creole TTS: A Pilot Study on Whether Substrate Phonologies or Lexifier Vocabularies Matter More
Emmett Strickland, Marc Evrard, Valentina Fedchenko
KZ-SafetyPrompts: A Kazakh Safety Evaluation Prompt Dataset for Large Language Models
Wajdi Zaghouani, Shimaa Amer Ibrahim, Aruzhan Muratbek, Olzhasbek Zhakenov, Adiya Akhmetzhanova
SimLex-999 for Modern Greek
Leonidas Mylonadis, Jelke Bloem
Quality and Appropriateness of Large Text Datasets for Irish NLP
Abigail Walsh, Mark Andrade, Jane Lauren Adkins, Ornait O'Connell, Éanna O'Connor, Ellen Rushe, Brian Davis
BiST: A Gold Standard Bangla-English Bilingual Corpus for Sentence Structure and Tense Classification with Inter-Annotator Agreement
Abdullah Al Shafi, Swapnil Kundu Argha, M. A. Moyeen, Abdul Muntakim, Shoumik Barman Polok
LLM-Assisted Spanish Dialect Corpus Construction
Jessica Claribel RAMIREZ VIDAL, Hiroki Ouchi, Sakriani Sakti
Structured Entity Extraction from Hawaiian Television Chyrons Using Vision-Language Models
Kelley Lynch, Owen King, Kyeongmin Rim, Gabrielle Keen, Yangyang Chen, James Pustejovsky
Towards a general theory of linguistic diversity
Steven Bird
Interlinear Glosses as a Multilingual Pivot for Machine Translation: An Updated Study on Turkish with Restricted Resources
Volkan Ozer, Shu Okabe, Alexander Fraser
Benchmarking Multilingual LLM Translation Accuracy for Fuzhounese
Sue Zheng, Jelke Bloem
Showing 20 of 31 papers | Page 1 of 2