AraREQ: A Dataset and End-to-End System for Conflict Detection and Resolution in Software Requirements
Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Abstract
Conflict detection in software requirements is essential for ensuring specification consistency, improving project efficiency, and ensuring overall software quality. Despite its importance, research on this task, particularly for Arabic, remains limited due to the scarcity of annotated data and linguistic challenges. To address this gap, we introduce AraREQ, a large-scale Arabic dataset for requirement-level conflict detection and resolution. The dataset is constructed through a semi-automated Arabization process using Large Language Models (LLMs), followed by manual augmentation to address class imbalance. The final dataset comprises 27K Arabic requirement pairs. We benchmark four state-of-the-art LLMs under zero-shot and few-shot settings, establishing the first comprehensive evaluation for Arabic requirements conflict detection. Experimental results show that few-shot prompting consistently improves performance, particularly on the minority conflict class, demonstrating the effectiveness of example-based prompting. Finally, we introduce an end-to-end system that automatically detects potential conflicts in Arabic software requirements and generates resolution suggestions. All datasets, codes, and the end-to-end system are open-source and available at: https://sina.birzeit.edu/ArReqConflicts/