Back to CLSSTS 2020
LREC 2020workshop
The Effect of Linguistic Parameters in CLIR Performance
Proceedings of the workshop on Cross-Language Search and Summarization of Text and Speech (CLSSTS2020)
Abstract
This paper will detail how IARPA’s MATERIAL Cross-Language Information Retrieval (CLIR) program investigated certain linguistic parameters to guide language choice, data collection and partitioning, and understand evaluation results. Discerning which linguistic parameters correlated with overall performance enabled the evaluation of progress when different languages were measured, and also was an important factor in determining the most effective CLIR pipeline design, customized to handle language-specific properties deemed necessary to address.