Back to Main Conference 2018
LREC 2018main

Error Analysis of Uyghur Name Tagging: Language-specific Techniques and Remaining Challenges

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/4oqn93a5zuqv

Abstract

Regardless of numerous efforts at name tagging for Uyghur, there is limited understanding on the performance ceiling. In this paper, we take a close look at the successful cases and perform careful analysis on the remaining errors of a state-of-the-art Uyghur name tagger, systematically categorize challenges, and propose possible solutions. We conclude that simply adopting a machine learning model which is proven successful for high-resource languages along with language-independent superficial features is unlikely to be effective for Uyghur, or low-resource languages in general. Further advancement requires exploiting rich language-specific knowledge and non-traditional linguistic resources, and novel methods to encode them into machine learning frameworks.

Details

Paper ID
lrec2018-main-700
Pages
N/A
BibKey
abudukelimu-etal-2018-error
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 12 May 2018

Authors

  • HA

    Halidanmu Abudukelimu

  • AA

    Abudoukelimu Abulizi

  • BZ

    Boliang Zhang

  • XP

    Xiaoman Pan

  • DL

    Di Lu

  • HJ

    Heng Ji

  • YL

    Yang Liu

Links