Error Analysis of Uyghur Name Tagging: Language-specific Techniques and Remaining Challenges

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

Abstract

Regardless of numerous efforts at name tagging for Uyghur, there is limited understanding on the performance ceiling. In this paper, we take a close look at the successful cases and perform careful analysis on the remaining errors of a state-of-the-art Uyghur name tagger, systematically categorize challenges, and propose possible solutions. We conclude that simply adopting a machine learning model which is proven successful for high-resource languages along with language-independent superficial features is unlikely to be effective for Uyghur, or low-resource languages in general. Further advancement requires exploiting rich language-specific knowledge and non-traditional linguistic resources, and novel methods to encode them into machine learning frameworks.

Resources

Details

Paper ID

lrec2018-main-700

Pages

N/A

DOI

10.63317/4oqn93a5zuqv

BibKey

abudukelimu-etal-2018-error

Editors

Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga

Publisher

European Language Resources Association (ELRA)

ISSN

2522-2686

ISBN

79-10-95546-00-9

Conference

Eleventh International Conference on Language Resources and Evaluation

Location

Miyazaki, Japan

Date

7 - 12 May 2018

Authors

HA
Halidanmu Abudukelimu
AA
Abudoukelimu Abulizi
BZ
Boliang Zhang
XP
Xiaoman Pan
DL
Di Lu
HJ
Heng Ji
YL
Yang Liu

Links

URL

DOI