Grammatical Error Annotation for Korean Learners of Spoken English

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/2ujzffue8qcu

Abstract

The goal of our research is to build a grammatical-error-tagged corpus for Korean learners of spoken English, dubbed the Postech Learner Corpus. We collected raw storytelling speech from Korean university students. Six Korean annotators fluent in English transcribed the speech and annotated it using the Cambridge Learner Corpus tagset. To support the annotation, we developed an annotation tool and a validation tool. After the human annotations were compared with machine-recommended error tags, the unmatched errors were rechecked by a native annotator. The spoken language corpus built in this study showed different characteristics from an existing written language corpus.

Details

Paper ID
lrec2012-main-035
Pages
pp. 1628-1631
BibKey
seo-etal-2012-grammatical
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21-27 May 2012

Authors

  • Hongsuck Seo
  • Kyusong Lee
  • Gary Geunbae Lee
  • Soo-Ok Kweon
  • Hae-Ri Kim