HomeLREC 2020WorkshopsWILDRElrec2020-ws-wildre-01
Back to WILDRE 2020
LREC 2020workshop

Part-of-Speech Annotation Challenges in Marathi

Proceedings of the WILDRE5– 5th Workshop on Indian Language Data: Resources and Evaluation

DOI:10.63317/4o3b74deoxe7

Abstract

Part of Speech (POS) annotation is a significant challenge in natural language processing. The paper discusses issues and challenges faced in the process of POS annotation of the Marathi data from four domains viz., tourism, health, entertainment and agriculture. During POS annotation, a lot of issues were encountered. Some of the major ones are discussed in detail in this paper. Also, the two approaches viz., the lexical (L approach) and the functional (F approach) of POS tagging have been discussed and presented with examples. Further, some ambiguous cases in POS annotation are presented in the paper.

Details

Paper ID
lrec2020-ws-wildre-01
Pages
pp. 1-6
BibKey
rane-etal-2020-part
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the WILDRE5– 5th Workshop on Indian Language Data: Resources and Evaluation
Location
undefined, undefined
Date
11 May 2020 16 May 2020

Authors

  • GR

    Gajanan Rane

  • NJ

    Nilesh Joshi

  • GR

    Geetanjali Rane

  • HR

    Hanumant Redkar

  • MK

    Malhar Kulkarni

  • PB

    Pushpak Bhattacharyya

Links