
A Multi- versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/3rx2ukbdbzs7

Abstract

This work presents a comparative study of two approaches to building an automatic classification system for modality values in Portuguese. One approach uses a single multi-class classifier trained on the full dataset, which covers eleven modal verbs; the other builds a separate classifier for each verb. Performance is measured using precision, recall and F1. Because the dataset is unbalanced, a weighted average was calculated for each metric. We use support vector machines as our classifier and experimented with various SVM kernels to find the best classifier for the task. We experimented with several types of features representing parse tree information and compared these complex feature representations against a simple bag-of-words representation as a baseline. The best F1 values obtained are above 0.60, and the results indicate no significant difference between the two approaches.
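
As an illustration of the weighted averaging mentioned in the abstract, the sketch below computes precision, recall and F1 per class and then averages them with weights proportional to each class's number of gold instances. This is an assumption about the weighting scheme (the abstract does not spell it out), and the function name `weighted_scores` and the modality labels in the example are hypothetical, not taken from the paper.

```python
from collections import Counter


def weighted_scores(gold, pred):
    """Support-weighted average of per-class precision, recall and F1.

    Assumption: each class is weighted by its share of gold instances.
    """
    support = Counter(gold)  # number of gold instances per class
    total = len(gold)
    w_precision = w_recall = w_f1 = 0.0
    for label, n in support.items():
        # True positives: predicted `label` where gold is also `label`.
        tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
        predicted = sum(1 for p in pred if p == label)
        precision = tp / predicted if predicted else 0.0
        recall = tp / n
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        weight = n / total
        w_precision += weight * precision
        w_recall += weight * recall
        w_f1 += weight * f1
    return w_precision, w_recall, w_f1


# Hypothetical, deliberately unbalanced toy labels.
gold = ["epistemic", "epistemic", "epistemic", "deontic"]
pred = ["epistemic", "epistemic", "deontic", "deontic"]
precision, recall, f1 = weighted_scores(gold, pred)
```

With this weighting, frequent classes dominate the averaged score, so a classifier that does poorly on a rare modality value is penalised less than it would be under an unweighted (macro) average.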

Details

Paper ID
lrec2018-main-161
Pages
N/A
BibKey
sequeira-etal-2018-multi
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 to 12 May 2018

Authors

  • João Sequeira
  • Teresa Gonçalves
  • Paulo Quaresma
  • Amália Mendes
  • Iris Hendrickx

Links