HomeLREC 2020WorkshopsCALCSlrec2020-ws-calcs-8
Back to CALCS 2020
LREC 2020workshop

Code-mixed parse trees and how to find them

Proceedings of the 4th Workshop on Computational Approaches to Code Switching

DOI:10.63317/4n9xsmocquz7

Abstract

In this paper, we explore the methods of obtaining parse trees of code-mixed sentences and analyse the obtained trees. Existing work has shown that linguistic theories can be used to generate code-mixed sentences from a set of parallel sentences. We build upon this work, using one of these theories, the Equivalence-Constraint theory to obtain the parse trees of synthetically generated code-mixed sentences and evaluate them with a neural constituency parser. We highlight the lack of a dataset non-synthetic code-mixed constituency parse trees and how it makes our evaluation difficult. To complete our evaluation, we convert a code-mixed dependency parse tree set into “pseudo constituency trees” and find that a parser trained on synthetically generated trees is able to decently parse these as well.

Details

Paper ID
lrec2020-ws-calcs-8
Pages
pp. 57-64
BibKey
srinivasan-etal-2020-code
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 4th Workshop on Computational Approaches to Code Switching
Location
undefined, undefined
Date
11 May 2020 16 May 2020

Authors

  • AS

    Anirudh Srinivasan

  • SD

    Sandipan Dandapat

  • MC

    Monojit Choudhury

Links