HomeLREC 2020WorkshopsCALCSlrec2020-ws-calcs-5
Back to CALCS 2020
LREC 2020workshop

Understanding Script-Mixing: A Case Study of Hindi-English Bilingual Twitter Users

Proceedings of the 4th Workshop on Computational Approaches to Code Switching

DOI:10.63317/3bwgxnhedwh5

Abstract

In a multi-lingual and multi-script society such as India, many users resort to code-mixing while typing on social media. While code-mixing has received a lot of attention in the past few years, it has mostly been studied within a single-script scenario. In this work, we present a case study of Hindi-English bilingual Twitter users while considering the nuances that come with the intermixing of different scripts. We present a concise analysis of how scripts and languages interact in communities and cultures where code-mixing is rampant and offer certain insights into the findings. Our analysis shows that both intra-sentential and inter-sentential script-mixing are present on Twitter and show different behavior in different contexts. Examples suggest that script can be employed as a tool for emphasizing certain phrases within a sentence or disambiguating the meaning of a word. Script choice can also be an indicator of whether a word is borrowed or not. We present our analysis along with examples that bring out the nuances of the different cases.

Details

Paper ID
lrec2020-ws-calcs-5
Pages
pp. 36-44
BibKey
srivastava-etal-2020-understanding
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 4th Workshop on Computational Approaches to Code Switching
Location
undefined, undefined
Date
11 May 2020 16 May 2020

Authors

  • AS

    Abhishek Srivastava

  • KB

    Kalika Bali

  • MC

    Monojit Choudhury

Links