LREC 2020 workshop

A Dataset for Troll Classification of TamilMemes

Proceedings of the WILDRE5 – 5th Workshop on Indian Language Data: Resources and Evaluation

DOI: 10.63317/5985jzftt5qm

Abstract

Social media are interactive platforms that facilitate the creation or sharing of information, ideas or other forms of expression among people. This exchange is not free from offensive, trolling or malicious content targeting users or communities. One way of trolling is by making memes, which in most cases combine an image with a concept or catchphrase. The challenge of dealing with memes is that they are region-specific and their meaning is often obscured by humour or sarcasm. To facilitate the computational modelling of trolling in memes for Indian languages, we created a meme dataset for Tamil (TamilMemes). We annotated and released the dataset, which contains suspected troll and not-troll memes. In this paper, we use image classification to examine the difficulties involved in classifying troll memes with existing methods. We found that identifying a troll meme with such an image classifier is not feasible, a finding corroborated by precision, recall and F1-score.
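For readers unfamiliar with the three metrics named in the abstract, the following is a minimal sketch of how precision, recall and F1-score are computed for a binary troll / not-troll task. The function name, the label strings and the toy labels are illustrative assumptions; they are not taken from the TamilMemes dataset or from the authors' evaluation code.

```python
def precision_recall_f1(y_true, y_pred, positive="troll"):
    """Compute precision, recall and F1 for one positive class.

    The label name "troll" is an assumption made for illustration only.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)  # true positives
    fp = sum(1 for t, p in pairs if t != positive and p == positive)  # false positives
    fn = sum(1 for t, p in pairs if t == positive and p != positive)  # false negatives

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1


# Toy labels, invented for this example (not real annotations).
y_true = ["troll", "troll", "not_troll", "not_troll", "troll"]
y_pred = ["troll", "not_troll", "troll", "not_troll", "troll"]

precision, recall, f1 = precision_recall_f1(y_true, y_pred)
```

Low values on all three metrics for the troll class would indicate that a classifier both misses troll memes and mislabels not-troll memes, which is the kind of evidence the abstract refers to.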

Details

Paper ID
lrec2020-ws-wildre-02
Pages
pp. 7-13
BibKey
suryawanshi-etal-2020-dataset
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the WILDRE5 – 5th Workshop on Indian Language Data: Resources and Evaluation
Date
11 May 2020 to 16 May 2020

Authors

  • Shardul Suryawanshi

  • Bharathi Raja Chakravarthi

  • Pranav Verma

  • Mihael Arcan

  • John P. McCrae

  • Paul Buitelaar

Links