Back to Workshops

Proceedings of the Workshop on Dataset Creation for Lower-Resourced Languages within the 13th Language Resources and Evaluation Conference

LREC 2022 Workshop

undefined, undefined 20 June 2022 - 25 June 2022 10 papers
Show20per page
01

SyntAct: A Synthesized Database of Basic Emotions

Felix Burkhardt, Florian Eyben, Björn Schuller

pp. 1-9 DOI: 10.63317/3ocj5ga83oh5
02

Data Sets of Eating Disorders by Categorizing Reddit and Tumblr Posts: A Multilingual Comparative Study Based on Empirical Findings of Texts and Images

Christina Baskal, Amelie Elisabeth Beutel, Jessika Keberlein, Malte Ollmann, Esra Üresin, Jana Vischinski, Janina Weihe, Linda Achilles, Christa Womser-Hacker

pp. 10-18 DOI: 10.63317/34v5yxu4ugne
03

Construction and Validation of a Japanese Honorific Corpus Based on Systemic Functional Linguistics

Muxuan Liu, Ichiro Kobayashi

pp. 19-26 DOI: 10.63317/4vu44xgdewob
04

Building an Icelandic Entity Linking Corpus

Steinunn Rut Friðriksdóttir, Valdimar Ágúst Eggertsson, Benedikt Geir Jóhannesson, Hjalti Daníelsson, Hrafn Loftsson, Hafsteinn Einarsson

pp. 27-35 DOI: 10.63317/4mpjpcgxcck2
05

Crawling Under-Resourced Languages - a Portal for Community-Contributed Corpus Collection

Erik Körner, Felix Helfer, Christopher Schröder, Thomas Eckart, Dirk Goldhahn

pp. 36-43 DOI: 10.63317/5jv9n3w5vd45
06

Fine-grained Entailment: Resources for Greek NLI and Precise Entailment

Eirini Amanaki, Jean-Philippe Bernardy, Stergios Chatzikyriakidis, Robin Cooper, Simon Dobnik, Aram Karimi, Adam Ek, Eirini Chrysovalantou Giannikouri, Vasiliki Katsouli, Ilias Kolokousis, Eirini Chrysovalantou Mamatzaki, Dimitrios Papadakis, Olga Petrova, Erofili Psaltaki, Charikleia Soupiona, Effrosyni Skoulataki, Christina Stefanidou

pp. 44-52 DOI: 10.63317/53kr9fcudejq
07

Words.hk: A Comprehensive Cantonese Dictionary Dataset with Definitions, Translations and Transliterated Examples

Chaak-ming Lau, Grace Wing-yan Chan, Raymond Ka-wai Tse, Lilian Suet-ying Chan

pp. 53-62 DOI: 10.63317/3do4ttc9npfn
08

LiSTra Automatic Speech Translation: English to Lingala Case Study

Salomon Kabongo Kabenamualu, Vukosi Marivate, Herman Kamper

pp. 63-67 DOI: 10.63317/4it9vjqajt52
09

Ara-Women-Hate: An Annotated Corpus Dedicated to Hate Speech Detection against Women in the Arabic Community

Imane Guellil, Ahsan Adeel, Faical Azouaou, Mohamed Boubred, Yousra Houichi, Akram Abdelhaq Moumna

pp. 68-75 DOI: 10.63317/35vysaesfbkx
10

Word-level Language Identification Using Subword Embeddings for Code-mixed Bangla-English Social Media Data

Aparna Dutta

pp. 76-82 DOI: 10.63317/3zk78fhcy2gm

Showing all 10 papers