LRECLREC Language Resources and Evaluation Conference
Conferences
  • LREC 2026944
  • LREC 20242170
    Workshops603
    bucc15
    cawl8
    cl4health33
    cogalex19
    delite7
    determit18
    dlnld8
    dmr17
    ecnlp15
    eurali8
    finnlp34
    games12
    htres9
    humeval26
    isa18
    ldl15
    legal11
    lt4hala33
    mathnlp5
    mwe27
    neusymbridge5
    nlperspectives16
    osact17
    parlaclarin25
    politicalnlp10
    rail17
    rapid11
    readi9
    rfp5
    safety4convai5
    signlang45
    sigul50
    tdle6
    trac17
    unlp16
    wildre11
  • LREC 20221271
    Workshops467
    bucc9
    cltw18
    cmlc6
    csrnlp8
    dclrl10
    digitam6
    eurali18
    fnp24
    games7
    gwll13
    isa19
    lateraisse6
    law20
    legal15
    lt4hala31
    mwe17
    nidcp9
    nlperspectives15
    osact28
    parlaclarin19
    politicalnlp14
    pvlam6
    rapid12
    readi9
    restup4
    salld6
    signlang32
    sigul27
    sltat19
    smila10
    tdle6
    term7
    wildre17
  • LREC 20201318
    Workshops423
    aespen11
    ai4hi5
    bucc11
    calcs9
    cllrd8
    clssts11
    cmlc9
    computerm15
    framenet12
    gamnlp12
    globalex18
    isa12
    iwltp17
    ldl12
    lincr8
    lr4sshoc9
    lt4gov6
    lt4hala21
    mmw7
    multilingualbio6
    onion5
    osact18
    parlaclarin13
    rail9
    readi14
    restup4
    signlang36
    sltu52
    stoc8
    trac25
    wac8
    wildre12
  • LREC 2018728
  • LREC 2016745
  • LREC 2014746
  • LREC 2012670
  • LREC 2010645
  • LREC 2008620
  • LREC 2006513
  • LREC 2004524
  • LREC 2002354
  • LREC 2000280
  • LREC 1998212
HomeLREC 2020WorkshopsWAC
Back to Workshops

Proceedings of the 12th Web as Corpus Workshop

LREC 2020 Workshop

undefined, undefined 11 May 2020 - 16 May 2020 8 papers
DOI:10.63317/2va68regv5ni
Show20per page
1

Current Challenges in Web Corpus Building

Miloš Jakubíček, Vojtěch Kovář, Pavel Rychlý, Vit Suchomel

pp. 1-4 DOI: 10.63317/5hpusmwfudpu
2

Out-of-the-Box and into the Ditch? Multilingual Evaluation of Generic Text Extraction Tools

Adrien Barbaresi, Gaël Lejeune

pp. 5-13 DOI: 10.63317/3vjqvd9vwtch
3

From Web Crawl to Clean Register-Annotated Corpora

Veronika Laippala, Samuel Rönnqvist, Saara Hellström, Juhani Luotolahti, Liina Repo, Anna Salmela, Valtteri Skantsi, Sampo Pyysalo

pp. 14-22 DOI: 10.63317/3ewgn53qox7h
4

Building Web Corpora for Minority Languages

Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén

pp. 23-32 DOI: 10.63317/29i28sv6ykra
5

The ELTE.DH Pilot Corpus – Creating a Handcrafted Gigaword Web Corpus with Metadata

Balázs Indig, Árpád Knap, Zsófia Sárközi-Lindner, Mária Timári, Gábor Palkó

pp. 33-41 DOI: 10.63317/3zmqt8f6rop7
6

Hypernym-LIBre: A Free Web-based Corpus for Hypernym Detection

Shaurya Rawat, Mariano Rico, Oscar Corcho

pp. 42-49 DOI: 10.63317/5ipbwy62x4z7
7

A Cross-Genre Ensemble Approach to Robust Reddit Part of Speech Tagging

Shabnam Behzad, Amir Zeldes

pp. 50-56 DOI: 10.63317/2uksnupeninw
8

Streaming Language-Specific Twitter Data with Optimal Keywords

Tim Kreutz, Walter Daelemans

pp. 57-64 DOI: 10.63317/2cxom8fhdgb5

Showing all 8 papers

LREC Proceedings• © ELRA •2026
All LREC proceedings (including proceedings from workshops) are licenced under CC-BY-NC-4.0, the Creative Commons Attribution-NonCommercial 4.0 International License .
Legal Mentions • Data Protection