Back to Workshops
Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-10)
LREC 2022 Workshop
undefined, undefined 20 June 2022 - 25 June 2022 6 papers
Show20per page
1
Challenges in Creating a Representative Corpus of Romanian Micro-Blogging Text
Vasile Pais, Maria Mitrofan, Verginica Barbu Mititelu, Elena Irimia, Roxana Micu, Carol Luca Gasan
pp. 1-7 DOI: 10.63317/2epbq7y3x3vc
2
Exhaustive Indexing of PubMed Records with Medical Subject Headings
Modest von Korff
pp. 8-15 DOI: 10.63317/2rieh8zv2eno
3
UDeasy: a Tool for Querying Treebanks in CoNLL-U Format
Luca Brigada Villa
pp. 16-19 DOI: 10.63317/4d8fexumos7v
4
Matrix and Double-Array Representations for Efficient Finite State Tokenization
Nils Diewald
pp. 20-26 DOI: 10.63317/4n2t2f6a5p34
5
Count-Based and Predictive Language Models for Exploring DeReKo
Peter Fankhauser, Marc Kupietz
pp. 27-31 DOI: 10.63317/5b9472vjkdm4
6
“The word expired when that world awoke.” New Challenges for Research with Large Text Corpora and Corpus-Based Discourse Studies in Totalitarian Times
Hanno Biber
pp. 32-35 DOI: 10.63317/4tnkk69wrjuu
Showing all 6 papers