Back to Main Conference 2018
LREC 2018main

ILCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/2meydjiquiv4

Abstract

The iLCM project pursues the development of an integrated research environment for the analysis of structured and unstructured data in a “Software as a Service” architecture (SaaS). The research environment addresses requirements for the quantitative evaluation of large amounts of qualitative data with text mining methods as well as requirements for the reproducibility of data-driven research designs in the social sciences. For this, the iLCM research environment comprises two central components. First, the Leipzig Corpus Miner (LCM), a decentralized SaaS application for the analysis of large amounts of news texts developed in a previous Digital Humanities project. Second, the text mining tools implemented in the LCM are extended by an “Open Research Computing” (ORC) environment for executable script documents, so-called “notebooks”. This novel integration allows to combine generic, high-performance methods to process large amounts of unstructured text data and with individual program scripts to address specific research requirements in computational social science and digital humanities. ilcm.informatik.uni-leipzig.de

Details

Paper ID
lrec2018-main-209
Pages
N/A
BibKey
niekler-etal-2018-ilcm
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 12 May 2018

Authors

  • AN

    Andreas Niekler

  • AB

    Arnim Bleier

  • CK

    Christian Kahmann

  • LP

    Lisa Posch

  • GW

    Gregor Wiedemann

  • KE

    Kenan Erdogan

  • GH

    Gerhard Heyer

  • MS

    Markus Strohmaier

Links