Back to Main Conference 2016
LREC 2016main
A Document Repository for Social Media and Speech Conversations
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)
Abstract
We present a successfully implemented document repository REST service for flexible SCRUD (search, crate, read, update, delete) storage of social media conversations, using a GATE/TIPSTER-like document object model and providing a query language for document features. This software is currently being used in the SENSEI research project and will be published as open-source software before the project ends. It is, to the best of our knowledge, the first freely available, general purpose data repository to support large-scale multimodal (i.e., speech or text) conversation analytics.