Summary of the paper

Title The GeneReg Corpus for Gene Expression Regulation Events ― An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability
Authors Ekaterina Buyko, Elena Beisswanger and Udo Hahn
Abstract Despite the large variety of corpora in the biomedical domain their annotations differ in many respects, e.g., the coverage of different, highly specialized knowledge domains, varying degrees of granularity of targeted relations, the specificity of linguistic anchoring of relations and named entities in documents, etc. We here present GeneReg (Gene Regulation Corpus), the result of an annotation campaign led by the Jena University Language & Information Engineering (JULIE) Lab. The GeneReg corpus consists of 314 abstracts dealing with the regulation of gene expression in the model organism E. coli. Our emphasis in this paper is on the compatibility of the GeneReg corpus with the alternative Genia event corpus and with several in-domain and out-of-domain lexical resources, e.g., the Specialist Lexicon, FrameNet, and WordNet. The links we established from the GeneReg corpus to these external resources will help improve the performance of the automatic relation extraction engine JREx trained and evaluated on GeneReg.
Topics Corpus (creation, annotation, etc.), Knowledge Discovery/Representation, Information Extraction, Information Retrieval
Full paper The GeneReg Corpus for Gene Expression Regulation Events ― An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability
Slides -
Bibtex @InProceedings{BUYKO10.407,
  author = {Ekaterina Buyko and Elena Beisswanger and Udo Hahn},
  title = {The GeneReg Corpus for Gene Expression Regulation Events ― An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA