 |
INTRODUCTORY MESSAGES:
INVITED TALK:
KEYNOTES SPEECHES:
PANEL:
SESSIONS: Browse articles of the conference sorted by session number
|
Session O2 - LR Infrastructures and Standards |
Chairperson : Christopher Cieri |
11:35-11:55 |
Lars Borin, Markus Forsberg and Dimitrios Kokkinakis |
Diabase: Towards a Diachronic BLARK in Support of Historical Studies |
11:55-12:15 |
Daan Broeder, Marc Kemps-Snijders, Dieter Van Uytvanck, Menzo Windhouwer, Peter Withers, Peter Wittenburg and Claus Zinn |
A Data Category Registry- and Component-based Metadata Framework |
12:15-12:35 |
Jan Odijk |
The CLARIN-NL Project |
12:35-12:55 |
Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary and Nasredine Semmar |
MLIF : A Metamodel to Represent and Exchange Multilingual Textual Information |
12:55-13:15 |
Peter Wittenburg, Nuria Bel, Lars Borin, Gerhard Budin, Nicoletta Calzolari, Eva Hajicova, Kimmo Koskenniemi, Lothar Lemnitzer, Bente Maegaard, Maciej Piasecki, Jean-Marie Pierrel, Stelios Piperidis, Inguna Skadina, Dan Tufis, Remco van Veenendaal, Tamas Váradi and Martin Wynne |
Resource and Service Centres as the Backbone for a Sustainable Service Infrastructure |
|
Session O4 - Text-to-Speech Corpora |
Chairperson : Harald Höge |
11:35-11:55 |
Didier Cadic, Cédric Boidin and Christophe d'Alessandro |
Towards Optimal TTS Corpora |
11:55-12:15 |
Michael Pucher, Friedrich Neubarth, Volker Strom, Sylvia Moosmüller, Gregor Hofer, Christian Kranzler, Gudrun Schuchmann and Dietmar Schabus |
Resources for Speech Synthesis of Viennese Varieties |
12:15-12:35 |
Pavel Skrelin, Nina Volskaya, Daniil Kocharov, Karina Evgrafova, Olga Glotova and Vera Evdokimova |
A Fully Annotated Corpus of Russian Speech |
12:35-12:55 |
Francisco Campillo, Daniela Braga, Ana Belén Mourín, Carmen García-Mateo, Pedro Silva, Miguel Sales Dias and Francisco Méndez |
Building High Quality Databases for Minority Languages such as Galician |
12:55-13:15 |
Alexandros Lazaridis, Theodoros Kostoulas, Todor Ganchev, Iosif Mporas and Nikos Fakotakis |
Vergina: A Modern Greek Speech Database for Speech Synthesis |
|
Session O8 - Sign Language |
Chairperson : Eleni Efthimiou |
14:45-15:05 |
Annelies Braffort, Laurence Bolot, Emilie Chételat-Pelé, Annick Choisier, Maxime Delorme, Michael Filhol, Jérémie Segouat, Cyril Verrecchia, Flora Badin and Nadège Devos |
Sign Language Corpora for Analysis, Processing and Evaluation |
15:05-15:25 |
Onno Crasborn |
The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources |
15:25-15:45 |
Kyle Duarte and Sylvie Gibet |
Heterogeneous Data Sources for Signed Language Analysis and Synthesis: The SignCom Project |
15:45-16:05 |
Antonio Balvet, Cyril Courtin, Dominique Boutet, Christian Cuxac, Ivani Fusellier-Souza, Brigitte Garcia, Marie-Thérèse LHuillier and Marie-Anne Sallandre |
The Creagest Project: a Digitized and Annotated Corpus for French Sign Language (LSF) and Natural Gestural Languages |
16:05-16:25 |
Philippe Dreuw, Hermann Ney, Gregorio Martinez, Onno Crasborn, Justus Piater, Jose Miguel Moya and Mark Wheatley |
The SignSpeak Project - Bridging the Gap Between Signers and Speakers |
|
Session O21 - Emotion, Sentiment |
Chairperson : Inma Hernaez Rioja |
11:45-12:05 |
Alexander Schmitt, Gregor Bertrand, Tobias Heinroth, Wolfgang Minker and Jackson Liscombe |
WITcHCRafT: A Workbench for Intelligent exploraTion of Human ComputeR conversaTions |
12:05-12:25 |
Ulli Waltinger |
GermanPolarityClues: A Lexical Resource for German Sentiment Analysis |
12:25-12:45 |
Björn Schuller, Riccardo Zaccarelli, Nicolas Rollet and Laurence Devillers |
CINEMO ― A French Spoken Language Resource for Complex Emotions: Facts and Baselines |
12:45-13:05 |
Gregor Bertrand, Florian Nothdurft, Steffen Walter, Andreas Scheck, Henrik Kessler and Wolfgang Minker |
Towards Investigating Effective Affective Dialogue Strategies |
|
Session O22 - Corpus Building, Annotation and Methodology |
Chairperson : Dimitrios Kokkinasis |
11:45-12:05 |
Martin Volk, Noah Bubenhofer, Adrian Althaus, Maya Bangerter, Lenz Furrer and Beni Ruef |
Challenges in Building a Multilingual Alpine Heritage Corpus |
12:05-12:25 |
Marc Carmen, Paul Felt, Robbie Haertel, Deryle Lonsdale, Peter McClanahan, Owen Merkling, Eric Ringger and Kevin Seppi |
Tag Dictionaries Accelerate Manual Annotation |
12:25-12:45 |
Dan Flickinger, Stephan Oepen and Gisle Ytrestøl |
WikiWoods: Syntacto-Semantic Annotation for English Wikipedia |
12:45-13:05 |
Hai Zhao, Yan Song and Chunyu Kit |
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method |
|
Session O23 - Broadcast News |
Chairperson : Carmen García-Mateo |
11:45-12:05 |
Luis Javier Rodríguez-Fuentes, Mikel Penagarikano, Germán Bordel, Amparo Varona and Mireia Díez |
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems |
12:05-12:25 |
Yannick Estève, Thierry Bazillon, Jean-Yves Antoine, Frédéric Béchet and Jérôme Farinas |
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News |
12:25-12:45 |
Kwanchiva Saykham, Ananlada Chotimongkol and Chai Wutiwiwatchai |
Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System |
12:45-13:05 |
Doris Baum, Daniel Schneider, Rolf Bardeli, Jochen Schwenninger, Barbara Samlowski, Thomas Winkler and Joachim Köhler |
DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain |
|
Session O26 - Corpus Tools |
Chairperson : Martha Palmer |
14:55-15:15 |
Dekang Lin, Kenneth Church, Heng Ji, Satoshi Sekine, David Yarowsky, Shane Bergsma, Kailash Patil, Emily Pitler, Rachel Lathbury, Vikram Rao, Kapil Dalwani and Sushant Narsale |
New Tools for Web-Scale N-grams |
15:15-15:35 |
Verena Henrich and Erhard Hinrichs |
GernEdiT - The GermaNet Editing Tool |
15:35-15:55 |
Véronika Lux-Pogodalla, Dominique Besagni and Karën Fort |
FastKwic, an Intelligent Concordancer Using FASTR |
15:55-16:15 |
Giuseppe Attardi, Stefano Dei Rossi, Giulia Di Pietro, Alessandro Lenci, Simonetta Montemagni and Maria Simi |
A Resource and Tool for Super-sense Tagging of Italian Texts |
16:15-16:35 |
Richard Schwarz, Hinrich Schütze, Fabienne Martin and Achim Stein |
Identification of Rare & Novel Senses Using Translations in a Parallel Corpus |
|
Session O31 - Multimodal Annotation |
Chairperson : Jean Claude Martin |
16:55-17:15 |
Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria and David Traum |
Towards an ISO Standard for Dialogue Act Annotation |
17:15-17:35 |
Volha Petukhova and Harry Bunt |
Towards an Integrated Scheme for Semantic Annotation of Multimodal Dialogue Data |
17:35-17:55 |
Pierre Tirilly, Vincent Claveau and Patrick Gros |
News Image Annotation on a Large Parallel Text-image Corpus |
17:55-18:15 |
Isabella Poggi, Francesca D'Errico and Laura Vincze |
Types of Nods. The Polysemy of a Social Signal |
|
Session O33 - Question Answering |
Chairperson : Gilles Adda |
18:20-18:40 |
Guillaume Bernard, Sophie Rosset, Martine Adda-Decker and Olivier Galibert |
A Question-answer Distance Measure to Investigate QA System Progress |
18:40-19:00 |
Peter Adolphs, Xiwen Cheng, Tina Klüwer, Hans Uszkoreit and Feiyu Xu |
Question Answering Biographic Information and Social Network Powered by the Semantic Web |
19:00-19:20 |
Nicolas Moreau, Olivier Hamon, Djamel Mostefa, Sophie Rosset, Olivier Galibert, Lori Lamel, Jordi Turmo, Pere R. Comas, Paolo Rosso, Davide Buscaldi and Khalid Choukri |
Evaluation Protocol and Tools for Question-Answering on Speech Transcripts |
19:20-19:40 |
Pamela Forner, Danilo Giampiccolo, Bernardo Magnini, Anselmo Peñas, Álvaro Rodrigo and Richard Sutcliffe |
Evaluating Multilingual Question Answering Systems at CLEF |
|
Session O35 - Disordered Speech Corpus |
Chairperson : Florian Schiel |
18:20-18:40 |
Oscar Saz, Eduardo Lleida, Carlos Vaquero and W.-Ricardo Rodríguez |
The Alborada-I3A Corpus of Disordered Speech |
18:40-19:00 |
Jakob Schou Pedersen and Lars Bo Larsen |
A Speech Corpus for Dyslexic Reading Training |
19:00-19:20 |
Caroline Williams, Andrew Thwaites, Paula Buttery, Jeroen Geertzen, Billi Randall, Meredith Shafto, Barry Devereux and Lorraine Tyler |
The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals |
19:20-19:40 |
Cécile Fougeron, Lise Crevier-Buchman, Corinne Fredouille, Alain Ghio, Christine Meunier, Claude Chevrie-Muller, Jean-Francois Bonastre, Antonia Colazo-Simon, Céline Delooze, Danielle Duez, Cédric Gendrot, Thierry Legou, Nathalie Lévêque, Claire Pillot-Loiseau, Serge Pinto, Gilles Pouchoulin, Danièle Robert, Jacqueline Vaissière, François Viallet and Coralie Vincent |
The DesPho-APaDy Project: Developing an Acoustic-phonetic Characterization of Dysarthric Speech in French |
|
Session O38 - Corpus Tools |
Chairperson : Oi Yee Kwong |
9:45-10:05 |
Ting Qian, Kristy Hollingshead, Su-youn Yoon, Kyoung-young Kim and Richard Sproat |
A Python Toolkit for Universal Transliteration |
10:05-10:25 |
Sowmya V. B., Monojit Choudhury, Kalika Bali, Tirthankar Dasgupta and Anupam Basu |
Resource Creation for Training and Testing of Transliteration Systems for Indian Languages |
10:25-10:45 |
Fabienne Fritzinger, Marion Weller and Ulrich Heid |
A Survey of Idiomatic Preposition-Noun-Verb Triples on Token Level |
10:45-11:05 |
Meghan Lammie Glenn, Stephanie M. Strassel, Haejoong Lee, Kazuaki Maeda, Ramez Zakhary and Xuansong Li |
Transcription Methods for Consistency, Volume and Efficiency |
11:05-11:25 |
Muhammad Kamran Malik, Tafseer Ahmed, Sebastian Sulger, Tina Bögel, Atif Gulzar, Ghulam Raza, Sarmad Hussain and Miriam Butt |
Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar |
|
Session O41 - Multiword Expressions and Collocations |
Chairperson : Benjamin Tsou |
11:45-12:05 |
Marion Weller and Ulrich Heid |
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features |
12:05-12:25 |
Stefania Spina |
The Dictionary of Italian Collocations: Design and Integration in an Online Learning Environment |
12:25-12:45 |
Margarita Alonso Ramos, Leo Wanner, Orsolya Vincze, Gerard Casamayor del Bosque, Nancy Vázquez Veiga, Estela Mosqueira Suárez and Sabela Prieto González |
Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora |
12:45-13:05 |
Ulrich Heid, Fabienne Fritzinger, Erhard Hinrichs, Marie Hinrichs and Thomas Zastrow |
Term and Collocation Extraction by Means of Complex Linguistic Web Services |
13:05-13:25 |
Francesca Bonin, Felice Dell'Orletta, Simonetta Montemagni and Giulia Venturi |
A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora |
|
Session O43 - Speech Corpus Processing |
Chairperson : Catia Cucchiarini |
11:45-12:05 |
Philippe Blache, Roxane Bertrand, Mathilde Guardiola, Marie-Laure Guénot, Christine Meunier, Irina Nesterenko, Berthille Pallaud, Laurent Prévot, Béatrice Priego-Valverde and Stéphane Rauzy |
The OTIM Formal Annotation Model: A Preliminary Step before Annotation Scheme |
12:05-12:25 |
Grégory Senay, Georges Linarès, Benjamin Lecouteux, Stanislas Oger and Thierry Michel |
Transcriber Driving Strategies for Transcription Aid System |
12:25-12:45 |
Rena Nemoto, Martine Adda-Decker and Jacques Durand |
Word Boundaries in French: Evidence from Large Speech Corpora |
12:45-13:05 |
Christina Leitner, Martin Schickbichler and Stefan Petrik |
Example-Based Automatic Phonetic Transcription |
13:05-13:25 |
Brigitte Bigi, Christine Meunier, Irina Nesterenko and Roxane Bertrand |
Automatic Detection of Syllable Boundaries in Spontaneous Speech |
|
Session P1 - Anaphora, Coreference and Evaluation |
Chair : Antonio Pareja-Lora |
11:35-13:15 |
Ruud Koolen and Emiel Krahmer |
The D-TUNA Corpus: A Dutch Dataset for the Evaluation of Referring Expression Generation Algorithms |
11:35-13:15 |
Azad Abad, Luisa Bentivogli, Ido Dagan, Danilo Giampiccolo, Shachar Mirkin, Emanuele Pianta and Asher Stern |
A Resource for Investigating the Impact of Anaphora and Coreference on Inference. |
11:35-13:15 |
Cristina Nicolae, Gabriel Nicolae and Kirk Roberts |
C-3: Coherence and Coreference Corpus |
11:35-13:15 |
Claudiu Mihăilă, Iustina Ilisei and Diana Inkpen |
Romanian Zero Pronoun Distribution: A Comparative Study |
11:35-13:15 |
Marta Recasens, Eduard Hovy and M. Antònia Martí |
A Typology of Near-Identity Relations for Coreference (NIDENT) |
11:35-13:15 |
Kepa Joseba Rodríguez, Francesca Delogu, Yannick Versley, Egon W. Stemle and Massimo Poesio |
Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus |
11:35-13:15 |
Samuel Broscheit, Simone Paolo Ponzetto, Yannick Versley and Massimo Poesio |
Extending BART to Provide a Coreference Resolution System for German |
11:35-13:15 |
Jiří Mírovský, Petr Pajas and Anna Nedoluzhko |
Annotation Tool for Extended Textual Coreference and Bridging Anaphora |
11:35-13:15 |
Petya Osenova, Laska Laskova and Kiril Simov |
Exploring Co-Reference Chains for Concept Annotation of Domain Texts |
11:35-13:15 |
Heather Simpson, Stephanie Strassel, Robert Parker and Paul McNamee |
Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population |
|
Session P2 - Tools, Systems and Evaluation |
Chair : Marc Verhagen |
11:35-13:15 |
Athanasios Karasimos and Evanthia Petropoulou |
A Crash Test with Linguistica in Modern Greek: The Case of Derivational Affixes and Bound Stems |
11:35-13:15 |
Anil Kumar Singh and Bharat Ram Ambati |
An Integrated Digital Tool for Accessing Language Resources |
11:35-13:15 |
Paul Felt, Owen Merkling, Marc Carmen, Eric Ringger, Warren Lemmon, Kevin Seppi and Robbie Haertel |
CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development |
11:35-13:15 |
Rüdiger Gleim and Alexander Mehler |
Computational Linguistics for Mere Mortals - Powerful but Easy-to-use Linguistic Processing for Scientists in the Humanities |
11:35-13:15 |
Bernd Bohnet and Leo Wanner |
Open Soucre Graph Transducer Interpreter and Grammar Development Environment |
11:35-13:15 |
Federico Sangati, Willem Zuidema and Rens Bod |
Efficiently Extract Rrecurring Tree Fragments from Large Treebanks |
11:35-13:15 |
José João Almeida, André Santos and Alberto Simões |
Bigorna -- A Toolkit for Orthography Migration Challenges |
11:35-13:15 |
Carl Christensen, Ross Hendrickson and Deryle Lonsdale |
Principled Construction of Elicited Imitation Tests |
11:35-13:15 |
Jan Jona Javoršek and Tomaž Erjavec |
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies |
11:35-13:15 |
Peter Nabende |
Applying a Dynamic Bayesian Network Framework to Transliteration Identification |
|
Session P3 - Lexical Resources |
Chair : Anna Braasch |
11:35-13:15 |
Adrien Lardilleux, Julien Gosme and Yves Lepage |
Bilingual Lexicon Induction: Effortless Evaluation of Word Alignment Tools and Production of Resources for Improbable Language Pairs |
11:35-13:15 |
Akira Utsumi |
Exploring the Relationship between Semantic Spaces and Semantic Relations |
11:35-13:15 |
C. Anton Rytting, Paul Rodrigues, Tim Buckwalter, David Zajic, Bridget Hirsch, Jeff Carnes, Nathanael Lynn, Sarah Wayland, Chris Taylor, Jason White, Charles Blake III, Evelyn Browne, Corey Miller and Tristan Purvis |
Error Correction for Arabic Dictionary Lookup |
11:35-13:15 |
Noureddine Loukil, Kais Haddar and Abdelmajid Benhamadou |
A Syntactic Lexicon for Arabic Verbs |
11:35-13:15 |
Amit Kirschenbaum and Shuly Wintner |
A General Method for Creating a Bilingual Transliteration Dictionary |
11:35-13:15 |
Thomas Proisl and Besim Kabashi |
Using High-Quality Resources in NLP: The Valency Dictionary of English as a Resource for Left-Associative Grammars |
11:35-13:15 |
Grigori Sidorov, Alberto Barrón-Cedeño and Paolo Rosso |
English-Spanish Large Statistical Dictionary of Inflectional Forms |
11:35-13:15 |
Majdi Sawalha and Eric Atwell |
Constructing and Using Broad-coverage Lexical Resource for Enhancing Morphological Analysis of Arabic |
11:35-13:15 |
Rania Al-Sabbagh and Roxana Girju |
Mining the Web for the Induction of a Dialectical Arabic Lexicon |
11:35-13:15 |
Benoît Sagot, Laurence Danlos and Rosa Stern |
A Lexicon of French Quotation Verbs for Automatic Quotation Extraction |
11:35-13:15 |
Benoît Sagot and Géraldine Walther |
A Morphological Lexicon for the Persian Language |
11:35-13:15 |
Jana Šindlerová and Ondřej Bojar |
Building a Bilingual ValLex Using Treebank Token Alignment: First Observations |
11:35-13:15 |
Óscar Ferrández, Michael Ellsworth, Rafael Muñoz and Collin F. Baker |
Aligning FrameNet and WordNet based on Semantic Neighborhoods |
11:35-13:15 |
Anca Dinu |
Building a Generative Lexicon for Romanian |
11:35-13:15 |
Hiroaki SATO |
How FrameSQL Shows the Japanese FrameNet Data |
11:35-13:15 |
Svetla Koeva |
Lexicon and Grammar in Bulgarian FrameNet |
11:35-13:15 |
Bento Carlos Dias-da-Silva and Ariani Di-Felippo |
REBECA: Turning WordNet Databases into ""Ontolexicons"" |
11:35-13:15 |
Karel Pala, Christiane Fellbaum and Sonja Bosch |
Lexical Resources for Noun Compounds in Czech, English and Zulu |
11:35-13:15 |
Michael Gasser |
Expanding the Lexicon for a Resource-Poor Language Using a Morphological Analyzer and a Web Crawler |
11:35-13:15 |
Gerard de Melo and Gerhard Weikum |
Providing Multilingual, Multimodal Answers to Lexical Database Queries |
11:35-13:15 |
Sabine Ploux, Armelle Boussidan and Hyungsuk Ji |
The Semantic Atlas: an Interactive Model of Lexical Representation |
|
Session P4 - Web Services |
Chair : Bruno Cartoni |
14:45-16:25 |
Adam Funk and Kalina Bontcheva |
Ontology-Based Categorization of Web Services with Machine Learning |
14:45-16:25 |
Marie Hinrichs, Thomas Zastrow and Erhard Hinrichs |
WebLicht: Web-based LRT Services in a Distributed eScience Infrastructure |
14:45-16:25 |
Ulrich Heid, Helmut Schmid, Kerstin Eckart and Erhard Hinrichs |
A Corpus Representation Format for Linguistic Web Services: The D-SPIN Text Corpus Format and its Relationship with ISO Standards |
14:45-16:25 |
Donghui Lin, Yoshiaki Murakami, Toru Ishida, Yohei Murakami and Masahiro Tanaka |
Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes |
14:45-16:25 |
Bora Savas, Yoshihiko Hayashi, Monica Monachini, Claudia Soria and Nicoletta Calzolari |
An LMF-based Web Service for Accessing WordNet-type Semantic Lexicons |
14:45-16:25 |
Virach Sornlertlamvanich, Thatsanee Charoenporn and Hitoshi Isahara |
Language Resource Management System for Asian WordNet Collaboration and Its Web Service Application |
|
Session P5 - Named Entity Recognition |
Chair : Valia Kordoni |
14:45-16:25 |
Rita Marinelli |
Lexical Resources and Ontological Classifications for the Recognition of Proper Names Sense Extension |
14:45-16:25 |
Damien Nouvel, Jean-Yves Antoine, Nathalie Friburger and Denis Maurel |
An Analysis of the Performances of the CasEN Named Entities Recognition System in the Ester2 Evaluation Campaign |
14:45-16:25 |
Olivier Galibert, Sophie Rosset, Xavier Tannier and Fanny Grandry |
Hybrid Citation Extraction from Patents |
14:45-16:25 |
Bart Desmet and Véronique Hoste |
Towards a Balanced Named Entity Corpus for Dutch |
14:45-16:25 |
Satoshi Sato and Sayoko Kaide |
A Person-Name Filter for Automatic Compilation of Bilingual Person-Name Lexicons |
14:45-16:25 |
Michael Tanenblatt, Anni Coden and Igor Sominsky |
The ConceptMapper Approach to Named Entity Recognition |
14:45-16:25 |
Grzegorz Chrupała and Dietrich Klakow |
A Named Entity Labeler for German: Exploiting Wikipedia and Distributional Clusters |
14:45-16:25 |
Keith J. Miller, Sarah McLeod, Elizabeth Schroeder, Mark Arehart, Kenneth Samuel, James Finley, Vanesa Jurica and John Polk |
Improving Personal Name Search in the TIGR System |
14:45-16:25 |
Wajdi Zaghouani, Bruno Pouliquen, Mohamed Ebrahim and Ralf Steinberger |
Adapting a resource-light highly multilingual Named Entity Recognition system to Arabic |
14:45-16:25 |
Dietrich Rebholz-Schuhmann, Antonio José Jimeno-Yepes, Erik M. van Mulligen, Ning Kang, Jan Kors, David Milward, Peter Corbett, Ekaterina Buyko, Katrin Tomanek, Elena Beisswanger and Udo Hahn |
The CALBC Silver Standard Corpus for Biomedical Named Entities ― A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers |
14:45-16:25 |
Ana Cristina Mendes, Luísa Coheur and Paula Vaz Lobo |
Named Entity Recognition in Questions: Towards a Golden Collection |
|
Session P6 - Pronunciation Variants |
Chair : Fernando Fernández Martínez |
14:45-16:25 |
Alexander Schmitt, Tim Polzehl, Wolfgang Minker and Jackson Liscombe |
The Influence of the Utterance Length on the Recognition of Aged Voices |
14:45-16:25 |
Nikos Tsourakis, Agnes Lisowska, Manny Rayner and Pierrette Bouillon |
Examining the Effects of Rephrasing User Input on Two Mobile Spoken Language Systems |
14:45-16:25 |
Damjan Vlaj, Aleksandra Zögling Markuš, Marko Kos and Zdravko Kačič |
Acquisition and Annotation of Slovenian Lombard Speech Database |
14:45-16:25 |
Natalie D. Snoeren, Martine Adda-Decker and Gilles Adda |
The Study of Writing Variants in an Under-resourced Language: Some Evidence from Mobile N-Deletion in Luxembourgish |
14:45-16:25 |
Jean-Luc Rouas, Mayumi Beppu and Martine Adda-Decker |
Comparison of Spectral Properties of Read, Prepared and Casual Speech in French |
14:45-16:25 |
Marijn Schraagen and Gerrit Bloothooft |
Evaluating Repetitions, or how to Improve your Multilingual ASR System by doing Nothing |
14:45-16:25 |
Elena Grishina, Svetlana Savchuk and Alexej Poljakov |
Design and Data Collection for the Accentological Corpus of the Russian Language |
14:45-16:25 |
Siim Orasmaa, Reina Käärik, Jaak Vilo and Tiit Hennoste |
Information Retrieval of Word Form Variants in Spoken Language Corpora Using Generalized Edit Distance |
|
Session P7 - Multiword Expressions and Collocations |
Chair : Beatrice Daille |
14:45-16:25 |
Meng Wang, Chu-Ren Huang, Shiwen Yu and Weiwei Sun |
Automatic Acquisition of Chinese Novel Noun Compounds |
14:45-16:25 |
Luka Nerima, Eric Wehrli and Violeta Seretan |
A Recursive Treatment of Collocations |
14:45-16:25 |
Caroline Sporleder, Linlin Li, Philip Gorinski and Xaver Koch |
Idioms in Context: The IDIX Corpus |
14:45-16:25 |
Laura Street, Nathan Michalov, Rachel Silverstein, Michael Reynolds, Lurdes Ruela, Felicia Flowers, Angela Talucci, Priscilla Pereira, Gabriella Morgon, Samantha Siegel, Marci Barousse, Antequa Anderson, Tashom Carroll and Anna Feldman |
Like Finding a Needle in a Haystack: Annotating the American National Corpus for Idiomatic Expressions |
14:45-16:25 |
Andrea Zaninello and Malvina Nissim |
Creation of Lexical Resources for a Characterisation of Multiword Expressions in Italian |
14:45-16:25 |
Carlos Ramisch, Aline Villavicencio and Christian Boitet |
mwetoolkit: a Framework for Multiword Expression Identification |
14:45-16:25 |
Junko Kubo, Keita Tsuji and Shigeo Sugimoto |
Automatic Term Recognition Based on the Statistical Differences of Relative Frequencies in Different Corpora |
|
Session P10 - Morphology |
Chair : Miriam Butt |
16:45-18:05 |
Gertrud Faaß, Ulrich Heid and Helmut Schmid |
Design and Application of a Gold Standard for Morphological Analysis: SMOR as an Example of Morphological Evaluation |
16:45-18:05 |
Niraj Aswani and Robert Gaizauskas |
Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages |
16:45-18:05 |
Cvetana Krstev, Ranka Stanković and Duško Vitas |
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration |
16:45-18:05 |
Çağrı Çöltekin |
A Freely Available Morphological Analyzer for Turkish |
16:45-18:05 |
Iñaki Alegria, Garbiñe Aranbarri, Klara Ceberio, Gorka Labaka, Bittor Laskurain and Ruben Urizar |
A Morphological Processor Based on Foma for Biscayan (a Basque dialect) |
16:45-18:05 |
Yugo Murawaki and Sadao Kurohashi |
Online Japanese Unknown Morpheme Detection using Orthographic Variation |
16:45-18:05 |
Bruno Cartoni and Marie-Aude Lefer |
The MuLeXFoR Database: Representing Word-Formation Processes in a Multilingual Lexicographic Environment |
16:45-18:05 |
Ting-Hao Huang, Lun-Wei Ku and Hsin-Hsi Chen |
Predicting Morphological Types of Chinese Bi-Character Words by Machine Learning Approaches |
16:45-18:05 |
Mohamed Altantawy, Nizar Habash, Owen Rambow and Ibrahim Saleh |
Morphological Analysis and Generation of Arabic Nouns: A Morphemic Functional Approach |
16:45-18:05 |
Mehrnoush Shamsfard, Hoda Sadat Jafari and Mahdi Ilbeygi |
STeP-1: A Set of Fundamental Tools for Persian Text Processing |
16:45-18:05 |
Sara Tonelli, Emanuele Pianta, Rodolfo Delmonte and Michele Brunelli |
VenPro: A Morphological Analyzer for Venetan |
|
Session P11 - Tools for Multimodal Corpus |
Chair : Katerina Pastra |
16:45-18:05 |
Nick Campbell and Akiko Tabata |
A Software Toolkit for Viewing Annotated Multimodal Data Interactively over the Web |
16:45-18:05 |
Nick Webb, David Benyon, Jay Bradley, Preben Hansen and Oil Mival |
Wizard of Oz Experiments for a Companion Dialogue System: Eliciting Companionable Conversation |
16:45-18:05 |
Volker Fritzsch, Stefan Scherer and Friedhelm Schwenker |
An Open Source Process Engine Framework for Realtime Pattern Recognition and Information Fusion Tasks |
16:45-18:05 |
Jens Allwood, Harald Hammarström, Andries Hendrikse, Mtholeni N. Ngcobo, Nozibele Nomdebevana, Laurette Pretorius and Mac van der Merwe |
Work on Spoken (Multimodal) Language Corpora in South Africa |
16:45-18:05 |
Eric Auer, Albert Russel, Han Sloetjes, Peter Wittenburg, Oliver Schreer, S. Masnieri, Daniel Schneider and Sebastian Tschöpel |
ELAN as Flexible Annotation Framework for Sound and Image Processing Detectors |
|
Session P12 - Language Resource Infrastructures |
Chair : Hamish Cunningham |
16:45-18:05 |
Claus Zinn, Peter Wittenburg and Jacquelijn Ringersma |
An Evolving eScience Environment for Research Data in Linguistics |
16:45-18:05 |
Dieter Van Uytvanck, Claus Zinn, Daan Broeder, Peter Wittenburg and Mariano Gardellini |
Virtual Language Observatory: The Portal to the Language Resources and Technology Universe |
16:45-18:05 |
Adam Kilgarriff, Siva Reddy, Jan Pomikálek and Avinesh PVS |
A Corpus Factory for Many Languages |
16:45-18:05 |
Erhard Hinrichs, Verena Henrich and Thomas Zastrow |
Sustainability of Linguistic Data and Analysis in the Context of a Collaborative eScience Environment |
16:45-18:05 |
Armando Stellato, Heiko Stoermer, Stefano Bortoli, Noemi Scarpato, Andrea Turbati, Paolo Bouquet and Maria Teresa Pazienza |
Maskkot ― An Entity-centric Annotation Platform |
16:45-18:05 |
Maite Melero, Gemma Boleda, Montse Cuadros, Cristina España-Bonet, Lluís Padró, Martí Quixal, Carlos Rodríguez and Roser Saurí |
Language Technology Challenges of a Small Language (Catalan) |
16:45-18:05 |
Lluís Padró, Miquel Collado, Samuel Reese, Marina Lloberes and Irene Castellón |
FreeLing 2.1: Five Years of Open-source Language Processing Tools |
16:45-18:05 |
Bartosz Broda, Michał Marcińczuk and Maciej Piasecki |
Building a Node of the Accessible Language Technology Infrastructure |
16:45-18:05 |
Peter Menke and Alexander Mehler |
The Ariadne System: A Flexible and Extensible Framework for the Modeling and Storage of Experimental Data in the Humanities. |
16:45-18:05 |
Nicoletta Calzolari, Claudia Soria, Riccardo Del Gratta, Sara Goggi, Valeria Quochi, Irene Russo, Khalid Choukri, Joseph Mariani and Stelios Piperidis |
The LREC Map of Language Resources and Technologies |
16:45-18:05 |
Nick Rizzolo and Dan Roth |
Learning Based Java for Rapid Development of NLP Systems |
16:45-18:05 |
Kazuaki Maeda, Haejoong Lee, Stephen Grimes, Jonathan Wright, Robert Parker, David Lee and Andrea Mazzucchi |
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation |
16:45-18:05 |
Thepchai Supnithi, Taneth Ruangrajitpakorn, Kanokorn Trakultaweekool and Peerachet Porkaew |
AutoTagTCG : A Framework for Automatic Thai CG Tagging |
16:45-18:05 |
Javier Couto, Helena Blancafort, Somara Seng, Nicolas Kuchmann-Beauger, Anass Talby and Claude de Loupy |
OAL: A NLP Architecture to Improve the Development of Linguistic Resources for NLP |
16:45-18:05 |
Girish Nath Jha |
The TDIL Program and the Indian Langauge Corpora Intitiative (ILCI) |
16:45-18:05 |
Stephanie Strassel, Dan Adams, Henry Goldberg, Jonathan Herr, Ron Keesing, Daniel Oblinger, Heather Simpson, Robert Schrag and Jonathan Wright |
The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks |
16:45-18:05 |
Adam Przepiórkowski, Rafał L. Górski, Marek Łaziński and Piotr Pęzik |
Recent Developments in the National Corpus of Polish |
16:45-18:05 |
Drahomíra ""johanka"" Spoustová, Miroslav Spousta and Pavel Pecina |
Building a Web Corpus of Czech |
16:45-18:05 |
Brigitte Jörg, Hans Uszkoreit and Alastair Burt |
LT World: Ontology and Reference Information Portal |
|
Session P13 - Subjectivity: Sentiments, Emotions, Opinions |
Chair : Silke Scheible |
18:10-19:30 |
Vassiliki Rentoumi, Stefanos Petrakis, Manfred Klenner, George A. Vouros and Vangelis Karkaletsis |
United we Stand: Improving Sentiment Analysis by Joining Machine Learning and Rule Based Methods |
18:10-19:30 |
Plaban Kr. Bhowmick, Anupam Basu and Pabitra Mitra |
Determining Reliability of Subjective and Multi-label Emotion Annotation through Novel Fuzzy Agreement Measure |
18:10-19:30 |
Aleksander Wawer |
Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning |
18:10-19:30 |
Patrick Paroubek, Alexander Pak and Djamel Mostefa |
Annotations for Opinion Mining Evaluation in the Industrial Context of the DOXA project |
18:10-19:30 |
Huan-An Kao and Hsin-Hsi Chen |
Comment Extraction from Blog Posts and Its Applications to Opinion Mining |
18:10-19:30 |
Sophia Yat Mei Lee, Ying Chen, Shoushan Li and Chu-Ren Huang |
Emotion Cause Events: Corpus Construction and Analysis |
18:10-19:30 |
Horacio Saggion and Adam Funk |
Interpreting SentiWordNet for Opinion Classification |
18:10-19:30 |
Polina Panicheva, John Cardiff and Paolo Rosso |
Personal Sense and Idiolect: Combining Authorship Attribution and Opinion Analysis |
18:10-19:30 |
Antonio Reyes, Martin Potthast, Paolo Rosso and Benno Stein |
Evaluating Humour Features on Web Comments |
18:10-19:30 |
Shu Zhang, Wenjie Jia, Yingju Xia, Yao Meng and Hao Yu |
Extracting Product Features and Sentiments from Chinese Customer Reviews |
18:10-19:30 |
Changqin Quan and Fuji Ren |
Automatic Annotation of Word Emotion in Sentences Based on Ren-CECps |
18:10-19:30 |
Bal Krishna Bal and Patrick Saint Dizier |
Towards Building Annotated Resources for Analyzing Opinions and Argumentation in News Editorials |
18:10-19:30 |
Irene Russo |
Discovering Polarity for Ambiguous and Objective Adjectives through Adverbial Modification |
18:10-19:30 |
Željko Agić, Nikola Ljubešić and Marko Tadić |
Towards Sentiment Analysis of Financial Texts in Croatian |
18:10-19:30 |
Robert Remus, Uwe Quasthoff and Gerhard Heyer |
SentiWS - A Publicly Available German-language Resource for Sentiment Analysis |
18:10-19:30 |
Stefan Scherer, Ingo Siegert, Lutz Bigalke and Sascha Meudt |
Developing an Expressive Speech Labeling Tool Incorporating the Temporal Characteristics of Emotion |
|
Session P17 - Semantic Annotation |
Chair : Satoshi Sato |
9:45-11:25 |
Antonio Balvet, Lucie Barque and Rafael Marín |
Building a Lexicon of French Deverbal Nouns from a Semantically Annotated Corpus |
9:45-11:25 |
Izaskun Aldezabal, María Jesús Aranzabe, Arantza Díaz de Ilarraza and Ainara Estarrona |
Building the Basque PropBank |
9:45-11:25 |
Samuel Reese, Gemma Boleda, Montse Cuadros, Lluís Padró and German Rigau |
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus |
9:45-11:25 |
Aina Peris, Mariona Taulé, Gemma Boleda and Horacio Rodríguez |
ADN-Classifier:Automatically Assigning Denotation Types to Nominalizations |
9:45-11:25 |
Roser Morante |
Descriptive Analysis of Negation Cues in Biomedical Texts |
9:45-11:25 |
Diana Santos and Cristina Mota |
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora |
9:45-11:25 |
Magali Sanches Duran, Marcelo Adriano Amâncio and Sandra Maria Aluísio |
Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building |
9:45-11:25 |
Stuart Moore, Sabine Buchholz and Anna Korhonen |
Annotating the Enron Email Corpus with Number Senses |
9:45-11:25 |
Suguru Matsuyoshi, Megumi Eguchi, Chitose Sao, Koji Murakami, Kentaro Inui and Yuji Matsumoto |
Annotating Event Mentions in Text with Modality, Focus, and Source Information |
9:45-11:25 |
Elisabetta Jezek and Valeria Quochi |
Capturing Coercions in Texts: a First Annotation Exercise |
9:45-11:25 |
Paula Vaz Lobo and David Martins de Matos |
Fairy Tale Corpus Organization Using Latent Semantic Mapping and an Item-to-item Top-n Recommendation Algorithm |
|
Session P18 - Corpus and Morphological Annotation |
Chair : Joan Soler Bou |
9:45-11:25 |
Antonio Pareja-Lora and Guadalupe Aguado de Cea |
Ontology-based Interoperation of Linguistic Tools for an Improved Lemma Annotation in Spanish |
9:45-11:25 |
Kikuo Maekawa, Makoto Yamazaki, Takehiko Maruyama, Masaya Yamaguchi, Hideki Ogura, Wakako Kashino, Toshinobu Ogiso, Hanae Koiso and Yasuharu Den |
Design, Compilation, and Preliminary Analyses of Balanced Corpus of Contemporary Written Japanese |
9:45-11:25 |
Bracha Nir, Brian MacWhinney and Shuly Wintner |
A Morphologically-Analyzed CHILDES Corpus of Hebrew |
9:45-11:25 |
Jarmila Panevová and Magda Ševčíková |
Annotation of Morphological Meanings of Verbs Revisited |
9:45-11:25 |
Seth Kulick, Ann Bies and Mohamed Maamouri |
Consistent and Flexible Integration of Morphological Annotation in the Arabic Treebank |
|
Session P19 - Applications of Speech Technology |
Chair : Norihide Kitaoka |
9:45-11:25 |
Justus Roux, Pieter Scholtz, Daleen Klop, Claus Povlsen, Bart Jongejan and Asta Magnusdottir |
Incorporating Speech Synthesis in the Development of a Mobile Platform for e-learning. |
9:45-11:25 |
Alejandro Abejón, Doroteo T. Toledano, Danilo Spada, González Victor and Daniel Hernández López |
A Study of the Influence of Speech Type on Automatic Language Recognition Performance |
9:45-11:25 |
Joseph Polifroni, Imre Kiss and Mark Adler |
Bootstrapping Named Entity Extraction for the Creation of Mobile Services |
9:45-11:25 |
Jesús Tomás, Alejandro Canovas, Jaime Lloret, Miguel García Pineda and Jose L. Abad |
Speech Translation in Pedagogical Environment Using Additional Sources of Knowledge |
9:45-11:25 |
Koichiro Honda and Tomoyosi Akiba |
Language Modeling Approach for Retrieving Passages in Lecture Audio Data |
9:45-11:25 |
Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao and Claudia Baur |
A Multilingual CALL Game Based on Speech Translation |
9:45-11:25 |
Iker Luengo, Eva Navas, Igor Odriozola, Ibon Saratxaga, Inmaculada Hernaez, Iñaki Sainz and Daniel Erro |
Modified LTSE-VAD Algorithm for Applications Requiring Reduced Silence Frame Misclassification |
9:45-11:25 |
Michal Gishri, Vered Silber-Varod and Ami Moyal |
Lexicon Design for Transcription of Spontaneous Voice Messages |
9:45-11:25 |
Kevin Walker, Christopher Caruso and Denise DiPersio |
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development |
|
Session P22 - Machine Translation and Evaluation |
Chair : |
11:45-13:05 |
Hercules Dalianis, Hao-chun Xing and Xin Zhang |
Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction |
11:45-13:05 |
Marta R. Costa-jussà, Mireia Farrús, José B. Mariño and José A. R. Fonollosa |
Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems |
11:45-13:05 |
Marta R. Costa-jussà and José A. R. Fonollosa |
Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System |
11:45-13:05 |
Maxim Khalilov, José A. R. Fonollosa, Inguna Skadina, Edgars Brālītis and Lauma Pretkalnina |
Towards Improving English-Latvian Translation: A System Comparison and a New Rescoring Feature |
11:45-13:05 |
Yanli Sun |
Mining the Correlation between Human and Automatic Evaluation at Sentence Level |
11:45-13:05 |
Christian Federmann |
Appraise: An Open-Source Toolkit for Manual Phrase-Based Evaluation of Translations |
11:45-13:05 |
Olivier Hamon |
Is my Judge a good One? |
11:45-13:05 |
Mark Fishel and Harri Kirik |
Linguistically Motivated Unsupervised Segmentation for Machine Translation |
11:45-13:05 |
Yu Chen and Andreas Eisele |
Integrating a Rule-based with a Hierarchical Translation System |
11:45-13:05 |
Aurélien Max, Josep Maria Crego and François Yvon |
Contrastive Lexical Evaluation of Machine Translation |
11:45-13:05 |
Yiou Wang, Kiyotaka Uchimoto, Junichi Kazama, Canasai Kruengkrai and Kentaro Torisawa |
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units |
11:45-13:05 |
Masaki Murata, Tomohiro Ohno, Shigeki Matsubara and Yasuyoshi Inagaki |
Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation |
11:45-13:05 |
Ondřej Bojar, Pavel Straňák and Daniel Zeman |
Data Issues in English-to-Hindi Machine Translation |
11:45-13:05 |
Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro and Suguru Matsuyoshi |
Utilizing Semantic Equivalence Classes of Japanese Functional Expressions in Translation Rule Acquisition from Parallel Patent Sentences |
11:45-13:05 |
Niraj Aswani and Robert Gaizauskas |
English-Hindi Transliteration using Multiple Similarity Metrics |
|
Session P23 - Corpora and Treebanks, Grammar and Syntax |
Chair : Patrick Saint Dizier |
11:45-13:05 |
Cristina Bosco, Simonetta Montemagni, Alessandro Mazzei, Vincenzo Lombardo, Felice Dell'Orletta, Alessandro Lenci, Leonardo Lesmo, Giuseppe Attardi, Maria Simi, Alberto Lavelli, Johan Hall, Jens Nilsson and Joakim Nivre |
Comparing the Influence of Different Treebank Annotations on Dependency Parsing |
11:45-13:05 |
Olga Lyashevskaya |
Bank of Russian Constructions and Valencies |
11:45-13:05 |
Tomaž Erjavec, Darja Fišer, Simon Krek and Nina Ledinek |
The JOS Linguistically Tagged Corpus of Slovene |
11:45-13:05 |
António Branco, Francisco Costa, João Silva, Sara Silveira, Sérgio Castro, Mariana Avelãs, Clara Pinto and João Graça |
Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank |
11:45-13:05 |
Katarzyna Głowińska and Adam Przepiórkowski |
The Design of Syntactic Annotation Levels in the National Corpus of Polish |
11:45-13:05 |
Kais Dukes, Eric Atwell and Abdul-Baquee M. Sharaf |
Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank |
11:45-13:05 |
Jan Štěpánek and Petr Pajas |
Querying Diverse Treebanks in a Uniform Way |
11:45-13:05 |
Marie Mikulová and Jan Štěpánek |
Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank |
11:45-13:05 |
Marie Candito, Benoît Crabbé and Pascal Denis |
Statistical French Dependency Parsing: Treebank Conversion and First Results |
11:45-13:05 |
Marc Kupietz, Cyril Belica, Holger Keibel and Andreas Witt |
The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research |
11:45-13:05 |
Veronika Vincze, Dóra Szauter, Attila Almási, György Móra, Zoltán Alexin and János Csirik |
Hungarian Dependency Treebank |
11:45-13:05 |
Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow, Dipti Misra Sharma, Michael Tepper, Ashwini Vaidya and Fei Xia |
Empty Categories in a Hindi Treebank |
11:45-13:05 |
Jinho D. Choi, Claire Bonial and Martha Palmer |
Propbank Instance Annotation Guidelines Using a Dedicated Editor, Jubilee |
11:45-13:05 |
Hiroki Hanaoka, Hideki Mima and Jun'ichi Tsujii |
A Japanese Particle Corpus Built by Example-Based Annotation |
11:45-13:05 |
Stephen A. Boxwell and Chris Brew |
A Pilot Arabic CCGbank |
11:45-13:05 |
Simon Mille and Leo Wanner |
Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation |
11:45-13:05 |
Adriane Boyd |
EAGLE: an Error-Annotated Corpus of Beginning Learner German |
11:45-13:05 |
José M. García-Miguel, Gael Vaamonde and Fita González Domínguez |
ADESSE, a Database with Syntactic and Semantic Annotation of a Corpus of Spanish |
11:45-13:05 |
Jan Strunk |
Enriching a Treebank to Investigate Relative Clause Extraposition in German |
11:45-13:05 |
John Lee and Dag Haug |
Porting an Ancient Greek and Latin Treebank |
|
Session P25 - Discourse Annotation |
Chair : Dan Cristea |
14:55-16:35 |
Piroska Lendvai, Thierry Declerck, Sándor Darányi, Pablo Gervás, Raquel Hervás, Scott Malec and Federico Peinado |
Integration of Linguistic Markup into Semantic Models of Folk Narratives: The Fairy Tale Use Case |
14:55-16:35 |
Šárka Zikánová, Lucie Mladová, Jiří Mírovský and Pavlína Jínová |
Typical Cases of Annotators Disagreement in Discourse Annotations in Prague Dependency Treebank |
14:55-16:35 |
Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell, Jennifer Stromer-Galley, Sarah Taylor and Nick Webb |
MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse |
14:55-16:35 |
Raffaella Bernardi, Manuel Kirschner and Zorana Ratkovic |
Context Fusion: The Role of Discourse Structure and Centering Theory |
14:55-16:35 |
Xuchen Yao, Irina Borisova and Mehwish Alam |
PDTB XML: the XMLization of the Penn Discourse TreeBank 2.0 |
14:55-16:35 |
Horacio Saggion, Elena Stein-Sparvieri, David Maldavsky and Sandra Szasz |
NLP Resources for the Analysis of Patient/Therapist Interviews |
14:55-16:35 |
Nicole Novielli and Carlo Strapparava |
Studying the Lexicon of Dialogue Acts |
14:55-16:35 |
Nils Reiter, Oliver Hellwig, Anand Mishra, Anette Frank and Jens Burkhardt |
Using NLP Methods for the Analysis of Rituals |
14:55-16:35 |
Amal Al-Saif and Katja Markert |
The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic |
14:55-16:35 |
Maria Liakata, Simone Teufel, Advaith Siddharthan and Colin Batchelor |
Corpora for the Conceptualisation and Zoning of Scientific Papers |
14:55-16:35 |
Oi Yee Kwong |
Constructing an Annotated Story Corpus: Some Observations and Issues |
14:55-16:35 |
David K. Elson and Kathleen R. McKeown |
Building a Bank of Semantically Encoded Narratives |
14:55-16:35 |
Rashmi Prasad, Aravind Joshi and Bonnie Webber |
Exploiting Scope for Shallow Discourse Parsing |
|
Session P26 - Dialogue Annotation |
Chair : Jens Allwood |
14:55-16:35 |
Sara Tonelli, Giuseppe Riccardi, Rashmi Prasad and Aravind Joshi |
Annotation of Discourse Relations for Conversational Spoken Dialogs |
14:55-16:35 |
Thomas Schmidt and Wilfried Schütte |
FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction |
14:55-16:35 |
Agnieszka Mykowiecka, Katarzyna Głowińska and Joanna Rabiega-Wiśniewska |
Domain-related Annotation of Polish Spoken Dialogue Corpus LUNA.PL |
14:55-16:35 |
Yasuharu Den, Hanae Koiso, Takehiko Maruyama, Kikuo Maekawa, Katsuya Takanashi, Mika Enomoto and Nao Yoshida |
Two-level Annotation of Utterance-units in Japanese Dialogs: An Empirically Emerged Scheme |
14:55-16:35 |
Olivier Blanc, Matthieu Constant, Anne Dister and Patrick Watrin |
Partial Parsing of Spontaneous Spoken French |
14:55-16:35 |
Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zaghouani, Dave Graff and Mike Ciul |
From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News |
14:55-16:35 |
Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka and Satoshi Nakamura |
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems |
14:55-16:35 |
Iris Eshkol, Denis Maurel and Nathalie Friburger |
Eslo: From Transcription to Speakers' Personal Information Annotation |
14:55-16:35 |
Roberta Catizone, Alexiei Dingli and Robert Gaizauskas |
Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue |
14:55-16:35 |
Renata Savy |
Pr.A.Ti.D: A Coding Scheme for Pragmatic Annotation of Dialogues. |
|
Session P28 - Terminological Lexicons, Ontologies, Corpora |
Chair : Monica Monachini |
16:55-18:15 |
Ranka Stanković, Ivan Obradović and Olivera Kitanović |
GIS Application Improvement with Multilingual Lexical and Terminological Resources |
16:55-18:15 |
Rita Marinelli, Adriana Roventini, Giovanni Spadoni and Sebastiana Cucurullo |
Lexical Semantic Resources in a Terminological Network |
16:55-18:15 |
Nelleke Oostdijk, Suzan Verberne and Cornelis Koster |
Constructing a Broad-coverage Lexicon for Text Mining in the Patent Domain |
16:55-18:15 |
Rodrigo Agerri and Ana García-Serrano |
Q-WordNet: Extracting Polarity from WordNet Senses |
16:55-18:15 |
Aya Nishikawa, Ryo Nishimura, Yasuhiko Watanabe and Yoshihiro Okada |
A Context Sensitive Variant Dictionary for Supporting Variant Selection |
16:55-18:15 |
Montse Cuadros, Egoitz Laparra, German Rigau, Piek Vossen and Wauter Bosma |
Integrating a Large Domain Ontology of Species into WordNet |
16:55-18:15 |
Andrejs Vasiljevs and Kaspars Balodis |
Corpus Based Analysis for Multilingual Terminology Entry Compounding |
16:55-18:15 |
Arianne Reimerink, Pilar León Araúz and Pedro J. Magaña Redondo |
EcoLexicon: An Environmental TKB |
16:55-18:15 |
Dimitrios Kokkinakis and Ulla Gerdin |
A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration |
|
Session P29 - Question Answering and Evaluation |
Chair : Giuseppe Attardi |
16:55-18:15 |
Silvia Quarteroni and Alessandro Moschitti |
A Comprehensive Resource to Evaluate Complex Open Domain Question Answering |
16:55-18:15 |
Alessandra Giordani and Alessandro Moschitti |
Corpora for Automatically Learning to Map Natural Language Questions into SQL Queries |
16:55-18:15 |
Fang Xu and Dietrich Klakow |
Paragraph Acquisition and Selection for List Question Using Amazons Mechanical Turk |
16:55-18:15 |
Diana Santos, Luís Miguel Cabral, Corina Forascu, Pamela Forner, Fredric Gey, Katrin Lamm, Thomas Mandl, Petya Osenova, Anselmo Peñas, Álvaro Rodrigo, Julia Schulz, Yvonne Skalban and Erik Tjong Kim Sang |
GikiCLEF: Crosscultural Issues in Multilingual Information Access |
16:55-18:15 |
Sarra El Ayari, Brigitte Grau and Anne-Laure Ligozat |
Fine-grained Linguistic Evaluation of Question Answering Systems |
16:55-18:15 |
Arnaud Grappy, Brigitte Grau, Olivier Ferret, Cyril Grouin, Véronique Moriceau, Isabelle Robba, Xavier Tannier, Anne Vilnat and Vincent Barbier |
A Corpus for Studying Full Answer Justification |
16:55-18:15 |
Ludovic Quintard, Olivier Galibert, Gilles Adda, Brigitte Grau, Dominique Laurent, Véronique Moriceau, Sophie Rosset, Xavier Tannier and Anne Vilnat |
Question Answering on Web Data: The QA Evaluation in Quæro |
16:55-18:15 |
Xavier Tannier and Véronique Moriceau |
FIDJI: Web Question-Answering at Quaero 2009 |
16:55-18:15 |
Bernard Jacquemin |
A Derivational Rephrasing Experiment for Question Answering |
|
Session P31 - Dialogue Corpora |
Chair : Laurent Prevot |
16:55-18:15 |
Keyan Zhou, Aijun Li, Zhigang Yin and Chengqing Zong |
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation |
16:55-18:15 |
Yuki Kamiya, Tomohiro Ohno, Shigeki Matsubara and Hideki Kashioka |
Construction of Back-Channel Utterance Corpus for Responsive Spoken Dialogue System Development |
16:55-18:15 |
Werner Spiegl, Korbinian Riedhammer, Stefan Steidl and Elmar Nöth |
FAU IISAH Corpus -- A German Speech Database Consisting of Human-Machine and Human-Human Interaction Acquired by Close-Talking and Far-Distance Microphones |
16:55-18:15 |
Rodolfo Delmonte, Antonella Bristot and Vincenzo Pallotta |
Deep Linguistic Processing with GETARUNS for Spoken Dialogue Understanding |
16:55-18:15 |
Helena Spilková, Daniel Brenner, Anton Öttl, Pavel Vondřička, Wim van Dommelen and Mirjam Ernestus |
The Kachna L1/L2 Picture Replication Corpus |
16:55-18:15 |
Linda Brandschain, David Graff, Christopher Cieri, Kevin Walker, Chris Caruso and Abby Neely |
Greybeard Longitudinal Speech Study |
16:55-18:15 |
Linda Brandschain, David Graff, Chris Cieri, Kevin Walker, Chris Caruso and Abby Neely |
Mixer 6 |
|
Session P33 - Information Extraction, Terminology, Corpora |
Chair : Pierre Zweigenbaum |
18:20-19:40 |
Claudia Borg, Mike Rosner and Gordon J. Pace |
Automatic Grammar Rule Extraction and Ranking for Definitions |
18:20-19:40 |
Alberto Tretti and Barbara Di Eugenio |
Analysis and Presentation of Results for Mobile Local Search |
18:20-19:40 |
Atsushi Fujii |
Modeling Wikipedia Articles to Enhance Encyclopedic Search |
18:20-19:40 |
Christian Federmann and Thierry Declerck |
Extraction, Merging, and Monitoring of Company Data from Heterogeneous Sources |
18:20-19:40 |
Alberto Simões, José João Almeida and Rita Farinha |
Processing and Extracting Data from Dicionário Aberto |
18:20-19:40 |
Ziqi Zhang, José Iria and Fabio Ciravegna |
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction |
18:20-19:40 |
Jakob Halskov, Dorte Haltrup Hansen, Anna Braasch and Sussi Olsen |
Quality Indicators of LSP Texts ― Selection and Measurements Measuring the Terminological Usefulness of Documents for an LSP Corpus |
18:20-19:40 |
Eric Charton and Juan-Manuel Torres-Moreno |
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems |
18:20-19:40 |
Cécile Grivaz |
Human Judgements on Causation in French Texts |
18:20-19:40 |
Heng Ji, Xiang Li, Angelo Lucia and Jianting Zhang |
Annotating Event Chains for Carbon Sequestration Literature |
18:20-19:40 |
Kumutha Swampillai and Mark Stevenson |
Inter-sentential Relations in Information Extraction Corpora |
18:20-19:40 |
Christopher R. Walker and Hannah Copperman |
Evaluating Complex Semantic Artifacts |
18:20-19:40 |
Marc Kemps-Snijders, Thomas Koller, Han Sloetjes and Huib Verwey |
LAT Bridge: Bridging Tools for Annotation and Exploration of Rich Linguistic Data |
|
Session P35 - Text Corpora and Language Resources |
Chair : Toma? Erjavec |
18:20-19:40 |
Henk van den Heuvel, René van Horik, Stef Scagliola, Eric Sanders and Paula Witkamp |
The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-Humanities |
18:20-19:40 |
Martin Reynaert, Nelleke Oostdijk, Orphée De Clercq, Henk van den Heuvel and Franciska de Jong |
Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus |
18:20-19:40 |
Youssef Aït Ouguengay and Aïcha Bouhjar |
For Standardised Amazigh Linguistic Resources |
18:20-19:40 |
Dafydd Gibbon, Moses Ekpenyong and Eno-Abasi Urua |
Medefaidrin: Resources Documenting the Birth and Death Language Life-cycle |
18:20-19:40 |
Nicolas Serrano, Francisco Castro and Alfons Juan |
The RODRIGO Database |
18:20-19:40 |
Cristina Sánchez-Marco, Gemma Boleda, Josep Maria Fontana and Judith Domingo |
Annotation and Representation of a Diachronic Corpus of Spanish |
18:20-19:40 |
Roser Sanromà and Gemma Boleda |
The Database of Catalan Adjectives |
18:20-19:40 |
Graham Neubig and Shinsuke Mori |
Word-based Partial Annotation for Efficient Corpus Construction |
|
Session P36 - Multimodal and Audiovisual Corpora |
Chair : Daniel Sonntag |
9:45-11:25 |
Elena Grishina |
Multimodal Russian Corpus (MURCO): First Steps |
9:45-11:25 |
Kristiina Jokinen |
Non-verbal Signals for Turn-taking and Feedback |
9:45-11:25 |
Patrizia Paggio, Jens Allwood, Elisabeth Ahlsén, Kristiina Jokinen and Costanza Navarretta |
The NOMCO Multimodal Nordic Resource - Goals and Characteristics |
9:45-11:25 |
Fernando Fernández-Martínez, Juan Manuel Lucas-Cuesta, Roberto Barra Chicote, Javier Ferreiros and Javier Macías-Guarasa |
HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish |
9:45-11:25 |
Francisco Torreira and Mirjam Ernestus |
The Nijmegen Corpus of Casual Spanish |
9:45-11:25 |
Rein Ove Sikveland, Anton Öttl, Ingunn Amdal, Mirjam Ernestus, Torbjørn Svendsen and Jens Edlund |
Spontal-N: A Corpus of Interactional Spoken Norwegian |
9:45-11:25 |
Jens Edlund, Jonas Beskow, Kjell Elenius, Kahl Hellmer, Sofia Strönbergsson and David House |
Spontal: A Swedish Spontaneous Dialogue Corpus of Audio, Video and Motion Capture |
9:45-11:25 |
Jérôme Urbain, Elisabetta Bevacqua, Thierry Dutoit, Alexis Moinet, Radoslaw Niewiadomski, Catherine Pelachaud, Benjamin Picart, Joëlle Tilmanne and Johannes Wagner |
The AVLaughterCycle Database |
9:45-11:25 |
Carlos Gómez Gallo, T. Florian Jaeger and Katrina Furth |
A Database for the Exploration of Spanish Planning |
9:45-11:25 |
Stavros Ntalampiras, Todor Ganchev, Ilyas Potamitis and Nikos Fakotakis |
Heterogeneous Sensor Database in Support of Human Behaviour Analysis in Unrestricted Environments: The Audio Part |
9:45-11:25 |
Theodoros Kostoulas, Otilia Kocsis, Todor Ganchev, Fernando Fernández-Aranda, Juan J. Santamaría, Susana Jiménez-Murcia, Maher Ben Moussa, Nadia Magnenat-Thalmann and Nikos Fakotakis |
The PlayMancer Database: A Multimodal Affect Database in Support of Research and Development Activities in Serious Game Environment |
9:45-11:25 |
Alexander Vorwerk, Xiaohui Wang, Dorothea Kolossa, Steffen Zeiler and Reinhold Orglmeister |
WAPUSK20 - A Database for Robust Audiovisual Speech Recognition |
9:45-11:25 |
Peng-Wen Chen, Snehal Kumar Chennuru and Ying Zhang |
A Language Approach to Modeling Human Behaviors |
9:45-11:25 |
Kathleen Eberhard, Hannele Nicholson, Sandra Kübler, Susan Gundersen and Matthias Scheutz |
The Indiana ``Cooperative Remote Search Task"" (CReST) Corpus |
9:45-11:25 |
Katerina Pastra, Christian Wallraven, Michael Schultze, Argyro Vataki and Kathrin Kaulard |
The POETICON Corpus: Capturing Language Use and Sensorimotor Experience in Everyday Interaction |
9:45-11:25 |
Quan Nguyen and Michael Kipp |
Annotation of Human Gesture using 3D Skeleton Controls |
9:45-11:25 |
Massimo Poesio, Marco Baroni, Oswald Lanz, Alessandro Lenci, Alexandros Potamianos, Hinrich Schütze, Sabine Schulte im Walde and Luca Surian |
BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do |
|
Session P45 - Evaluation Methodologies |
Chair : Alessandro Moschitti |
11:45-13:05 |
Elin Carlsson and Hercules Dalianis |
Influence of Module Order on Rule-Based De-identification of Personal Names in Electronic Patient Records Written in Swedish |
11:45-13:05 |
Olga Babko-Malaya, Dan Hunter, Connie Fournelle and Jim White |
Evaluation of Document Citations in Phase 2 Gale Distillation |
11:45-13:05 |
Olivier Galibert, Ludovic Quintard, Sophie Rosset, Pierre Zweigenbaum, Claire Nédellec, Sophie Aubin, Laurent Gillard, Jean-Pierre Raysz, Delphine Pois, Xavier Tannier, Louise Deléger and Dominique Laurent |
Named and Specific Entity Detection in Varied Data: The Quæro Named Entity Baseline Evaluation |
11:45-13:05 |
Marco Guerini, Carlo Strapparava and Oliviero Stock |
Evaluation Metrics for Persuasive NLP with Google AdWords |
11:45-13:05 |
Joana Hois |
Inter-Annotator Agreement on a Linguistic Ontology for Spatial Language - A Case Study for GUM-Space |
11:45-13:05 |
Petra-Maria Strauß, Stefan Scherer, Georg Layher and Holger Hoffmann |
Evaluation of the PIT Corpus Or What a Difference a Face Makes? |
|
Session P47 - Corpora, Annotation and Tools |
Chair : Satoshi Sekine |
14:55-16:35 |
Marc Verhagen |
The Brandeis Annotation Tool |
14:55-16:35 |
Georgios Petasis and Dimitrios Petasis |
BlogBuster: A Tool for Extracting Corpora from the Blogosphere |
14:55-16:35 |
Jinho D. Choi, Claire Bonial and Martha Palmer |
Propbank Frameset Annotation Guidelines Using a Dedicated Editor, Cornerstone |
14:55-16:35 |
Dain Kaplan, Ryu Iida and Takenobu Tokunaga |
Annotation Process Management Revisited |
14:55-16:35 |
Takeshi Abekawa, Masao Utiyama, Eiichiro Sumita and Kyo Kageura |
Community-based Construction of Draft and Final Translation Corpus Through a Translation Hosting Site Minna no Hon'yaku (MNH) |
14:55-16:35 |
Maarten Marx and Anne Schuth |
DutchParl. The Parliamentary Documents in Dutch |
14:55-16:35 |
Svetla Koeva, Diana Blagoeva and Siya Kolkovska |
Bulgarian National Corpus Project |
14:55-16:35 |
Khalil Dahab and Anja Belz |
A Game-based Approach to Transcribing Images of Text |
14:55-16:35 |
Ghulam Raza |
Inferring Subcat Frames of Verbs in Urdu |
14:55-16:35 |
Romaric Besançon, Gaël de Chalendar, Olivier Ferret, Faiza Gara, Olivier Mesnard, Meriama Laïb and Nasredine Semmar |
LIMA : A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation |
14:55-16:35 |
Catarina Magro |
When CORDIAL Becomes Friendly: Endowing the CORDIAL Corpus with a Syntactic Annotation Layer |
14:55-16:35 |
Richard Johansson and Alessandro Moschitti |
A Flexible Representation of Heterogeneous Annotation Data |
14:55-16:35 |
Roberto Navigli, Paola Velardi and Juana María Ruiz-Martínez |
An Annotated Dataset for Extracting Definitions and Hypernyms from the Web |
|
Session P49 - WordNet, Framenet, Ontologies |
Chair : Karel Pala |
14:55-16:35 |
Winston Anderson, Laurette Pretorius and Albert Kotzé |
Base Concepts in the African Languages Compared to Upper Ontologies and the WordNet Top Ontology |
14:55-16:35 |
Yue Ma, Adeline Nazarenko and Laurent Audibert |
Formal Description of Resources for Ontology-based Semantic Annotation |
14:55-16:35 |
Roxane Segers and Piek Vossen |
Facilitating Non-expert Users of the KYOTO Platform: the TMEKO Editing Protocol for Synset to Ontology Mappings |
14:55-16:35 |
Chris Irwin Davis and Dan Moldovan |
Feasibility of Automatically Bootstrapping a Persian WordNet |
14:55-16:35 |
Pushpak Bhattacharyya |
IndoWordNet |
14:55-16:35 |
Zygmunt Vetulani, Marek Kubis and Tomasz Obrębski |
PolNet ― Polish WordNet: Data and Tools |
14:55-16:35 |
Mehrnoush Shamsfard, Hakimeh Fadaei and Elham Fekri |
Extracting Lexico-conceptual Knowledge for Developing Persian WordNet |
14:55-16:35 |
Prasanth Kolachina, Sudheer Kolachina, Anil Kumar Singh, Samar Husain, Viswanath Naidu, Rajeev Sangal and Aksar Bharati |
Grammar Extraction from Treebanks for Hindi and Telugu |
14:55-16:35 |
Emiliano Giovannetti |
An Unsupervised Approach for Semantic Relation Interpretation |
14:55-16:35 |
Gabor Melli |
Concept Mentions within KDD-2009 Abstracts (kdd09cma1) Linked to a KDD Ontology (kddo1) |
14:55-16:35 |
Min-Jae Kwon, Hae-Yun Lee and Hee-Rahk Chae |
Linking Korean Words with an Ontology |
14:55-16:35 |
Hassina Aliane, Zaia Alimazighi and Ahmed Cherif Mazari |
Al ―Khalil : The Arabic Linguistic Ontology Project |
14:55-16:35 |
Cássia Trojahn, Paulo Quaresma and Renata Vieira |
An API for Multi-lingual Ontology Matching |
14:55-16:35 |
Thierry Declerck and Piroska Lendvai |
Towards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems |
14:55-16:35 |
Kiril Simov and Petya Osenova |
Constructing of an Ontology-based Lexicon for Bulgarian |
14:55-16:35 |
René Witte, Ninus Khamis and Juergen Rilling |
Flexible Ontology Population from Text: The OwlExporter |
14:55-16:35 |
Takehiro Teraoka, Jun Okamoto and Shun Ishizaki |
An Associative Concept Dictionary for Verbs and its Application to Elliptical Word Estimation |
14:55-16:35 |
Nao Tatsumi, Jun Okamoto and Shun Ishizaki |
Evaluating Semantic Relations and Distances in the Associative Concept Dictionary using NIRS-imaging |
14:55-16:35 |
Giulio Paci, Giorgio Pedrazzi and Roberta Turra |
Wikipedia-based Approach for Linking Ontology Concepts to their Realisations in Text |
14:55-16:35 |
Pradeep Dantuluri, Brian Davis and Siegfried Handschuh |
A Use Case for Controlled Languages as Interfaces to Semantic Web Applications |
14:55-16:35 |
Alessandro Oltramari, Guido Vetere, Maurizio Lenzerini, Aldo Gangemi and Nicola Guarino |
Senso Comune |
|
 |