9.00-9.40 | Keynote Speech: Jaime Carbonell | Intelligence Resource Collection for Low-Density Languages | Republic Hall | |
9.40-9.45 | 5 minutes break | |||
O16 - LRs: Infrastructures and Strategies | Temi Zamit | |||
9.45-10.05 | 954 |
Adapting to Trends in Language Resource Development: A Progress Report on LDC Activities | Christopher Cieri and Mark Liberman | |
10.05-10.25 | 948 |
Update on the ELRA activities and perspectives | Victoria Arranz | |
10.25-10.45 | 369 |
Preparing the field for an Open Resource Infrastructure: the role of the FLaReNet Network of Excellence | Nicoletta Calzolari and Claudia Soria | |
10.45-11.05 | 570 |
LoonyBin: Keeping Language Technologists Sane through Automated Management of Experimental (Hyper)Workflows | Jonathan H. Clark and Alon Lavie | |
11.05-11.25 | 798 |
Enhanced Infrastructure for Creation and Collection of Translation Resources | Zhiyi Song, Stephanie Strassel, Gary Krug and Kazuaki Maeda | |
O17 - Opinion Mining and Emotions | Vassali | |||
9.45-10.05 | 242 |
Construction of a Chinese Opinion Treebank | Lun-Wei Ku, Ting-Hao Huang and Hsin-Hsi Chen | |
10.05-10.25 | 385 |
Twitter as a Corpus for Sentiment Analysis and Opinion Mining | Alexander Pak and Patrick Paroubek | |
10.25-10.45 | 165 |
Annotation Scheme and Gold Standard for Dutch Subjective Adjectives | Isa Maks and Piek Vossen | |
10.45-11.05 | 208 |
Learning Subjectivity Phrases missing from Resources through a Large Set of Semantic Tests | Matthieu Vernier, Laura Monceaux and Béatrice Daille | |
11.05-11.25 | 607 |
Predicting Persuasiveness in Political Discourses | Carlo Strapparava, Marco Guerini and Oliviero Stock | |
O18 - Information Extraction | Michel'angelo Grima | |||
9.45-10.05 | 54 |
Arabic Word Segmentation for Better Unit of Analysis | Yassine Benajiba and Imed Zitouni | |
10.05-10.25 | 63 |
Dictionary and Monolingual Corpus-based Query Translation for Basque-English CLIR | Xabier Saralegi and Maddalen Lopez de Lacalle | |
10.25-10.45 | 215 |
Czech Information Retrieval with Syntax-based Language Models | Jana Straková and Pavel Pecina | |
10.45-11.05 | 499 |
Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure | Lukas Michelbacher, Florian Laws, Beate Dorow, Ulrich Heid and Hinrich Schütze | |
11.05-11.25 | 353 |
Building a Domain-specific Document Collection for Evaluating Metadata Effects on Information Retrieval | Walid Magdy, Jinming Min, Johannes Leveling and Gareth J. F. Jones | |
O19 - Semantics | David Bruce | |||
9.45-10.05 | 93 |
The More the Better? Assessing the Influence of Wikipedia’s Growth on Semantic Relatedness Measures | Torsten Zesch and Iryna Gurevych | |
10.05-10.25 | 632 |
Comparing Computational Models of Selectional Preferences - Second-order Co-Occurrence vs. Latent Semantic Clusters | Sabine Schulte im Walde | |
10.25-10.45 | 733 |
Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation | Daisuke Kawahara and Sadao Kurohashi | |
10.45-11.05 | 292 |
A Random Graph Walk based Approach to Computing Semantic Relatedness Using Knowledge from Wikipedia | Ziqi Zhang, Anna Lisa Gentile, Lei Xia, José Iria and Sam Chapman | |
11.05-11.25 | 446 |
A Modality Lexicon and its use in Automatic Tagging | Kathrin Baker, Michael Bloodgood, Bonnie Dorr, Nathaniel W. Filardo, Lori Levin and Christine Piatko | |
9.45-11.25 | Poster Sessions | Sacra Infermeria | ||
P17 - Semantic Annotation | ||||
183 |
Building a Lexicon of French Deverbal Nouns from a Semantically Annotated Corpus | Antonio Balvet, Lucie Barque and Rafael Marín | ||
217 |
Building the Basque PropBank | Izaskun Aldezabal, María Jesús Aranzabe, Arantza Díaz de Ilarraza and Ainara Estarrona | ||
222 |
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus | Samuel Reese, Gemma Boleda, Montse Cuadros, Lluís Padró and German Rigau | ||
252 |
ADN-Classifier: Automatically Assigning Denotation Types to Nominalizations | Aina Peris, Mariona Taulé, Gemma Boleda and Horacio Rodríguez | ||
335 |
Descriptive Analysis of Negation Cues in Biomedical Texts | Roser Morante | ||
457 |
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora | Diana Santos and Cristina Mota | ||
564 |
Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building | Magali Sanches Duran, Marcelo Adriano Amâncio and Sandra Maria Aluísio | ||
653 |
Annotating the Enron Email Corpus with Number Senses | Stuart Moore, Sabine Buchholz and Anna Korhonen | ||
682 |
Annotating Event Mentions in Text with Modality, Focus, and Source Information | Suguru Matsuyoshi, Megumi Eguchi, Chitose Sao, Koji Murakami, Kentaro Inui and Yuji Matsumoto | ||
713 |
Capturing Coercions in Texts: a First Annotation Exercise | Elisabetta Jezek and Valeria Quochi | ||
786 |
Fairy Tale Corpus Organization Using Latent Semantic Mapping and an Item-to-item Top-n Recommendation Algorithm | Paula Vaz Lobo and David Martins de Matos | ||
P18 - Corpus and Morphological Annotation | ||||
92 |
Ontology-based Interoperation of Linguistic Tools for an Improved Lemma Annotation in Spanish | Antonio Pareja-Lora and Guadalupe Aguado de Cea | ||
99 |
Design, Compilation, and Preliminary Analyses of Balanced Corpus of Contemporary Written Japanese | Kikuo Maekawa, Makoto Yamazaki, Takehiko Maruyama, Masaya Yamaguchi, Hideki Ogura, Wakako Kashino, Toshinobu Ogiso, Hanae Koiso and Yasuharu Den | ||
172 |
A Morphologically-Analyzed CHILDES Corpus of Hebrew | Bracha Nir, Brian MacWhinney and Shuly Wintner | ||
395 |
Annotation of Morphological Meanings of Verbs Revisited | Jarmila Panevová and Magda Ševčíková | ||
566 |
Consistent and Flexible Integration of Morphological Annotation in the Arabic Treebank | Seth Kulick, Ann Bies and Mohamed Maamouri | ||
P19 - Applications of Speech Technology | ||||
103 |
Incorporating Speech Synthesis in the Development of a Mobile Platform for e-learning | Justus Roux, Pieter Scholtz, Daleen Klop, Claus Povlsen, Bart Jongejan and Asta Magnusdottir | ||
174 |
A Study of the Influence of Speech Type on Automatic Language Recognition Performance | Alejandro Abejón, Doroteo T. Toledano, Danilo Spada, Victor González and Daniel Hernández López | ||
280 |
Bootstrapping Named Entity Extraction for the Creation of Mobile Services | Joseph Polifroni, Imre Kiss and Mark Adler | ||
357 |
Speech Translation in Pedagogical Environment Using Additional Sources of Knowledge | Jesús Tomás, Alejandro Canovas, Jaime Lloret, Miguel García Pineda and Jose L. Abad | ||
462 |
Language Modeling Approach for Retrieving Passages in Lecture Audio Data | Koichiro Honda and Tomoyosi Akiba | ||
610 |
A Multilingual CALL Game Based on Speech Translation | Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao and Claudia Baur | ||
741 |
Modified LTSE-VAD Algorithm for Applications Requiring Reduced Silence Frame Misclassification | Iker Luengo, Eva Navas, Igor Odriozola, Ibon Saratxaga, Inmaculada Hernaez, Iñaki Sainz and Daniel Erro | ||
953 |
Lexicon Design for Transcription of Spontaneous Voice Messages | Michal Gishri, Vered Silber-Varod and Ami Moyal | ||
667 |
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development | Kevin Walker, Christopher Caruso and Denise DiPersio | ||
P20 - Speech Data Collection | ||||
255 |
NameDat: A Database of English Proper Names Spoken by Native Norwegians | Line Adde and Torbjørn Svendsen | ||
262 |
A Database of Age and Gender Annotated Telephone Speech | Felix Burkhardt, Martin Eckert, Wiebke Johannsen and Joachim Stegmann | ||
285 |
WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network | Patrick Bauer, David Scheler and Tim Fingscheidt | ||
516 |
Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices | Petr Pollák and Josef Rajnoha | ||
822 |
Collecting Voices from the Cloud | Ian McGraw, Chia-ying Lee, Lee Hetherington, Stephanie Seneff and Jim Glass | ||
P21 - Dialogue Evaluation | ||||
34 |
Construction of a Benchmark Data Set for Cross-lingual Word Sense Disambiguation | Els Lefever and Véronique Hoste | ||
133 |
Ad-hoc Evaluations Along the Lifecycle of Industrial Spoken Dialogue Systems: Heading to Harmonisation? | Marianne Laurent, Philippe Bretier and Carole Manquillet | ||
675 |
Practical Evaluation of Speech Recognizers for Virtual Human Dialogue Systems | Xuchen Yao, Pravin Bhutada, Kallirroi Georgila, Kenji Sagae, Ron Artstein and David Traum | ||
801 |
Improved Statistical Measures to Assess Natural Language Parser Performance across Domains | Barbara Plank | ||
303 |
Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns | Carlos-D. Martínez-Hinarejos, Vicent Tamarit and José-M. Benedí | ||
11.25-11.45 | Coffee break | |||
O20 - Discourse Annotation and Parsing | Temi Zamit | |||
11.45-12.05 | 58 |
A Database of Narrative Schemas | Nathanael Chambers and Dan Jurafsky | |
12.05-12.25 | 796 |
How Complex is Discourse Structure? | Markus Egg and Gisela Redeker | |
12.25-12.45 | 893 |
A General Purpose FrameNet-based Shallow Semantic Parser | Bonaventura Coppola and Alessandro Moschitti | |
12.45-13.05 | 730 |
Parsing to Stanford Dependencies: Trade-offs between Speed and Accuracy | Daniel Cer, Marie-Catherine de Marneffe, Dan Jurafsky and Chris Manning | |
O21 - Emotion, Sentiment | Vassali | |||
11.45-12.05 | 123 |
WITcHCRafT: A Workbench for Intelligent exploraTion of Human ComputeR conversaTions | Alexander Schmitt, Gregor Bertrand, Tobias Heinroth, Wolfgang Minker and Jackson Liscombe | |
12.05-12.25 | 91 |
GermanPolarityClues: A Lexical Resource for German Sentiment Analysis | Ulli Waltinger | |
12.25-12.45 | 483 |
CINEMO – A French Spoken Language Resource for Complex Emotions: Facts and Baselines | Björn Schuller, Riccardo Zaccarelli, Nicolas Rollet and Laurence Devillers | |
12.45-13.05 | 506 |
Towards Investigating Effective Affective Dialogue Strategies | Gregor Bertrand, Florian Nothdurft, Steffen Walter, Andreas Scheck, Henrik Kessler and Wolfgang Minker | |
O22 - Corpus Building, Annotation and Methodology | Michel'angelo Grima | |||
11.45-12.05 | 110 |
Challenges in Building a Multilingual Alpine Heritage Corpus | Martin Volk, Noah Bubenhofer, Adrian Althaus, Maya Bangerter, Lenz Furrer and Beni Ruef | |
12.05-12.25 | 451 |
Tag Dictionaries Accelerate Manual Annotation | Marc Carmen, Paul Felt, Robbie Haertel, Deryle Lonsdale, Peter McClanahan, Owen Merkling, Eric Ringger and Kevin Seppi | |
12.25-12.45 | 432 |
WikiWoods: Syntacto-Semantic Annotation for English Wikipedia | Dan Flickinger, Stephan Oepen and Gisle Ytrestøl | |
12.45-13.05 | 199 |
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method | Hai Zhao, Yan Song and Chunyu Kit | |
O23 - Broadcast News | David Bruce | |||
11.45-12.05 | 394 |
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems | Luis Javier Rodríguez-Fuentes, Mikel Penagarikano, Germán Bordel, Amparo Varona and Mireia Díez | |
12.05-12.25 | 650 |
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News | Yannick Estève, Thierry Bazillon, Jean-Yves Antoine, Frédéric Béchet and Jérôme Farinas | |
12.25-12.45 | 249 |
Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System | Kwanchiva Saykham, Ananlada Chotimongkol and Chai Wutiwiwatchai | |
12.45-13.05 | 355 |
DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain | Doris Baum, Daniel Schneider, Rolf Bardeli, Jochen Schwenninger, Barbara Samlowski, Thomas Winkler and Joachim Köhler | |
11.45-13.05 | Poster Sessions | Sacra Infermeria | ||
P22 - Machine Translation and Evaluation | ||||
13 |
Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction | Hercules Dalianis, Hao-chun Xing and Xin Zhang | ||
47 |
Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems | Marta R. Costa-jussà, Mireia Farrús, José B. Mariño and José A. R. Fonollosa | ||
23 |
Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System | Marta R. Costa-jussà and José A. R. Fonollosa | ||
228 |
Towards Improving English-Latvian Translation: A System Comparison and a New Rescoring Feature | Maxim Khalilov, José A. R. Fonollosa, Inguna Skadina, Edgars Brālītis and Lauma Pretkalnina | ||
87 |
Mining the Correlation between Human and Automatic Evaluation at Sentence Level | Yanli Sun | ||
197 |
Appraise: An Open-Source Toolkit for Manual Phrase-Based Evaluation of Translations | Christian Federmann | ||
402 |
Is my Judge a good One? | Olivier Hamon | ||
604 |
Linguistically Motivated Unsupervised Segmentation for Machine Translation | Mark Fishel and Harri Kirik | ||
754 |
Integrating a Rule-based with a Hierarchical Translation System | Yu Chen and Andreas Eisele | ||
823 |
Contrastive Lexical Evaluation of Machine Translation | Aurélien Max, Josep Maria Crego and François Yvon | ||
83 |
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units | Yiou Wang, Kiyotaka Uchimoto, Jun’ichi Kazama, Canasai Kruengkrai and Kentaro Torisawa | ||
581 |
Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation | Masaki Murata, Tomohiro Ohno, Shigeki Matsubara and Yasuyoshi Inagaki | ||
756 |
Data Issues in English-to-Hindi Machine Translation | Ondřej Bojar, Pavel Straňák and Daniel Zeman | ||
696 |
Utilizing Semantic Equivalence Classes of Japanese Functional Expressions in Translation Rule Acquisition from Parallel Patent Sentences | Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro and Suguru Matsuyoshi | ||
694 |
English-Hindi Transliteration using Multiple Similarity Metrics | Niraj Aswani and Robert Gaizauskas | ||
P23 - Corpora and Treebanks, Grammar and Syntax | ||||
196 |
Comparing the Influence of Different Treebank Annotations on Dependency Parsing | Cristina Bosco, Simonetta Montemagni, Alessandro Mazzei, Vincenzo Lombardo, Felice Dell'Orletta, Alessandro Lenci, Leonardo Lesmo, Giuseppe Attardi, Maria Simi, Alberto Lavelli, Johan Hall, Jens Nilsson and Joakim Nivre | ||
77 |
Bank of Russian Constructions and Valencies | Olga Lyashevskaya | ||
139 |
The JOS Linguistically Tagged Corpus of Slovene | Tomaž Erjavec, Darja Fišer, Simon Krek and Nina Ledinek | ||
154 |
Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank | António Branco, Francisco Costa, João Silva, Sara Silveira, Sérgio Castro, Mariana Avelãs, Clara Pinto and João Graça | ||
259 |
The Design of Syntactic Annotation Levels in the National Corpus of Polish | Katarzyna Głowińska and Adam Przepiórkowski | ||
278 |
Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank | Kais Dukes, Eric Atwell and Abdul-Baquee M. Sharaf | ||
381 |
Querying Diverse Treebanks in a Uniform Way | Jan Štěpánek and Petr Pajas | ||
388 |
Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank | Marie Mikulová and Jan Štěpánek | ||
392 |
Statistical French Dependency Parsing: Treebank Conversion and First Results | Marie Candito, Benoît Crabbé and Pascal Denis | ||
414 |
The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research | Marc Kupietz, Cyril Belica, Holger Keibel and Andreas Witt | ||
465 |
Hungarian Dependency Treebank | Veronika Vincze, Dóra Szauter, Attila Almási, György Móra, Zoltán Alexin and János Csirik | ||
561 |
Empty Categories in a Hindi Treebank | Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow, Dipti Misra Sharma, Michael Tepper, Ashwini Vaidya and Fei Xia | ||
903 |
Propbank Instance Annotation Guidelines Using a Dedicated Editor, Jubilee | Jinho D. Choi, Claire Bonial and Martha Palmer | ||
617 |
A Japanese Particle Corpus Built by Example-Based Annotation | Hiroki Hanaoka, Hideki Mima and Jun'ichi Tsujii | ||
623 |
A Pilot Arabic CCGbank | Stephen A. Boxwell and Chris Brew | ||
697 |
Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation | Simon Mille and Leo Wanner | ||
812 |
EAGLE: an Error-Annotated Corpus of Beginning Learner German | Adriane Boyd | ||
859 |
ADESSE, a Database with Syntactic and Semantic Annotation of a Corpus of Spanish | José M. García-Miguel, Gael Vaamonde and Fita González Domínguez | ||
917 |
Enriching a Treebank to Investigate Relative Clause Extraposition in German | Jan Strunk | ||
631 |
Porting an Ancient Greek and Latin Treebank | John Lee and Dag Haug | ||
13.05-13.10 | 5 minutes break | |||
13.10-13.30 | Invited Talk: Ray Fabri | Maltese at the Crossroads of Technological Developments | Republic Hall | |
13.30-14.55 | Lunch break | |||
O24 - Machine Translation | Temi Zamit | |||
14.55-15.15 | 244 |
Active Learning and Crowd-Sourcing for Machine Translation | Vamshi Ambati, Stephan Vogel and Jaime Carbonell | |
15.15-15.35 | 426 |
Using a Grammar Checker for Evaluation and Postprocessing of Statistical Machine Translation | Sara Stymne and Lars Ahrenberg | |
15.35-15.55 | 443 |
Using Comparable Corpora to Adapt a Translation Model to Domains | Hiroyuki Kaji, Takashi Tsunakawa and Daisuke Okada | |
15.55-16.15 | 670 |
Enriching Word Alignment with Linguistic Tags | Xuansong Li, Niyu Ge, Stephen Grimes, Stephanie M. Strassel and Kazuaki Maeda | |
16.15-16.35 | 683 |
English – Oromo Machine Translation: An Experiment Using a Statistical Approach | Sisay Adugna and Andreas Eisele | |
O25 - Emotion, Sentiment - Special Session | Vassali | |||
14.55-15.15 | 769 |
SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining | Stefano Baccianella, Andrea Esuli and Fabrizio Sebastiani | |
15.15-15.35 | 403 |
Building a System for Emotions Detection from Speech to Control an Affective Avatar | Mátyás Brendel, Riccardo Zaccarelli and Laurence Devillers | |
15.35-15.55 | 511 |
The Demo / Kemo Corpus: A Principled Approach to the Study of Cross-cultural Differences in the Vocal Expression and Perception of Emotion | Martijn Goudbeek and Mirjam Broersma | |
15.55-16.15 | 909 |
Sentiment Analysis in the News | Alexandra Balahur, Ralf Steinberger, Mijail Kabadjov, Vanni Zavarella, Erik van der Goot, Matina Halkia, Bruno Pouliquen and Jenya Belyaeva | |
16.15-16.35 | DISCUSSION | |||
O26 - Corpus Tools | Michel'angelo Grima | |||
14.55-15.15 | 233 |
New Tools for Web-Scale N-grams | Dekang Lin, Kenneth Church, Heng Ji, Satoshi Sekine, David Yarowsky, Shane Bergsma, Kailash Patil, Emily Pitler, Rachel Lathbury, Vikram Rao, Kapil Dalwani and Sushant Narsale | |
15.15-15.35 | 264 |
GernEdiT - The GermaNet Editing Tool | Verena Henrich and Erhard Hinrichs | |
15.35-15.55 | 65 |
FastKwic, an “Intelligent” Concordancer Using FASTR | Véronika Lux-Pogodalla, Dominique Besagni and Karën Fort | |
15.55-16.15 | 216 |
A Resource and Tool for Super-sense Tagging of Italian Texts | Giuseppe Attardi, Stefano Dei Rossi, Giulia Di Pietro, Alessandro Lenci, Simonetta Montemagni and Maria Simi | |
16.15-16.35 | 411 |
Identification of Rare & Novel Senses Using Translations in a Parallel Corpus | Richard Schwarz, Hinrich Schütze, Fabienne Martin and Achim Stein | |
O27 - Lexicon, Morphology | David Bruce | |||
14.55-15.15 | 76 |
A Multilayered Declarative Approach to Cope with Morphotactics and Allomorphy in Derivational Morphology | Johannes Handl and Carsten Weber | |
15.15-15.35 | 872 |
Learning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica | Helena Blancafort | |
15.35-15.55 | 497 |
A Tool for Linking Stems and Conceptual Fragments to Enhance Word Access | Nuria Gala, Véronique Rey and Michael Zock | |
15.55-16.15 | 829 |
GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains | Patrice Lopez and Laurent Romary | |
16.15-16.35 | 902 |
Bootstrapping Language Neutral Term Extraction | Wauter Bosma and Piek Vossen | |
14.55-16.35 | Poster Sessions | Sacra Infermeria | ||
P24 - Parsing | ||||
430 |
The Second Evaluation Campaign of PASSAGE on Parsing of French | Patrick Paroubek, Olivier Hamon, Eric de La Clergerie, Cyril Grouin and Anne Vilnat | ||
445 |
The Creation of a Large-Scale LFG-Based Gold Parsebank | Alexis Baird and Christopher R. Walker | ||
739 |
Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank | Mridul Gupta, Vineet Yadav, Samar Husain and Dipti Misra Sharma | ||
775 |
Exploring the Spinal-STIG Model for Parsing French | Djamé Seddah | ||
834 |
Improving Chunking Accuracy on Croatian Texts by Morphosyntactic Tagging | Kristina Vučković, Željko Agić and Marko Tadić | ||
844 |
Hybrid Constituent and Dependency Parsing with Tsinghua Chinese Treebank | Rui Wang and Yi Zhang | ||
498 |
Disambiguating Compound Nouns for a Dynamic HPSG Treebank of Wall Street Journal Texts | Valia Kordoni and Yi Zhang | ||
136 |
Top-Performing Robust Constituency Parsing of Portuguese: Freely Available in as Many Ways as you Can Get it | João Silva, António Branco and Patricia Gonçalves | ||
178 |
Improvements in Parsing the Index Thomisticus Treebank. Revision, Combination and a Feature Model for Medieval Latin | Marco Passarotti and Felice Dell'Orletta | ||
350 |
FipsRomanian: Towards a Romanian Version of the Fips Syntactic Parser | Violeta Seretan, Eric Wehrli, Luka Nerima and Gabriela Soare | ||
722 |
Training Parsers on Partial Trees: A Cross-language Comparison | Kathrin Spreyer, Lilja Øvrelid and Jonas Kuhn | ||
819 |
Arabic Parsing Using Grammar Transforms | Lamia Tounsi and Josef van Genabith | ||
449 |
LAF/GrAF-grounded Representation of Dependency Structures | Yoshihiko Hayashi, Thierry Declerck and Chiharu Narawa | ||
P25 - Discourse Annotation | ||||
654 |
Integration of Linguistic Markup into Semantic Models of Folk Narratives: The Fairy Tale Use Case | Piroska Lendvai, Thierry Declerck, Sándor Darányi, Pablo Gervás, Raquel Hervás, Scott Malec and Federico Peinado | ||
762 |
Typical Cases of Annotators’ Disagreement in Discourse Annotations in Prague Dependency Treebank | Šárka Zikánová, Lucie Mladová, Jiří Mírovský and Pavlína Jínová | ||
85 |
MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse | Samira Shaikh, Tomek Strzalkowski, Aaron Broadwell, Jennifer Stromer-Galley, Sarah Taylor and Nick Webb | ||
330 |
Context Fusion: The Role of Discourse Structure and Centering Theory | Raffaella Bernardi, Manuel Kirschner and Zorana Ratkovic | ||
336 |
PDTB XML: the XMLization of the Penn Discourse TreeBank 2.0 | Xuchen Yao, Irina Borisova and Mehwish Alam | ||
341 |
NLP Resources for the Analysis of Patient/Therapist Interviews | Horacio Saggion, Elena Stein-Sparvieri, David Maldavsky and Sandra Szasz | ||
399 |
Studying the Lexicon of Dialogue Acts | Nicole Novielli and Carlo Strapparava | ||
439 |
Using NLP Methods for the Analysis of Rituals | Nils Reiter, Oliver Hellwig, Anand Mishra, Anette Frank and Jens Burkhardt | ||
479 |
The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic | Amal Al-Saif and Katja Markert | ||
644 |
Corpora for the Conceptualisation and Zoning of Scientific Papers | Maria Liakata, Simone Teufel, Advaith Siddharthan and Colin Batchelor | ||
736 |
Constructing an Annotated Story Corpus: Some Observations and Issues | Oi Yee Kwong | ||
835 |
Building a Bank of Semantically Encoded Narratives | David K. Elson and Kathleen R. McKeown | ||
935 |
Exploiting Scope for Shallow Discourse Parsing | Rashmi Prasad, Aravind Joshi and Bonnie Webber | ||
P26 - Dialogue Annotation | ||||
184 |
Annotation of Discourse Relations for Conversational Spoken Dialogs | Sara Tonelli, Giuseppe Riccardi, Rashmi Prasad and Aravind Joshi | ||
18 |
FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction | Thomas Schmidt and Wilfried Schütte | ||
337 |
Domain-related Annotation of Polish Spoken Dialogue Corpus LUNA.PL | Agnieszka Mykowiecka, Katarzyna Głowińska and Joanna Rabiega-Wiśniewska | ||
391 |
Two-level Annotation of Utterance-units in Japanese Dialogs: An Empirically Emerged Scheme | Yasuharu Den, Hanae Koiso, Takehiko Maruyama, Kikuo Maekawa, Katsuya Takanashi, Mika Enomoto and Nao Yoshida | ||
554 |
Partial Parsing of Spontaneous Spoken French | Olivier Blanc, Matthieu Constant, Anne Dister and Patrick Watrin | ||
558 |
From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News | Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zaghouani, Dave Graff and Mike Ciul | ||
676 |
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems | Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka and Satoshi Nakamura | ||
678 |
Eslo: From Transcription to Speakers' Personal Information Annotation | Iris Eshkol, Denis Maurel and Nathalie Friburger | ||
818 |
Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue | Roberta Catizone, Alexiei Dingli and Robert Gaizauskas | ||
852 |
Pr.A.Ti.D: A Coding Scheme for Pragmatic Annotation of Dialogues | Renata Savy | ||
P27 - Evaluation of Speech Recognition and Speech Synthesis | ||||
281 |
Improving Proper Name Recognition by Adding Automatically Learned Pronunciation Variants to the Lexicon | Bert Réveil, Jean-Pierre Martens and Henk van den Heuvel | ||
456 |
TTS Evaluation Campaign with a Common Spanish Database | Iñaki Sainz, Eva Navas, Inma Hernáez, Antonio Bonafonte and Francisco Campillo | ||
894 |
DICIT: Evaluation of a Distant-talking Speech Interface for Television | Timo Sowa, Fiorenza Arisio and Luca Cristoforetti | ||
16.35-16.55 | Coffee break | |||
O28 - Syntax and Semantics | Temi Zamit | |||
16.55-17.15 | 162 |
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch | Ineke Schuurman, Véronique Hoste and Paola Monachesi | |
17.15-17.35 | 603 |
PASSAGE Syntactic Representation: a Minimal Common Ground for Evaluation | Anne Vilnat, Patrick Paroubek, Eric Villemonte de la Clergerie, Gil Francopoulo and Marie-Laure Guénot | |
17.35-17.55 | 828 |
Towards Semi-Automated Annotation for Prepositional Phrase Attachment | Sara Rosenthal, William Lipovsky, Kathleen McKeown, Kapil Thadani and Jacob Andreas | |
17.55-18.15 | 493 |
Mapping between Dependency Structures and Compositional Semantic Representations | Max Jakob, Markéta Lopatková and Valia Kordoni | |
O29 - Metadata | Vassali | |||
16.55-17.15 | 306 |
Meta-Knowledge Annotation of Bio-Events | Raheel Nawaz, Paul Thompson, John McNaught and Sophia Ananiadou | |
17.15-17.35 | 951 |
A Road Map for Interoperable Language Resource Metadata | Christopher Cieri, Khalid Choukri, Nicoletta Calzolari, D. Terence Langendoen, Johannes Leveling, Martha Palmer, Nancy Ide and James Pustejovsky | |
17.35-17.55 | 434 |
Speaker Attribution in Cabinet Protocols | Josef Ruppenhofer, Caroline Sporleder and Fabian Shirokov | |
17.55-18.15 | 652 |
Annotation Time Stamps — Temporal Metadata from the Linguistic Annotation Process | Katrin Tomanek and Udo Hahn | |
O30 - Tagging | Michel'angelo Grima | |||
16.55-17.15 | 227 |
Evaluating Distributional Properties of Tagsets | Markus Dickinson and Charles Jochim | |
17.15-17.35 | 276 |
Morphological Annotation of Quranic Arabic | Kais Dukes and Nizar Habash | |
17.35-17.55 | 384 |
Arabic Part of Speech Tagging | Emad Mohamed and Sandra Kübler | |
17.55-18.15 | 138 |
MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora | Tomaž Erjavec | |
O31 - Multimodal Annotation | David Bruce | |||
16.55-17.15 | 560 |
Towards an ISO Standard for Dialogue Act Annotation | Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria and David Traum | |
17.15-17.35 | 195 |
Towards an Integrated Scheme for Semantic Annotation of Multimodal Dialogue Data | Volha Petukhova and Harry Bunt | |
17.35-17.55 | 772 |
News Image Annotation on a Large Parallel Text-image Corpus | Pierre Tirilly, Vincent Claveau and Patrick Gros | |
17.55-18.15 | 596 |
Types of Nods. The Polysemy of a Social Signal | Isabella Poggi, Francesca D'Errico and Laura Vincze | |
16.55-18.15 | Poster Sessions | Sacra Infermeria | ||
P28 - Terminological Lexicons, Ontologies, Corpora | ||||
57 |
GIS Application Improvement with Multilingual Lexical and Terminological Resources | Ranka Stanković, Ivan Obradović and Olivera Kitanović | ||
147 |
Lexical Semantic Resources in a Terminological Network | Rita Marinelli, Adriana Roventini, Giovanni Spadoni and Sebastiana Cucurullo | ||
378 |
Constructing a Broad-coverage Lexicon for Text Mining in the Patent Domain | Nelleke Oostdijk, Suzan Verberne and Cornelis Koster | ||
695 |
Q-WordNet: Extracting Polarity from WordNet Senses | Rodrigo Agerri and Ana García-Serrano | ||
699 |
A Context Sensitive Variant Dictionary for Supporting Variant Selection | Aya Nishikawa, Ryo Nishimura, Yasuhiko Watanabe and Yoshihiro Okada | ||
703 |
Integrating a Large Domain Ontology of Species into WordNet | Montse Cuadros, Egoitz Laparra, German Rigau, Piek Vossen and Wauter Bosma | ||
855 |
Corpus Based Analysis for Multilingual Terminology Entry Compounding | Andrejs Vasiljevs and Kaspars Balodis | ||
895 |
EcoLexicon: An Environmental TKB | Arianne Reimerink, Pilar León Araúz and Pedro J. Magaña Redondo | ||
60 |
A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration | Dimitrios Kokkinakis and Ulla Gerdin | ||
P29 - Question Answering and Evaluation | ||||
188 |
A Comprehensive Resource to Evaluate Complex Open Domain Question Answering | Silvia Quarteroni and Alessandro Moschitti | ||
724 |
Corpora for Automatically Learning to Map Natural Language Questions into SQL Queries | Alessandra Giordani and Alessandro Moschitti | ||
241 |
Paragraph Acquisition and Selection for List Question Using Amazon’s Mechanical Turk | Fang Xu and Dietrich Klakow | ||
272 |
GikiCLEF: Crosscultural Issues in Multilingual Information Access | Diana Santos, Luís Miguel Cabral, Corina Forascu, Pamela Forner, Fredric Gey, Katrin Lamm, Thomas Mandl, Petya Osenova, Anselmo Peñas, Álvaro Rodrigo, Julia Schulz, Yvonne Skalban and Erik Tjong Kim Sang | ||
520 |
Fine-grained Linguistic Evaluation of Question Answering Systems | Sarra El Ayari, Brigitte Grau and Anne-Laure Ligozat | ||
529 |
A Corpus for Studying Full Answer Justification | Arnaud Grappy, Brigitte Grau, Olivier Ferret, Cyril Grouin, Véronique Moriceau, Isabelle Robba, Xavier Tannier, Anne Vilnat and Vincent Barbier | ||
186 |
Question Answering on Web Data: The QA Evaluation in Quæro | Ludovic Quintard, Olivier Galibert, Gilles Adda, Brigitte Grau, Dominique Laurent, Véronique Moriceau, Sophie Rosset, Xavier Tannier and Anne Vilnat | ||
68 |
FIDJI: Web Question-Answering at Quaero 2009 | Xavier Tannier and Véronique Moriceau | ||
104 |
A Derivational Rephrasing Experiment for Question Answering | Bernard Jacquemin | ||
P30 - Natural Language Generation | ||||
593 |
SINotas: the Evaluation of a NLG Application | Roberto P. A. Araujo, Rafael L. de Oliveira, Eder M. de Novais, Thiago D. Tadeu, Daniel B. Pereira and Ivandré Paraboni | ||
715 |
Extracting Surface Realisation Templates from Corpora | Thiago D. Tadeu, Eder M. de Novais and Ivandré Paraboni | ||
930 |
Multilingual Question Generation | Andrew Hickl, Arnold Jung and Ying Shi | ||
185 |
A Fact-aligned Corpus of Numerical Expressions | Sandra Williams and Richard Power | ||
532 |
The GIVE-2 Corpus of Giving Instructions in Virtual Environments | Andrew Gargett, Konstantina Garoufi, Alexander Koller and Kristina Striegnitz | ||
P31 - Dialogue Corpora | ||||
248 |
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation | Keyan Zhou, Aijun Li, Zhigang Yin and Chengqing Zong | ||
260 |
Construction of Back-Channel Utterance Corpus for Responsive Spoken Dialogue System Development | Yuki Kamiya, Tomohiro Ohno, Shigeki Matsubara and Hideki Kashioka | ||
275 |
FAU IISAH Corpus -- A German Speech Database Consisting of Human-Machine and Human-Human Interaction Acquired by Close-Talking and Far-Distance Microphones | Werner Spiegl, Korbinian Riedhammer, Stefan Steidl and Elmar Nöth | ||
383 |
Deep Linguistic Processing with GETARUNS for Spoken Dialogue Understanding | Rodolfo Delmonte, Antonella Bristot and Vincenzo Pallotta | ||
768 |
The Kachna L1/L2 Picture Replication Corpus | Helena Spilková, Daniel Brenner, Anton Öttl, Pavel Vondřička, Wim van Dommelen and Mirjam Ernestus | ||
789 |
Greybeard Longitudinal Speech Study | Linda Brandschain | ||
792 |
Mixer 6 | Linda Brandschain, David Graff, Chris Cieri, Kevin Walker, Chris Caruso and Abby Neely | ||
P32 - Dialogue Management and Systems | ||||
345 |
Efficient Spoken Dialogue Domain Representation and Interpretation | Tobias Heinroth, Dan Denich, Alexander Schmitt and Wolfgang Minker | ||
481 |
On the Role of Discourse Markers in Interactive Spoken Question Answering Systems | Ioana Vasilescu, Sophie Rosset and Martine Adda-Decker | ||
578 |
Dialogue Reference in a Visual Domain | Jette Viethen, Simon Zwarts, Robert Dale and Markus Guhe | ||
660 |
NPCEditor: A Tool for Building Question-Answering Characters | Anton Leuski and David Traum | ||
5 minutes break | ||||
O32 - Lexicon | Temi Zamit | |||
18.20-18.40 | 45 |
Handling of Missing Values in Lexical Acquisition | Núria Bel | |
18.40-19.00 | 552 |
Generating FrameNets of Various Granularities: The FrameNet Transformer | Josef Ruppenhofer, Jonas Sunde and Manfred Pinkal | |
19.00-19.20 | 701 |
The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French | Benoît Sagot | |
19.20-19.40 | 773 |
Extensive Evaluation of a FrameNet-WordNet mapping resource | Diego De Cao, Danilo Croce and Roberto Basili | |
O33 - Question Answering | Vassali | |||
18.20-18.40 | 518 |
A Question-answer Distance Measure to Investigate QA System Progress | Guillaume Bernard, Sophie Rosset, Martine Adda-Decker and Olivier Galibert | |
18.40-19.00 | 611 |
Question Answering Biographic Information and Social Network Powered by the Semantic Web | Peter Adolphs, Xiwen Cheng, Tina Klüwer, Hans Uszkoreit and Feiyu Xu | |
19.00-19.20 | 372 |
Evaluation Protocol and Tools for Question-Answering on Speech Transcripts | Nicolas Moreau, Olivier Hamon, Djamel Mostefa, Sophie Rosset, Olivier Galibert, Lori Lamel, Jordi Turmo, Pere R. Comas, Paolo Rosso, Davide Buscaldi and Khalid Choukri | |
19.20-19.40 | 464 |
Evaluating Multilingual Question Answering Systems at CLEF | Pamela Forner, Danilo Giampiccolo, Bernardo Magnini, Anselmo Peñas, Álvaro Rodrigo and Richard Sutcliffe | |
O34 - Endangered Languages | Michel'angelo Grima | |||
18.20-18.40 | 254 |
Reusing Grammatical Resources for New Languages | Lene Antonsen, Trond Trosterud and Linda Wiechetek | |
18.40-19.00 | 921 |
The Problems of Language Identification within Hugely Multilingual Data Sets | Fei Xia, Carrie Lewis and William D. Lewis | |
19.00-19.20 | 559 |
The Role of Parallel Corpora in Bilingual Lexicography | Enikő Héja | |
19.20-19.40 | 333 |
Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal) | Cheikh M. Bamba Dione, Jonas Kuhn and Sina Zarrieß | |
O35 - Disordered Speech Corpus | David Bruce | |||
18.20-18.40 | 119 |
The Alborada-I3A Corpus of Disordered Speech | Oscar Saz, Eduardo Lleida, Carlos Vaquero and W.-Ricardo Rodríguez | |
18.40-19.00 | 52 |
A Speech Corpus for Dyslexic Reading Training | Jakob Schou Pedersen and Lars Bo Larsen | |
19.00-19.20 | 327 |
The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals | Caroline Williams, Andrew Thwaites, Paula Buttery, Jeroen Geertzen, Billi Randall, Meredith Shafto, Barry Devereux and Lorraine Tyler | |
19.20-19.40 | 626 |
The DesPho-APaDy Project: Developing an Acoustic-phonetic Characterization of Dysarthric Speech in French | Cécile Fougeron, Lise Crevier-Buchman, Corinne Fredouille, Alain Ghio, Christine Meunier, Claude Chevrie-Muller, Jean-Francois Bonastre, Antonia Colazo-Simon, Céline Delooze, Danielle Duez, Cédric Gendrot, Thierry Legou, Nathalie Lévêque, Claire Pillot-Loiseau, Serge Pinto, Gilles Pouchoulin, Danièle Robert, Jacqueline Vaissière, François Viallet and Coralie Vincent | |
18.20-19.40 | Poster Sessions | Sacra Infermeria | ||
P33 - Information Extraction, Terminology, Corpora | ||||
609 |
Automatic Grammar Rule Extraction and Ranking for Definitions | Claudia Borg, Mike Rosner and Gordon J. Pace | ||
647 |
Analysis and Presentation of Results for Mobile Local Search | Alberto Tretti and Barbara Di Eugenio | ||
684 |
Modeling Wikipedia Articles to Enhance Encyclopedic Search | Atsushi Fujii | ||
842 |
Extraction, Merging, and Monitoring of Company Data from Heterogeneous Sources | Christian Federmann and Thierry Declerck | ||
90 |
Processing and Extracting Data from Dicionário Aberto | Alberto Simões, José João Almeida and Rita Farinha | ||
214 |
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction | Ziqi Zhang, José Iria and Fabio Ciravegna | ||
505 |
Quality Indicators of LSP Texts – Selection and Measurements: Measuring the Terminological Usefulness of Documents for an LSP Corpus | Jakob Halskov, Dorte Haltrup Hansen, Anna Braasch and Sussi Olsen | ||
900 |
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems | Eric Charton and Juan-Manuel Torres-Moreno | ||
145 |
Human Judgements on Causation in French Texts | Cécile Grivaz | ||
802 |
Annotating Event Chains for Carbon Sequestration Literature | Heng Ji, Xiang Li, Angelo Lucia and Jianting Zhang | ||
905 |
Inter-sentential Relations in Information Extraction Corpora | Kumutha Swampillai and Mark Stevenson | ||
441 |
Evaluating Complex Semantic Artifacts | Christopher R. Walker and Hannah Copperman | ||
639 |
LAT Bridge: Bridging Tools for Annotation and Exploration of Rich Linguistic Data | Marc Kemps-Snijders, Thomas Koller, Han Sloetjes and Huib Verwey | ||
P34 - Knowledge Discovery | ||||
826 |
Socially Driven Ontology Enrichment for eLearning | Paola Monachesi and Thomas Markus | ||
881 |
Cross-Corpus Textual Entailment for Sublanguage Analysis in Epidemic Intelligence | Avaré Stewart, Kerstin Denecke and Wolfgang Nejdl | ||
407 |
The GeneReg Corpus for Gene Expression Regulation Events — An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability | Ekaterina Buyko, Elena Beisswanger and Udo Hahn | ||
284 |
The Architecture of FunGramKB | Carlos Periñán-Pascual and Francisco Arcas-Túnez | ||
766 |
A Large Coverage Verb Taxonomy for Arabic | Jaouad Mousser | ||
158 |
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information | Satoshi Sekine and Kapil Dalwani | ||
P35 - Text Corpora and Language Resources | ||||
328 |
The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-Humanities | Henk van den Heuvel, René van Horik, Stef Scagliola, Eric Sanders and Paula Witkamp | ||
549 |
Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus | Martin Reynaert, Nelleke Oostdijk, Orphée De Clercq, Henk van den Heuvel and Franciska de Jong | ||
949 |
For Standardised Amazigh Linguistic Resources | Youssef Aït Ouguengay and Aïcha Bouhjar | ||
342 |
Medefaidrin: Resources Documenting the Birth and Death Language Life-cycle | Dafydd Gibbon, Moses Ekpenyong and Eno-Abasi Urua | ||
477 |
The RODRIGO Database | Nicolas Serrano, Francisco Castro and Alfons Juan | ||
535 |
Annotation and Representation of a Diachronic Corpus of Spanish | Cristina Sánchez-Marco, Gemma Boleda, Josep Maria Fontana and Judith Domingo | ||
373 |
The Database of Catalan Adjectives | Roser Sanromà and Gemma Boleda | ||
408 |
Word-based Partial Annotation for Efficient Corpus Construction | Graham Neubig and Shinsuke Mori |