LREC 2006 Conference
-->
List of accepted papers

 ID

 Authors

 Title

 4

 Nina Grønnum

 DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus

 6

 Andi Wu Kirk Lowery

 A Hebrew Tree Bank Based on Cantillation Marks

 8

 Stéphane CHAUDIRON and Joseph MARIANI

 Technolangue : The French National Initiative for Human Language Technologies (HLT)

 9

 Manny Rayner, Pierrette Bouillon, Beth Ann Hockey and Nikos Chatzichrisafis

 REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

 10

 Anders Berglund, Richard Johansson, Pierre Nugues

 Extraction of Temporal Information from Texts in Swedish

 11

 Hlavacova Jaroslava

 New Approach to Frequency Dictionaries - Czech Example

 12

 Flora Ramírez Bustamante, Alfredo Arnaiz, Mar Ginés

 A Spell Checker for a World Language

 14

 Elliott Macklovitch

 TransType2: The last word

 19

 Ibon Saratxaga, Eva Navas, Inmaculada Hernáez, Iker Luengo

 Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

 23

 Dimitrios Kokkinakis

 Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience

 29

 Noguchi Masaki, Ichikawa Hiroshi, Hashimoto Taiichi and Tokunaga Takenobu

 A new approach to syntactic annotation

 33

 Tien-Ping Tan, Laurent Besacier

 A French Non-Native Corpus for Automatic Speech Recognition

 35

 Andrea Sansò

 Documenting variation across Europe and the Mediterranean: The Pavia Typological Database

 36

 H. Movasagh, H. Sameti, N. Hafezi, M. Bahrani

 Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems

 37

 Shisanu Tongchim, Prapass Srichaivattana, Virach Sornlertlamvanich, Hitoshi Isahara

 Blind Evaluation for Thai Search Engines

 39

 Yuji Matsumoto, Masayuki Asahara, Kiyota Hashimoto, Yukio Tono, Akira Ohtani, and Toshio Morita

 An Annotated Corpus Management Tool: ChaKi

 43

 Christophe Jouis

 Hierarchical relation " is-a ": distinguishing Belonging, inclusion and part/of relationships.

 44

 Ivan Berlocher, Hyun-gue Huh, Eric Laporte, Jee-sun Nam

 Morphological annotation of Korean with Directly Maintainable Resources

 45

 Patrick Saint-Dizier

 PrepNet: a multilingual lexical description of prepositions

 48

 Marie-Claude L’ Homme and Hee Sook Bae

 A Methodology for Developing Multilingual Resources for Terminology

 49

 Stephan Raidt, Gérard Beilly, Frédéric Elisei

 Does a virtual talking face generate proper multimodal cues to draw the user’s attention to points of interest?

 50

 May Lai-Yin Wong

 Skeleton parsing in written Chinese: annotation scheme and guidelines

 52

 Constantin Orasan and Laura Hasler

 Computer-aided summarisation — what the user really wants

 54

 Nianwen Xue

 Predicate-Argument structure annotation of nominalized verbs in Chinese

 55

 Kyo Kageura and Gen’ichiro Kikui

 A self-referring quantitative evaluation of the ATR basic travel expression corpus BTEC

 58

 Hiromi itoh Ozaku, Akinori Abe, Kaoru Sagara, Noriaki Kuwahara, Kiyoshi Kogure

 Features of terms in actual nursing activities

 59

 Nuno Seco, Diana Santos, Nuno Cardoso and Rui Vilela

 HAREM: An Advanced NER Evaluation Contest for Portuguese

 60

 Rafael Banchs, Antonio Bonafonte, Javier Pérez

 Acceptance Testing of a Spoken Language Translation System

 64

 Valentin Tablan, Wim Peters, Diana Maynard

 Creating Tools for Morphological Analysis of Sumerian

 66

 Ingunn Amdal and Torbjørn Svendsen

 FonDat1: A Speech Synthesis Corpus for Norwegian

 67

 Alon Itai, Shuly Wintner and Shlomo Yona

 A Computational Lexicon of Contemporary Hebrew

 68

 Ruska Ivanovska- Naskova

 Development of the first LRs for Macedonian: current projects

 74

 Reinhard Rapp, Carlos Martin Vide

 Example-based Machine Translation Using a Dictionary of Word Pairs

 75

 Widad MUSTAFA EL HADI, Ismail TIMIMI, Marianne DABBADIE, Khalid CHOUKRI, Yun-Chuang CHIAO, Olivier HAMON

 Terminological Resources Acquisition Tools: Towards a User-Oriented Evaluation Model

 76

 Núria Bel , Sergio Espeja , Montserrat Marimon

 New tools for the encoding of lexical data extracted from corpus

 77

 Daniela Braga, Luis Coelho, Joao Paulo Teixeira, Diamantino Freitas

 Progmatica: A Prosodic Database for European Portuguese

 79

 Enrique Amigó, Jesús Giménez

 IQMT: A Framework for Machine Translation Evaluation

 80

 Tommaso Caselli & Irina Prodanof

 Annotating bridging anaphors in Italian:in search of reliability.

 81

 Henk van den Heuvel, Khalid Choukri, Christian Gollan, Asuncion Moreno, Djamel Mostefa

 TC-STAR: New language resources for ASR and SST purposes

 86

 Yunqing Xia, Kam-Fai Wong and Wenjie Li

 Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

 87

 Baden Hughes

 Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

 88

 Yasunori OHISHI, Katunobu ITOU, Kazuya TAKEDA, and Atsushi FUJII

 Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus

 89

 Catherine Havasi, James Pustejovsky, and Marc Verhagen

 BULB: A Unified Lexical Browser

 91

 Ulrich Schäfer, Daniel Beck

 Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components

 92

 Elena Grishina

 Spoken Russian in Russian National Corpus (RNC)

 93

 Paul Buitelaar, Philipp Cimiano, Stefania Racioppa, Melanie Siegel

 Ontology-based Information Extraction with SOBA

 94

 Dominique Dutoit

 Alexandria: a web 2.0 interface displaying linguistic data for the web semantic

 96

 Jan Frederik Maas, Britta Wrede

 BITT: A corpus for Topic Tracking evaluation on multimodal Human-Robot-Interaction

 97

 Hercules Dalianis, Bart Jongejan

 Hand-crafted versus machine-learned inflectional rules: The Euroling-SiteSeeker stemmer and CST’s lemmatiser

 99

 Christian Rohrer & Martin Forst

 Improving coverage and parsing quality of a large-scale LFG for German

 100

 Bettina Schrader

 Non-probabilistic alignment of rare German and English nominal expressions

 101

 Paula Chesley, Susanne Salmon-Alt

 Automatic extraction of subcategorization frames for French

 103

 Yuki Irie, Shigeki Matsubara, Nobuo Kawaguchi, Yukiko Yamaguchi, Yasuyoshi Inagaki

 Layered Speech-Act Annotation for Spoken Dialogue Corpus

 104

 Khurshid Ahmad, Craig Bennett and Tim Oliver

 Visual Surveillance Annotation and Description

 105

 Mathieu MANGEOT and Antoine CHALVIN

 Dictionary Building with the Papillon Platform: the GDEF case

 106

 Tomohiro OHNO, Shigeki MATSUBARA, Hideki KASHIOKA, Naoto KATO, Yasuyoshi INAGAKI

 A Syntactically Annotated Corpus of Japanese Spoken Monologue

 107

 Špela Vintar, Gaël Dias, Tomaž Erjavec, Darja Fišer, Jerneja Žganec Gros

 Bilingual lexicon enrichment for a speech-to-speech translation system

 111

 Jerneja Žganec Gros, Varja Cvetko-Orešnik, Primož Jakopin, Aleš Mihelic

 SI-PRON: A Reference Pronunciation Lexicon for Slovenian

 112

 Silvie Cinková

 From PropBank to EngValLex: Adapting PropBank-Lexicon to the Valency Theory of Functional Generative Description

 113

 Tonio WANDMACHER, Jean-Yves ANTOINE

 Training language models without appropriate language resources : experiments with an augmentative and alternative communication system for disabled people

 116

 Brian Roark, Mary Harper, Eugene Charniak, Bonnie Dorr, Mark Johnson, Jeremy Kahn, Yang Liu, Mari Ostendorf, John Hale, Anna Krasnyanskaya, Matt Lease, Izhak Shafran, Matt Snover, Robin Stewart, Lisa Yung

 SParseval: Evaluation Metrics for Parsing Speech

 118

 Ming-Shun Lin and Hsin-Hsi Chen

 Constructing a Named Entity Ontology from Web Corpora

 119

 Flora Ramírez Bustamante, Enrique López Díaz

 Spelling Error Patterns in Spanish for Word Processing Applications

 123

 Panikos Heracleous, Tomomi Kaino, Hiroshi Saruwatari, Kiyohiro Shikano

 Non-audible murmur (NAM) recognition in clean and noisy environments using non-linear and human perception-based features

 124

 Christer Johansson, Anders Nøklestad, Øysten Reigem

 Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of co-reference chains.

 125

 Hitomi Tohyama, Shigeki Matsubara

 Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus

 127

 Shuichi ITAHASHI, Chiu-yu TSENG, Satoshi NAKAMURA

 Oriental COCOSDA: Past, Present and Future

 128

 Angelika Storrer, Sandra Wellinghoff

 Automated detection and annotation of term definitions in German text corpora

 131

 M.Yaseen, M. Atteya, K.Choukri, B.Haddad, B.Maegaard, N.Paulsson, S.Krauwer, C.Mokbel, H Fersøe, A Ali, C Bendahman

 Building Annotated Written Arabic Corpus Resources in NEMLAR Project

 133

 Sašo Džeroski, Tomaž Erjavec, Nina Ledinek, Petr Pajas, Zdenek Žabokrtsky, Andreja Žele

 Towards a Slovene Dependency Treebank

 137

 Canasai Kruengkrai, Virach Sornlertlamvanich, Hitoshi Isahara

 A Conditional Random Field Framework for Thai Morphological Analysis

 138

 Yoshihide Kato, Shigeki Matsubara and Yasuyoshi Inagaki

 A Corpus Search System Utilizing Lexical Dependency Structure

 139

 Peter Berck, Albert Russel

 ANNEX - a web-based framework for exploiting annotated media resources

 140

 Tomaž Erjavec

 The English-Slovene ACQUIS corpus

 141

 Daan Broeder, Andreas Claus, Freddy Offenga, Romuald Skiba, Paul Trilsbeek, Peter Wittenburg

 LAMUS - the Language Archive Management and Upload System

 142

 Daan Broeder, Freddy Offenga, Peter Wittenburg, Peter van der Kamp, David Nathan, Sven Strömqvist

 Technologies for a Federation of Language Resource Archives

 146

 Marc Kemps-Snijders, Julien Ducret, Laurent Romary, Peter Wittenburg

 An API for Accessing the ISO Data Category Registry

 147

 Peter Wittenburg, Daan Broeder, Wolfgang Klein, Stephen Levinson, Laurent Romary

 Foundations of Modern Language Resource Archives

 148

 Freddy Offenga, Daan Broeder, Peter WIttenburg, Julien Ducret, Laurent Romary

 The Metadata Profile in the ISO Data Category Registry

 149

 Marc Kemps-Snijders, Peter Wittenburg

 LEXUS - a web-based tool for manipulating lexical resources

 150

 Tomaž Erjavec, Darja Fišer

 Building Slovene Wordnet

 151

 Luciana Bordoni, Tiziana Mazzoli

 Towards an ontology for art and colours

 152

 Richard Johansson and Pierre Nugues

 Construction of a FrameNet Labeler for Swedish Text

 153

 Han Sloetjes, Albert Russel, Alex Klassmann, Peter Wittenburg, Hennie Brugman

 ELAN V2.4: a professional framework for multimodality research

 154

 Peter Berck, Hans-Jörg Bibiko, Marc Kemps-Snijders, Albert Russel, Peter Wittenburg

 Ontology-based Language Archive Utilization

 162

 Joakim Nivre, Johan Hall and Jens Nilsson

 MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

 163

 Andrej Zgank, Darinka Verdonik, Aleksandra Zogling Markus, Zdravko Kacic

 SINOD - Slovenian non-native speech database

 165

 Mark van Assem, Aldo Gangemi, and Guus Schreiber

 Conversion of WordNet to a standard RDF/OWL representation

 167

 Antal van den Bosch, Ineke Schuurman, Vincent Vandeghinste

 Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

 168

 Mark Przybocki, Gregory Sanders, Audrey Le

 Edit-Distance - A Metric for Machine Translation Evaluation

 169

 Niels Ole Bernsen, Laila Dybkjær and Svend Kiilerich

 H. C. Andersen Conversation Corpus

 170

 Magnus Sahlgren

 Towards a pertinent evaluation methodology for word-space models

 171

 Andrei Popescu-Belis, Paula Estrella, Margaret King, Nancy Underwood

 A model for context-based evaluation of language processing systems and its application to machine translation evaluation

 172

 Martin Forst & Ronald M. Kaplan

 The importance of precise tokenizing for deep grammars

 174

 Annalisa Sandrelli & Claudio Bendazzoli

 Tagging a corpus of interpreted speeches: the European Parliament Interpreting Corpus (EPIC)

 178

 Hisami Suzuki, Gary Kacmarcik

 RefRef: a Tool for Viewing and Exploring Coreference Space

 179

 Bernt Andrassy and Harald Hoege

 Evaluation of in-car ASR using an Articulation Index based WER(SNR)-measure

 180

 R. Manurung, D. O’Mara, H. Pain, G. Ritchie, A. Waller

 Building a Lexical Database for an Interactive Joke-Generator

 182

 Bruno Cartoni

 Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.

 184

 Serge Sharoff

 A uniform interface to large-scale linguistic resources

 186

 Carlo Strapparava, Alessandro Valitutti, Oliviero Stock

 The Affective Weight of Lexicon

 187

 Agnes Lisowska, Nancy Underwood

 ROTE: a tool to support users in defining the relative importance of quality characteristics

 190

 Serge Sharoff, Bogdan Babych, Anthony Hartley

 Using collocations from comparable corpora to find translation equivalents

 192

 Folkert de Vriend, Jos Swanenberg, Joep Kruijsen, Henk van den Heuvel, Lou Boves & Roeland van Hout

 A Unified Structure for Dutch Dialect Dictionary Data

 193

 E. Uí Dhonnchadha & J. Van Genabith

 A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation

 194

 Véronique Moriceau

 Language Challenges for Data Fusion in Question-Answering

 195

 Luís Sarmento, Luís Cabral

 BACO - A large database of text and co-occurrences

 196

 Ulrich Schäfer

 OntoNERdIE — Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources

 197

 Jonathan Fiscus, Jerome Ajot, Nicolas Radde, and Christophe Laprun

 Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech

 198

 J.Atserias, B.Casas, E.Comelles, M.Gonzàlez, L.Padró, M.Padró

 FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

 199

 Jiri Semecky

 On Automatic Assignment of Verb Valency Frames in Czech

 200

 Ben Medlock

 An Introduction to NLP-based Textual Anonymisation

 201

 Hennie Brugman, Luit Gazendam, Véronique Malaisé

 A web based general thesaurus browser to support indexing of television and radio programs

 203

 Judit Feliu; Jorge Vivaldi; M. Teresa Cabré

 SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations

 206

 Irene Cramer, Jochen L. Leidner, Dietrich Klakow

 Building An Evaluation Corpus for German Question Answering by Harvesting Wikipedia

 209

 Roald Eiselen

 Suggestions on the evaluation of suggestion adequacy for spelling checkers

 210

 Gerhard Fliedner

 Towards Natural Interactive Question Answering

 214

 B. Waldron, A. Copestake, U. Schaefer, B. Kiefer

 Preprocessing and Tokenisation Standards in DELPH-IN Tools

 216

 Juri Apresjan, Igor Boguslavsky, Boris Iomdin, Leonid Iomdin, Andrei Sannikov, Victor Sizov

 A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects

 218

 Martin Hassel, Jonas Sjöbergh

 Towards Holistic Summarization - Selecting Summaries, Not Sentences

 219

 Stefan Schaden, Ute Jekosch

 Casselberveetovallarga and other unpronounceable places: The CrossTowns Corpus released

 220

 Dimitrios Kokkinakis, Dana Dannélls

 Recognizing Acronyms and their Definitions in Swedish Medical Texts

 222

 Lun-Wei Ku, Yu-Ting Liang and Hsin-Hsi Chen

 Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks

 223

 Joakim Nivre, Jens Nilsson and Johan Hall

 Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation

 224

 Oana Postolache and Dan Cristea and Constantin Orasan

 Transferring coreference chains through word alignment.

 225

 G.Tummarello, C.Morbidoni, F. Kepler, F. Piazza, P.Puliti

 Semantic Web based Textual Markup: a preliminary evaluation

 227

 Anna Bogacka

 A task-specific corpus resource for Polish adult learners of English

 228

 Olena Medelyan, Stefan Schulz, Jan Paetzold, Michael Poprat, Kornél Marcó

 Language Specific and Topic Focused Web Crawling

 229

 Martin Reynaert

 Corpus-Induced Corpus Clean-up

 233

 Andrei Popescu-Belis and Maria Georgescul

 TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface

 234

 Feng Pan, Rutu Mulkar, and Jerry R. Hobbs

 An Annotated Corpus of Typical Durations of Events

 235

 Agam Patel and Dragomir Radev

 Lexical similarity can distinguish between automatic and manual translations

 236

 Nick Campbell, Toshiyuki Sadanobu, Damien Douxchamps, Masataka Imura, Suzuki Noriko and Mayumi Bono

 A multimedia database of meetings and informal interactions for tracking participant involvement and discourse flow

 237

 Fu Lee Wang, Xiaotie Deng, Feng Zou

 Towards Unified Chinese Segmentation Algorithm

 238

 Doaa Samy, Antonio Moreno Sandoval, José M. Guirao, Enrique Alfonseca

 Building and Assessment of a Parallel Multilingual Corpus (Arabic-Spanish-English)

 240

 Doaa Samy and Antonio Moreno Sandoval

 XML-Based Arabic Part of Speech Tagger

 241

 Donna K. Byron and Eric Fosler-Lussier

 The OSU Quake 2004 corpus of two-party situated problem-solving dialogs

 242

 Md. Aminul Islam and Diana Inkpen

 Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

 243

 Yoji Kiyota, Hiroshi Nakagawa

 A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

 244

 Ronnie Taib, Natalie Ruiz

 Tangible Objects for the Acquisition of Multimodal Interaction Patterns

 245

 Václav Novák and Jan Hajič

 Perspectives of Turning Prague Dependency Treebank into a Knowledge Base

 246

 Jonas Granfeldt, Pierre Nugues, Emil Persson, Jonas Thulin, Malin Ågren, Suzanne Schlyter

 CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

 247

 Hui Tan, Claire Cardie

 Annotating a multi-entity cross-document noun phrase coreference corpus: issues and preliminary results

 248

 Qian Yang, Jean-Pierre Martens, Nanneke Konings, Henk van den Heuvel

 Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names

 249

 Jaeyoung Jung, Maki Miyake, Hiroyuki Akama

 Recurrent Markov Cluster(RMCL)Algorithm for the Refinement of the Semantic Network

 254

 Catia Cucchiarini, Hugo Van hamme, Felix Smits

 JASMIN-CGN: Extension of the Spoken Dutch Corpus with speech of elderly people, children and non-natives in the human-machine interaction modality.

 255

 Cédrick Fairon, Sébastien Paumier

 A framework for real time dictionary updating

 256

 Vicente Alabau and Carlos D. Martínez

 Bilingual speech corpus in two phonetically similar languages

 258

 Vincent Vandeghinste, Ineke Schuurman, Michael Carl, Stella Markantonatou, Toni Badia

 METIS-II: Machine Translation for Low Resource Languages

 259

 Elisabeth D’Halleweyn, Jan Odijk, Lisanne Teunissen, Catia Cucchiarini

 The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources

 261

 Nuria GALA and André Valli

 A computational lexicon of French verbal subcategorization frames

 262

 Emiko Suzuki and Kyoko Kakihana

 On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language

 266

 Daniela Goecke, Andreas Witt

 Exploiting logical document structure and text type structure for anaphora resolution

 269

 Catalina Hallett, Donia Scott and Richard Power

 Textual rendering of complex post-coordinated clinical concepts from a medical ontology

 270

 Cédrick Fairon, Sébastien Paumier

 A translated corpus of 30,000 French SMS

 273

 Feng Zou, Fu Lee Wang, Xiaotie Deng, Song Han

 Evaluation of Stop Word List in Chinese Language

 275

 Anna Sinopalnikova and Pavel Smrz

 Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements

 277

 Florian Schiel, Moritz Kaiser, Hannes Mögele

 German SmartWeb UMTS Speech Data Collection

 278

 Žabokrtský, Z., Lopatková, M., Skwarska, K.

 Valency Lexicon of Czech Verbs: Alternation-Based Model

 279

 Darinka Verdonik, Matej Rojc

 Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems

 280

 Florian Schiel, Hannes Mögele, Moritz Kaiser

 Bikers Accessing the Web: The SmartWeb Motorbike Corpus

 281

 Asanee Kawtrakul

 Ontology Driven Knowledge Portal Construction

 282

 Dan Cristea, Corina Forascu, Ionut Cristian Pistol

 Temporality in relation with discourse structure

 283

 Eva Hajičová, Petr Sgall

 Corpus Annotation As a Test of a Linguistic Theory

 285

 Ondrej Bojar and Magdalena Prokopova

 Czech-English Word Alignment

 286

 Sergio Scalise, Antonietta Bisetto, Emiliano Guevara, Chiara Melloni

 MORBO/COMP: a multilingual database of compound words

 288

 Daniel Sonntag and Massimo Romanelli

 A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

 292

 Nicole Grégoire

 A Standard Lexical Representation of Multiword Expressions

 293

 Anders Green, Helge Hüttenrauch, Elin Anna Topp, Kerstin Severinson Eklundh

 Developing a Contextualized Multimodal Corpus for Human-Robot Interaction

 294

 Wei-Yun Ma, Chu-Ren Huang

 Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

 295

 Ana-Maria Barbu, Emil Ionescu, Verginica Barbu Mititelu

 Romanian Valence Dictionary in XML Format

 296

 Niels Ole Bernsen, Thomas K. Hansen, Svend Kiilerich and Torben Kruchov Madsen

 Field Evaluation of a Single-Word Pronunciation Training System

 297

 Wei Li, Wenjie Li, Qin Lu

 Mining the Implicit Entities in the Queries

 298

 Kiyotaka Uchimoto, Ryoji Hamabe, Takehiko Maruyama, Katsuya Takanashi, Tatsuya Kawahara, and Hitoshi Isahara

 Dependency-structure Annotation to Corpus of Spontaneous Japanese

 299

 N. Areta, A. Gurrutxaga, I. Leturia, Z. Polin, R. Saiz (Elhuyar) I. Alegria, X. Artola, A. Diaz de Ilarraza, N. Ezeiza, A. Sologaistoa, A. Soroa, A. Valverde (IXA)

 STRUCTURE, ANNOTATION AND TOOLS IN THE BASQUE ZT CORPUS

 301

 Joseph Polifroni and Marilyn Walker

 Learning Database Content for Spoken Dialogue System Design

 304

 Jorge Civera and Alfons Juan

 Bilingual Machine-Aided Indexing

 305

 Panunzi Alessandro, Fabbri Marco, Moneglia Massimo

 INTEGRATING METHODS AND LRs FOR AUTOMATIC KEYWORD EXTRACTION FROM OPEN DOMAIN TEXTS

 306

 Luís Fernando Costa

 Component evaluation in a question answering system

 307

 Stefan Breuer, Sven Bergmann, Ralf Dragon, Sebastian Möller

 Set-up of a Unit-Selection Synthesis with a Prominent Voice

 308

 Nasredine Semmar, Meriama Laib and Christian Fluhr

 Using a Deep Linguistic Analysis in Cross-language Information Retrieval

 309

 Susana Inácio and Diana Santos

 Annotating COMPARA, a grammar-aware parallel corpus

 310

 Lina Henriksen, Claus Povlsen, Andrejs Vasiljevs

 EuroTermBank - a terminology resource based on best practice

 311

 Florbela Barreto, António Branco, Eduardo Ferreira, Amália Mendes, Maria Fernanda Bacelar do Nascimento, Filipe Nunes, João Ricardo Silva

 A Large High Quality Linguistically Interpreted Open Corpus of Portuguese: the TagShare Corpus

 312

 Luís Sarmento, Belinda Maia, Diana Santos, Luís Cabral, Ana Sofia Pinto

 Corpógrafo V3 From simple word concordance to semi-automatic knowledge engineering

 313

 Liviu P. Dinu

 On the data base of Romanian syllables

 314

 Alessandro Bahgat, Fabio Massimo Zanzotto

 A Dependency-based Algorithm for Grammar Conversion

 315

 Khurshid Ahmad, Maria Teresa Musacchio and Giuseppe Palumbo

 Ontological Terminological Commitments and the Discourse of Specialist Communities

 318

 Alessandro Oltramari

 FrameNet as a conceptual window to WordNet

 319

 Chloe Clavel, Ioana Vasilescu, Laurence Devillers, Thibaut Ehrette, Gaël Richard

 The SAFE Corpus: fear-type emotions detection for surveillance applications

 320

 Arne Mauser, Evgeny Matusov, Hermann Ney

 Training a Statistical Machine Translation System without GIZA++

 323

 Evie Coussé and Steven Gillis

 Regional Bias in the Broad Phonetic Transcription of the Spoken Dutch Corpus

 324

 Atsushi Fujii, Makoto Iwayama, Noriko Kando

 Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIR Workshop

 326

 Željko Agić, Marko Tadić

 Evaluating Morphosyntactic Tagging of Croatian Texts

 328

 Martin Rajman, Marita Ailomaa, Agnes Lisowska, Mirek Melichar, Susan Armstrong

 Extending the Wizard of Oz Technique for Multimodal Language-enabled Systems

 329

 Gertjan van Noord, Ineke Schuurman, Vincent Vandeghinste

 Syntactic Annotation of Large Corpora in STEVIN

 332

 Davide Buscaldi, Paolo Rosso

 Mining Knowledge from Wikipedia for the Question Answering Task

 333

 Sabine Schulte im Walde

 Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet

 336

 Peter W. Wagacha, Guy De Pauw and Pauline W. Githinji

 Automatic Diacritic Placemenent using Memory-Based Learning: a Gikuyu Language Case Study

 338

 Juan José Rodríguez Soler, Pedro Concejero Cerezo, Daniel Tapias Merino, Alberto José Sánchez

 MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España

 339

 Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado, Manfred Pinkal

 The SALSA corpus: a German corpus resource for lexical semantics

 340

 Ralf Steinberger, Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaz Erjavec, Dan Tufis, Dániel Varga

 The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages

 341

 Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado

 SALTO — A Versatile Multi-Level Annotation Tool

 342

 Veronique Hoste and Guy De Pauw

 KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

 343

 Philipp Cimiano, Matthias Hartung and Esther Ratsch

 Finding the appropriate generalization level for binary ontological relations extracted from the Genia corpus

 344

 Tomoyuki Kato, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano

 Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria

 345

 Margot Mieskes, Michael Strube

 Part-of-Speech Tagging of Transcribed Speech

 346

 Branimir Boguraev and Rie Kubota Ando

 Analysis of TimeBank as a resource for TimeML Parsing

 347

 Daisuke Kawahara and Sadao Kurohashi

 Case Frame Compilation from the Web using High-Performance Computing

 348

 Patrick Paroubek, Anne Vilnat, Isabelle Robba, Christelle Ayache, Louis-Gabriel Pouillot

 EASY: Evaluation Campaign of Syntactic Parsers of French

 349

 Toshiyuki Kanamaru, Masaki Murata, Hitoshi Isahara

 Creating a Japanese Adverb Dictionary That Includes Information About Speaker’s Communicative Intention Using Machine Learning

 350

 Gregory Grefenstette, Fathi Debili, Christian Fluhr, Svitlana Zinger

 Exploiting text for extracting image processing resources

 351

 Naoaki Okazaki and Sophia Ananiadou

 Clustering acronyms in biomedical text for disambiguation

 352

 Goran Nenadic, Naoki Okazaki, Sophia Ananiadou

 Towards a terminological resource for biomedical text mining

 356

 Hans Hjelm

 Extraction of Cross Language Term Correspondences

 357

 David Guthrie, Ben Allison, Wei Liu, Louise Guthrie, Yorick Wilks

 A Closer Look at Skip-gram Modelling

 359

 Dobrov B., Loukachevitch N.

 Development of Linguistic Ontology on Natural Sciences and Technology

 360

 Matthew W. Bilotti and Eric Nyberg

 Evaluation for Scenario Question Answering Systems

 362

 Loukachevitch N., Dobrov B.

 Information Significance as a Factor of Inclusion of Multiword Expressions to Computational Resources

 363

 Dirk Bühler, Wolfgang Minker

 STOCHASTIC SPOKEN NATURAL LANGUAGE PARSING IN THE FRAMEWORK OF THE FRENCH MEDIA EVALUATION CAMPAIGN

 364

 Stephan Oepen and Jan Tore Lønning

 Discriminant-Based MRS Banking

 365

 György Szarvas, Richárd Farkas, László Felföldi, András Kocsor

 Highly accurate Named Entity corpus for Hungarian

 366

 Thierry Declerck and Mihaela Vela

 A Generic NLP Tool for Supporting Shallow Ontology Building

 367

 Julia Ritz, Ulrich Heid

 Extraction tools for collocations and their morphosyntactic specificities

 368

 Luke Nezda, Andrew Hickl, John Lehmann, and Sarmad Fayyaz

 What in the world is a Shahab? Wide Coverage Named Entity Recognition for Arabic

 372

 Massimo Poesio, Philippe Goux, and Mijail A. Kabadjov

 AN ANAPHORA RESOLUTION-BASED ANONYMIZATION MODULE

 375

 Manfred Sailer and Beata Trawinski

 The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research

 376

 Danie J. Prinsloo, Elsabé Taljard, Ulrich Heid

 Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho

 378

 Maria Clara Paixão de Sousa and Thorsten Trippel

 Building a historical corpus for Classical Portuguese: some technological aspects

 379

 Maria Teresa Pazienza, Marco Pennacchiotti, Fabio Massimo Zanzotto

 Mixing WordNet, Verbnet and PropBank for studying verb relations

 381

 Leo Wanner and Margarita Alonso Ramos

 Local Document Relevance Clustering in IR Using Collocation Information

 384

 Andrea Esuli and Fabrizio Sebastiani

 SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining

 386

 Alexandre Denis, Matthieu Quignard, Guillaume Pitel

 A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

 388

 Felix Sasaki

 Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

 389

 Margaret King and Nancy Underwood

 Evaluating Symbiotic Systems: the challenge

 390

 A. Chalamandaris, A. Protopapas, P. Tsiakoulis, S. Raptis

 All Greek to me! An automatic Greeklish to Greek transliteration system

 392

 Thurid Vogt, Elisabeth Andre

 Improving Automatic Emotion Recognition from Speech via Gender Differentiation

 394

 Khurshid Ahmad, Lee Gillam and David Cheng

 Sentiments on a Grid: Analysis of Streaming News and Views

 395

 Briony Williams and Ivan Uemlianin

 A procedure for facilitating the development of manual letter-to-sound rules for the Festival speech synthesis framework

 397

 Thierry Declerck, Ovidiu Vela, Asunción Gómez Pérez, Zeno Gantner,and David Manzano-Macho

 Multilingual Lexical Resources for Ontology translation

 398

 Beatrice Alex, Malvina Nissim and Claire Grover

 The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

 404

 Ben Wellner and Marc Vilain

 Leveraging Machine Readable Dictionaries in Discriminative Sequence Models

 405

 Sasa Hasan, Anas El Isbihani, Hermann Ney

 Creating a Large-Scale Arabic to French Statistical Machine Translation System

 407

 Chen Yirong,Lu Qin,Li Wenjie,Sui Zhifang

 A Study on Terminology Extraction Based on Classified Corpora

 408

 Anna Estelles Palanca, María Amparo Alcina Caudet, Victoria Soler Puertes

 Retrieving terminological data from the TxtCeram tagged domain corpus: first step on a terminological ontology.

 409

 Violetta Cavalli-Sforza, Abdelhadi Soudi

 IMORPHE: An Inheritance and Equivalence Based Morphology Description Compiler

 410

 Laurianne Sitbon and Patrice Bellot

 Tools and methods for topic segmentation of texts and contextual evaluation

 411

 L. Devillers, R. Cowie, J-C. Martin, E. Douglas-Cowie

 Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

 412

 Maja Popovic, Hermann Ney

 POS-based Word Reorderings for Statistical Machine Translation

 413

 David Vilar, Jia Xu, Hermann Ney, Luis Fernando D’Haro

 Error Analysis of Statistical Machine Translation Output

 414

 Castellón, Irene; Fernández-Montraveta, Ana; Vázquez, Gloria; Alonso, Laura; Capilla, Joan Antonio

 The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level

 415

 Javier Pérez, Antonio Bonafonte

 GAIA: A Common Framework for the Development of Speech Translation Techonologies

 416

 Attila Novák

 Morphological Tools for Six Small Uralic Languages

 417

 Javier Pérez, Antonio Bonafonte, Horst-Udo Hain, Eric Keller, Stefan Breuer and Jilei Tian

 ECESS Inter-Module Interface Specification for Speech Synthesis

 418

 Petr Nemec

 Tree Searching/Rewriting Interpreter Tool

 420

 Maite Taboada, Caroline Anthony and Kimberly Voll

 Methods for creating semantic orientation dictionaries

 421

 Mahmoud Shokrollahi-Far,Antal van den Bosch, Harry Bunt, Walter Daelemans, Ayaz Isazadeh

 Bootstrapping POS Tags on the Vocalism of Classical Arabic…

 422

 Hans C. Boas

 CoLangDoc: A web-based tool for collaborative research on endangered languages

 423

 Masaki Itagaki, Anthony Aue, and Takako Aikawa

 Detecting Inter-domain Semantic Shift using Syntactic Similarity

 424

 Pablo D. Agüero, Jordi Adell, Mónica Caballero, Antonio Bonafonte and Asunción Moreno

 Database pruning and unit recovering for unit selection speech synthesis

 427

 Hynek BOŘIL, Tomáš BOŘIL, Petr POLLÁK

 Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

 428

 Harry Bunt

 Design Principles for Multidimensional Dialogue Act Annotation Schemes

 429

 Gabriel Murray and Ben Hachey

 Improved Automatic Evaluation of Summaries Using a Large Semantic Space

 430

 Olivier Baude, Michel Jacobson, Atanas Tchobanov & Richard Walter

 Interoperability of the sound corpora: the case of the French corpora

 431

 Pavel Pecina, Pavel Schlesinger

 Evaluation of Classification Methods for Collocation Extraction

 433

 R. Bernardi, D. Calvanese, L. Dini, V. Di Tommaso, E. Frasnelli, B. Plank

 Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

 435

 Irene Langkilde-Geary and Justin Betteridge

 A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation

 436

 Jim Talley

 Bootstrapping new language ASR capabilities: Achieving best letter-to-sound performance under resource constraints

 438

 Eduard Hovy, Chin-Yew Lin, Liang Zhou, and Junichi Fukumoto

 Automated Summarization Evaluation with Basic Elements

 439

 Hiroyuki Kaji and Mariko Watanabe

 Automatic Construction of Japanese WordNet

 440

 Marie-Catherine de Marneffe, Bill MacCartney, Christopher D. Manning

 Generating Typed Dependency Parses from Phrase Structure Parses

 441

 Sónia Frota, Marina Vigário & Fernando Martins

 FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text

 442

 Ulrik Petersen

 Querying both parallel and treebank corpora: Evaluation of a corpus query system

 443

 Liang Zhou, Chin-Yew Lin, and Eduard Hovy

 Summarizing Answers for Complicated Questions

 445

 Monica Monachini, Nicoletta Calzolari, Khalid Choukri, Jochen Friedrich, Michele Mammini, Jan Odijk, Marisa Ulivieri

 Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

 446

 Ronny Melz, Pum-Mo Ryu, Key-Sun Choi

 Compiling a large Korean IT-related language resource by language independent methods for domain-specific taxonomy learning

 448

 Felix Pîrvan, Dan Tufis

 Tagsets Mapping and Statistical Training Data Cleaning-up

 451

 Dan Tufiş, Elena Irimia

 RoCo-News: A Hand Validated Journalistic Corpus of Romanian

 452

 Eckhard Bick

 Turning a Dependency Treebank into a PSG-style Constituent Treebank

 453

 Dan Ştefănescu, Dan Tufiş

 Aligning multilingual thesauri

 454

 Radu Ion, Alexandru Ceauşu, Dan Tufiş

 Dependency-Based Phrase Alignment

 456

 Alexandru Ceauşu, Dan Ştefănescu, Dan Tufiş

 Acquis Communautaire sentence alignment using Support Vector Machines

 457

 Claire Grover and Richard Tobin

 Rule-Based Chunking and Reusability

 459

 Baden Hughes, Timothy Baldwin, Steven Bird, Jeremy Nicholson and Andrew MacKinlay

 Reconsidering Language Identification for Written Language Resources

 460

 Sun-Mee Bae and Key-Sun Choi

 A Comparison of Conceptual Maps Between Korean and Chinese Words in CoreNet

 461

 Yasuko SENDA, Yasusi SINOHARA, Manabu OKUMURA

 Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing

 463

 Lundälv, Mats, Mühlenbock, Katarina, Farre, Bengt and Brännström, Annika

 SYMBERED - a symbol-concept editing tool

 464

 Yi Zhang, Valia Kordoni

 Automated Deep Lexical Acquisition for Robust Open Texts Processing

 465

 Martin, J.-C., Caridakis, G., Devillers, L., Karpouzis, K., Abrilian, S.

 Manual Annotation and Image Processing of Multimodal Emotional Behaviours in TV Interviews

 467

 Cvetana Krstev, Ranka Stanković, Duško Vitas, and Ivan Obradović

 WS4LR: A Workstation for Lexical Resources

 468

 Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer

 Extending VerbNet with novel verb classes

 469

 James Pustejovsky, Catherine Havasi, Roser Sauri, Patrick Hanks, Jessica Littman, Anna Rumshisky, Jose Castano, Marc Verhagen

 Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

 471

 Hans Dybkjær and Laila Dybkjær

 Act-Topic Patterns for Automatically Checking Dialogue Models

 472

 Rojas, David M. and Takako Aikawa

 Predicting MT quality as a function of the source language

 473

 Pawel Mazur and Robert Dale

 Named Entity Extraction with Conjunction Disambiguation

 477

 Beatrice Alex

 Integrating Language Knowledge Resources to Extend the English Inclusion Classifier to a New Language

 478

 Griet Depoorter, Remco van Veenendaal

 Functioning of the Centre for Dutch Language and Speech Technology (TST Centre)

 480

 Bente Maegaard, Lene Offersgaard, Lina Henriksen, Costanza Navarretta, Claus Povlsen, Hanne Jansen, Xavier Lepetit

 The MULINCO corpus and corpus platform

 481

 Claudia Soria, Maurizio Tesconi, Francesca Bertagna, Nicoletta Calzolari, Andrea Marchetti, Monica Monachini

 Moving to dynamic computational lexicons with LeXFlow

 482

 Caroline Sporleder, Marieke van Erp, Tijn Porcelijn, Antal van den Bosch and Pim Arntzen

 Identifying Named Entities in Text Databases from the Natural History Domain

 483

 Harold Somers, Gareth Evans and Zeinab Mohamed

 Developing Speech Synthesis for Under-Resourced Languages by ‘Faking it’ : An Experiment with Somali

 484

 Dragos CIOBANU, Anthony HARTLEY, Serge SHAROFF

 Using richly annotated trilingual language resources for acquiring reading skills in a foreign language

 485

 Guido Boella, Alessandro Mazzei, Piercarlo Rossi, Marco Martin

 A Development Tool For Multilingual Ontology-based Conceptual Dictionaries

 487

 Jens-Erik Fenstad, Lars Ahrenberg, Knut Kvale, Bente Maegaard, Katarina Mühlenbock, Bernt-Erik Heid

 KUNSTI - Knowledge Generation for Norwegian Language Technology

 488

 Péter Halácsy, Dániel Varga, Csaba, Oravecz, András Kornai, Viktor Trón

 Using a morphological analyzer in high precision POS tagging of Hungarian

 489

 Toumouh Adil, Widdows Dominic, Lehireche Ahmed, Malki Mimoun

 Adapting WordNet to the Medical Domain using Lexicosyntactic Patterns in the Ohsumed Corpus

 492

 Renata Savy, Francesco Cutugno, Claudia Crocco

 Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).

 493

 Baden Hughes, Dafydd Gibbon and Thorsten Trippel

 Feature-Based Encoding and Querying Language Resources with Character Semantics

 495

 Rajen Subba, Barbara Di Eugenio, Elena Terenzi

 Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

 496

 Kiyotaka Uchimoto, Naoko Hayashida, Toru Ishida, and Hitoshi Isahara

 Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

 497

 Isa Maks and Michel Boekestein

 Exploring opportunities for enrichment by linking lexical databases

 498

 Horacio Saggion

 Multilingual Multidocument Summarization Tools and Evaluation

 500

 Olivier Ferret

 Building a network of topical relations from a corpus

 501

 P. Bouquet, L. Serafini, S. Zanobini

 The role of lexical resources in matching classification schemas

 503

 Manolis Maragoudakis, Katia Kermanidis, Aristogiannis Garbis and Nikos Fakotakis

 Dealing with imbalanced data using Bayesian techniques

 504

 J.M.Benedí, E.Lleida, A.Varona, M.J.Castro, I.Galiano, R.Justo, I.López and A.Miguel

 Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

 505

 Rainer Osswald, Hermann Helbig, Sven Hartrumpf

 The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

 506

 Jean Véronis, Yun-Chuang Chiao

 The ARCADE II project : Evaluation of multilingual text alignment systems

 508

 Francesca Bertagna

 Representation and Inference for Open-Domain QA: strength and limits of two Italian Semantic Lexicons

 509

 Abdelrahim Abdelsapor, Noha Adly, Kareem Darwish, Ossama Emam, Magdi Nagi

 Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents

 510

 Saturnino Luz, Matt-Mouley Bouamrane, Masood Masoodian

 Gathering a corpus of multimodal computer-mediated meetings with focus on text and audio interaction

 511

 A. L. Dimou, A. Chalamandaris

 Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations

 513

 Roger Levy and Galen Andrew

 Tregex and Tsurgeon: tools for querying and manipulating tree data structures

 515

 L. Gillard, P. Bellot, M. El-Bèze

 EVALUATIONS OF QUESTION ANSWERING AND EVALUATIONS OF THE EVALUATION

 518

 B. Magnini, E. Pianta, C. Girardi, M. Negri, L. Romano, M. Speranza, and R. Sprugnoli

 I-CAB: the Italian Content Annotation Bank

 521

 Bente Maegaard, Steven Krauwer, Khalid Choukri, Lise Damsgaard Jørgensen

 The BLARK concept and BLARK for Arabic

 524

 Gabriella Pardelli, Manuela Sassi, Sara Goggi, Paola Orsolini

 Natural Language Processing: A Terminological and Statistical Approach

 525

 Suzan Verberne, Lou Boves, Nelleke Oostdijk

 Data for question answering: the case of ’why’

 527

 Kiril Simov and Petya Osenova

 Shallow Semantic Annotation of Bulgarian

 530

 Christopher Cieri, Walt Andrews, Joseph P. Campbell, George Doddington, Jack Godfrey, Shudong Huang, Mark Liberman, Alvin Martin, Hirotaka Nakasone, Mark Przybocki, Kevin Walker

 The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

 531

 Laurence Devillers, Jean-Claude Martin, Sarkis Abrilian

 Annotation of Emotions in Real-Life Video Interviews: Variability between Coders

 532

 Ya-Min Chou, Chu-Ren Huang

 Hantology-A Linguistic Resource for Chinese Language Processing and Studying

 534

 Voula Giouli, Penny Labropoulou, Maria Gavrilidou, Stelios Piperidis, Monica Monachini, Claudia Soria, Nicoletta Calzolari, Khalid Choukri

 Language resources production models: the case of INTERA multilingual corpus and terminology

 535

 Kyoko Kanzaki,Qing Ma,Eiko Yamamoto and Hitoshi Isahara

 Semantic Analysis of Abstract Nouns to compile a thesaurus of Adjectives

 537

 Katrin Erk and Sebastian Pado

 FRED and ROSY — A Flexible Toolbox For Semantic Role Assignment

 538

 Valentin Tablan, Tamara Polajnar, Hamish Cunningham, Kalina Bontcheva

 User-friendly ontology authoring using a controlled language

 539

 Laura Hasler and Constantin Orasan

 Towards cross-document event annotation

 540

 Amália Mendes, Maria Fernanda Bacelar do Nascimento, Sandra Antunes, Luísa Pereira

 COMBINA-PT: a large corpus-extracted and hand-checked lexical database of Portuguese multiword expressions

 541

 David Graff, Tim Buckwalter, Mohamed Maamouri, Hubert Jin

 Lexicon Development for Varieties of Spoken Dialectal Arabic

 542

 Alexandre Patry, Fabrizio Gotti, Philippe Langlais

 MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

 543

 Mohamed Maamouri, Ann Bies, Tim Buckwalter, Mona Diab, Nizar Habash, Owen Rambow, Dalila Tabessi

 Developing and Using a Pilot Dialectal Arabic Treebank

 544

 Beata Bandmann Megyesi, Eva Csato Johansson, Anna Sågvall Hein

 Building a Swedish-Turkish Parallel Corpus

 545

 Horacio Saggion, Robert Gaizauskas

 Language Resources for Background Gathering

 547

 Jennifer Doyon, Keith Miller, John Henderson, Linda Van Guilder, Steven Van Guilder, Kristian Concepcion, Florence Reeder

 Document Exploitation Improvement by Component Evaluation

 550

 Julie Medero, Kazuaki Maeda, Stephanie Strassel, Christopher Walker

 An Efficient Approach to Gold Standard Annotation: Decision Points for Complex Tasks

 552

 Stefan Klatt

 A Corpus-Based Approach to the Interpretation of Unknown Words with an Application to German

 553

 Sophie Rosset, Sandra Petel

 The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus

 554

 Anna Feldman, Jirka Hana, Chris Brew

 A cross-language approach to rapid creation of new morpho-syntactically annotated resources

 557

 Konstantinos I. Diamantaras, Ionas Michailidis, Spyros Vasileiadis, Yannick Frère

 Greek Named Entities Recognition using Support Vector Machines, Maximum Entropy Models and Onetime

 558

 Anna Korhonen, Yuval Krymolowski, Ted Briscoe

 A Large Subcategorization Lexicon for Natural Language Processing Applications

 560

 Nancy Ide, Keith Suderman

 An Open Linguistic Infrastructure for American English

 562

 Nancy Ide, Laurent Romary

 Representing Linguistic Corpora and Their Annotations

 565

 Zhongqiang Huang, Lei Chen, Mary Harper

 An Open Source Prosodic Feature Extraction Tool

 566

 Alina Andreevskaia and Sabine Bergler

 SEMANTIC Tag Extraction from WordNet Glosses

 567

 Kow Kuroda and Hitoshi Isahara

 Getting Deeper Semantics than Berkeley FrameNet with MSFA

 569

 Enrique Alfonseca, Antonio Moreno-Sandoval, Jose Maria Guirao and Maria Ruiz-Casado

 The wraetlic NLP suite

 570

 Tomoko Ohta, Yuka Tateisi, Jin-Dong Kim, Akane Yakushiji, Jun-ichi Tsujii

 Linguistic and Biological Annotations of Biological Interaction Events

 573

 Knut Hofland, Paul Meurer, Kari Tenfjord

 The ASK corpus - a language learner corpus of Norwegian as a second language

 575

 Ivana Kruijff-Korbayova, Klara Chvatalova, Oana Postolache

 Annotation Guidelines for Czech-English Word Alignment

 576

 Jérémie Segouat, Annelies Braffort and Emilie Martin

 Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data

 577

 Gil Francopoulo, Monte George, Nicoletta Calzolari, Monica Monachini, Nuria Bel, Mandy Pet, Claudia Soria

 Lexical Markup Framework (LMF)

 578

 Bruno Pouliquen, Marco Kimler, Ralf Steinberger, Camelia Ignat, Tamara Oellinger, Ken Blackler, Flavio Fluart, Wajdi Zaghouani, Anna Widiger, Ann-Charlotte Forslund, Clive Best

 Geocoding multilingual texts: recognition, disambiguation and visualisation

 580

 Bolette Sandford Pedersen

 Search on Danish compounds using shallow linguistic techniques

 581

 Thatsanee Charoenporn, Canasai Kruengkrai, Thanaruk Theeramunkong, Virach Sornlertlamvanich, and Hitoshi Isahara

 Word Knowledge Acquisition for Computational Lexicon Construction

 582

 Wim Peters, Maria Teresa Sagri, Daniela Tiscornia, Sara Castagnoli

 The LOIS Project

 583

 Joris Vaneyghen, Dirk Van Compernolle

 A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

 584

 Adriana Roventini - Nilda Ruimy

 Linking Verbal Entries of Different Lexical Resources Linking Verbal Entries of Different Lexical Resources

 586

 K Choukri, M Dabbadie, O Hamon, A Hartley, W Mustafa El Hadi, A Popescu-Belis, M Rajman, S Surcin, I Timimi

 CESTA Machine Translation Evaluation Campaign: towards a reliable, reusable protocol

 589

 Marie-Jeanne Derouin, André Le Meur

 Lemma-oriented dictionaries, concept-oriented terminology and translation memories

 590

 Martí Umbert, Asunción Moreno, Pablo Aguero, Antonio Bonafonte

 Spanish Synthesis Corpora

 591

 Pavel Ircing, Jan Hoidekr, Josef Psutka

 Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

 593

 Milena Slavcheva

 Semantic Descriptors: The Case of Reflexive Verbs

 595

 Guenter Neumann and Berthold Crysmann

 Exploring HPSG-based Treebanks for Probabilistic Parsing

 596

 Rita Marinelli, Remo Bindi

 Proper Names and Linguistic Dynamics

 597

 Sonja Bosch, Jackie Jones, Laurette Pretorius

 Towards machine-readable lexicons for South African Bantu languages

 599

 Le Chenadec G., Maffiolo V., Chateau N. and Colletta J.M.

 Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction

 600

 Yoshihiko Hayashi, Toru Ishida

 A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

 602

 Rita Marinelli, Adriana Roventini

 Using Core Ontology for Domain Lexicon Structuring

 603

 W. Abramowicz, A. Filipowska, K. Wecel, K. Wieloch, J. Piskorski

 Linguistic Suite for Polish Cadastral System

 604

 Hiromichi Kawanami, Takahiro Kitamura and Kiyohiro Shikano

 Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

 606

 Pavel Smrz, Vit Novacek

 Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

 608

 Bernardi R., Bolognesi A., Seidenari C., Tamburini F.

 POS tagset design for Italian

 610

 Kentaro Inui, Toru Hirano, Ryu Iida, Atsushi Fujita and Yuji Matsumoto

 Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences

 611

 Onelli C., Proietti D., Seidenari C., Tamburini F.

 The DiaCORIS project: a diachronic corpus of written Italian

 612

 Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Pociello

 A preliminary study for building the Basque PropBank

 613

 Eiko Yamamoto, Kyoko Kanzaki, and Hitoshi Isahara

 Detection of inconsistencies in concept classifications in a large dictionary — Toward an improvement of the EDR electronic dictionary —

 614

 Eneko Agirre, Izaskun Aldezabal, Jone Etxeberria, Eli Izagirre, Karmele Mendizabal, Eli Pociello and Mikel Quintian

 A methodology for the joint development of the Basque WordNet and Semcor

 615

 Jan Hoidekr, J.V. Psutka, Aleš Pražák, Josef Psutka

 Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

 616

 Javier Cuétara-Priede, Luis Villaseñor-Pineda, Manuel Montes-y-Gómez, Manuel Alberto Pérez-Coutiño

 TranscríbEMex: An automatic phonetic transcriber for the Mexican Spanish

 617

 Stefanie Anstein, Gerhard Kremer, Uwe Reyle

 Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

 618

 Mathieu Lafourcade

 Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence

 619

 Mariona Taulé, Roser Morante, Juan Aparicio, M Antònia Martí

 3LB-LEX and 3LB SEM lexical resources for Spanish and Catalan

 620

 Ernesto William De Luca and Andreas Nürnberger

 Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods.

 621

 Pavel Smrz, Vit Novacek

 Automatic Acquisition of Semantics-Extraction Patterns from Parallel Texts

 622

 Lea Cyrus

 Building a resource for studying translation shifts

 623

 Christoph Draxler

 Speech Recordings in Public Schools in Germany — the Perfect Show Case for Web-based Recordings and Annotation

 624

 B K Tsou, B Y Lai, K K Sin, L Y L Cheung

 Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort

 625

 Lars Hellan & Dorothee Beermann

 Word Sense Disambiguation and Semantic Construction Analysis for Deep Processing Grammars

 626

 Dennis Reidsma, Dirk Heylen, Roeland Ordelman

 Annotating emotion in meetings

 627

 H. Bonneau-Maynard, C. Ayache, F. Bechet, A. Denis, , A. Kuhn, F. Lefevre, D. Mostefa, M. Quignard, S. Rosset, C. Servan, J. Villaneau

 Results of the French Evalda-Media evaluation campaign for literal understanding

 629

 Jonas Kuhn, Michael Jellinghaus

 Multilingual parallel treebanking: a lean and flexible approach

 630

 Julie Mauclair, Yannick Estève, Simon Petit-Renaud, Paul Deléglise

 Automatic detection of well recognized words in automatic speech transcription

 631

 Francesca Bertagna, Federico Calzolari, Nicoletta Calzolari, Sebastiana Cucurullo, Alessandro Enea, Monica Monachini, Eugenio Picchi, Manuela Sassi, Eva Sassolini, Claudia Soria

 Next Generation Language Resources using GRID

 632

 Sanni Nimb

 LEXADV - a multilingual semantic lexicon for adverbs

 633

 Voula Giouli, Alexis Konstadinidis, Elina Desipri, Harris Papageorgiou, Stelios Piperidis

 Multi-domain Multi-lingual Named Entity Recognition: Re-visiting & grounding the resources issue

 634

 Rebecca Passonneau and Nizar Habash and Owen Rambow

 Inter-annotator agreement on a multilingual semantic annotation task

 636

 Rebecca Passonneau

 Measuring Agreement on Set-Valued Items (MASI) for Semantic and Pragmatic Annotation

 637

 Tomoyosi Akiba

 Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue

 638

 Marc Verhagen, Robert Knippen, Inderjeet Mani, and James Pustejovsky

 Annotation of TimeML Relations with Tango

 639

 Patrizia Paggio

 Information structure and pauses in a corpus of spoken Danish

 641

 Quasthoff, Uwe; Biemann, Chris; Richter, Matthias

 Corpus Portal for Search in 16 Monolingual Corpora

 642

 Tristan Vanrullen, Philippe Blache & Jean-Marie Balfourier

 Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

 643

 Andreas Eisele

 Parallel corpora and phrase-based statistical MT for new language pairs via multiple intermediaries

 644

 Jae-Ho Kim, Sun-Mee Bae and Key-Sun Choi

 Automatic Generation of Medical Term Definition from Large Corpora

 645

 Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J. Miller, Teruko Mitamura, Florence Reeder, Advaith Siddharthan

 Parallel Syntactic Annotation of Multiple Languages

 646

 Sylvain Galliano, Edouard Geoffrois, Jean-François Bonastre, Guillaume Gravier, Djamel Mostefa, Khalid Choukri.

 Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

 647

 Juan José Rodríguez Soler, Lucía Palacios Fernández, Pedro Concejero Cerezo, Carlos Lázaro Ávila, Daniel Tapias Merino

 Usability evaluation of 3G multimodal services in Telefónica Móviles España

 649

 Márton Miháltz, Gábor Pohl

 Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

 650

 Porfírio Filipe and Nuno Mamede

 A FRAMEWORK TO INTEGRATE UBIQUITOUS KNOWLEDGE MODELING

 651

 Felice Dell’Orletta, Alessandro Lenci, Simonetta Montemagni, Vito Pirrelli

 Searching Treebanks for Functional Constraints: Crosslingual Experiments in Grammatical Relations Assignment

 652

 Declerck Thierry

 SynAF: Towards a standard for Syntactic Annotation

 653

 Christelle Ayache, Brigitte Grau, Anne Vilnat

 Evaluation of question-answering systems : The French EQueR-EVALDA Evaluation Campaign

 654

 Maria Fernanda Bacelar do Nascimento, José Bettencourt Gonçalves, Luísa Pereira, Antónia Estrela, Afonso Pereira

 The African Varieties of Portuguese: Compiling Comparable Corpora and analysing data-derived lexicons

 655

 Richard Evans and Viktor Pekar

 BiRD: A prototype system to build and exploit resource databases for researchers

 656

 Benjamin K. Tsou and Oi Yee Kwong

 Toward a Pan-Chinese Thesaurus

 657

 Nelleke Oostdijk, Lou Boves

 User requirements analysis for the design of a reference corpus of written Dutch

 658

 Brigitte Grau, Anne-Laure Ligozat, Isabelle Robba, Anne Vilnat

 FRASQUES : a Question Answering system in the EQueR evaluation campaign

 661

 Gábor Hodász

 Evaluation Methods of a Linguistically Enriched Translation Memory System

 664

 José João Almeida, Alberto Simões

 T2O - Recycling Thesauri into a Multilingual Ontology

 666

 Saba Amsalu

 Data-driven Amharic-English Bilingual Lexicon Acquisition

 667

 Jörg Tiedemann

 ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts

 669

 Wolfgang Minker, Heiko Neumann, Günther Palm, Petra-Maria Strauß, Harald Traue, Ulrich Weidenbacher

 WIZARD-OF-OZ DATA COLLECTION FOR PERCEPTION AND INTERACTION IN MULTI-USER ENVIRONMENTS

 670

 Nancy Underwood, Agnes Lisowska

 The evolution of an evaluation framework for a text mining system

 672

 Eline Westerhout, Paola Monachesi

 A pilot study for a Corpus of Dutch Aphasic Speech (CODAS)

 673

 Jan Bungeroth, Daniel Stein, Philippe Dreuw, Morteza Zahedi and Hermann Ney

 A German Sign Language Corpus of the Domain Weather Report

 674

 Roberto Bartolini, Caterina Caracciolo, Emiliano Giovannetti, Alessandro Lenci, Simone Marchi, Vito Pirrelli, Chiara Renso, Laura Spinsanti

 Creation and Use of Lexicons and Ontologies for Natural Language Interface to Databases

 676

 MULLONI, Andrea; PEKAR, Viktor

 AUTOMATIC DETECTION OF ORTHOGRAPHIC CUES FOR COGNATES RECOGNITION

 677

 Timothy Baldwin and Su’ad Awab

 Open Source Corpus Analysis Tools for Malay

 678

 Fidelia Ibekwe-SanJuan

 A task-oriented framework for evaluating theme detection systems.

 679

 Asunción Moreno, Albert Febrer, Lluis Márquez

 Generation of Language Resources for the Development of Speech Technologies in Catalan

 680

 Georgiana Puscasu and Ruslan Mitkov

 If it were then, then when was it? Establishing the anaphoric role of “then”

 683

 Viktor Trón, Péter Halácsy, Péter Rebrus, András Rung, Péter Vajda, Eszter Simon

 Morphdb.hu: Hungarian lexical database and morphological grammar

 685

 Hong Phuong Le, Thi Minh Huyen Nguyen, Laurent Romary, Azim Roussanaly

 A Lexicalized Tree Adjoining Grammar for Vietnamese

 687

 Silvie Cinkova, Pavel Pecina, Petr Podvesky, Pavel Schlesinger

 Semi-automatic Building of Swedish Collocation Lexicon

 688

 Dominika Oliver and Krzysztof Szklanny

 Creation and analysis of a Polish speech database for use in unit selection speech synthesis

 689

 Jorge Kinoshita Laís do Nascimento Salvador Carlos Eduardo Dantas de Menezes

 CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus

 691

 Stefan Schaden

 Evaluation of automatically generated transcriptions of non-native pronunciations using a phonetic distance measure

 692

 Isabella Chiari

 Slips and errors in spoken data transcription

 697

 Motoko Ueyama

 Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval

 700

 José Iria, Christopher Brewster, Fabio Ciravegna and Yorick Wilks

 AN INCREMENTAL TRI-PARTITE APPROACH TO ONTOLOGY LEARNING

 701

 Thomas Pellegrini and Lori Lamel

 Experimental detection of vowel prononciation variants in Amharic

 702

 R. Catizone, A. Dalli and Y. WIlks

 A Methodology for Evaluating Timelines

 704

 Ivana Kruijff-Korbayova, Tilman Becker, Nate Blaylock, Ciprian Gerstenberger, Michael Kaisser, Peter Poller, Verena Rieser, Jan Schehl

 The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

 705

 Rebecca Passonneau, Roberta Blitz, David Elson, Angela Giral, and Judith Klavans

 CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project

 706

 Anna Rumshisky, James Pustejovsky

 Inducing sense-discriminating context patterns from sense-tagged corpora

 707

 Milen Kouylekov and Bernardo Magnini

 Building a Large-Scale Repository of Textual Entailment Rules

 708

 Alessandro Moschitti and Roberto Basili

 A Tree Kernel approach to Question and Answer Classification in Question Answering Systems

 709

 C. d’ Alessandro, P. Boula de Mareüil, G. Bailly, M.-N. Garcia, M. Morel

 A joint intelligibility evaluation of French text to speech systems: the EVASY/SUS campaign

 710

 Winston Anderson and Petro Kotzé

 Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho

 711

 Couturier, Jean-François; Drouin, Patrick; Neuvel, Sylvain

 Applying lexical constraints on morpho-syntactic patterns for the identification of conceptual relational content in specialized texts.

 712

 Katerina Pastra

 Beyond Multimedia Integration Mechanisms: in need of extending multimedia annotation schemes

 715

 Yoshihiko Nitta, Masashi Saraki, Satoru Ikehara

 Building Carefully Tagged Bi-lingual Corpora to Cope with Linguistic Idiosyncrasy

 716

 Roser Saurí, Marc Verhagen, James Pustejovsky

 SlinkET: A Partial Modal Parser for Events

 717

 Christopher Cieri, Mark Liberman

 More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities

 719

 A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. van den Heuvel, H.U. Hain, X. S. Wang, M. N. Garcia

 TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

 720

 Miriam Voghera, Francesco Cutugno

 An Observatory on Spoken Italian Linguistic Resources and Descriptive Standards

 721

 Kamel Smaïli, Caroline Lavecchia and J.P. Haton

 Linguistic Features Modeling Based on Partial New Cache Model

 723

 Pum-Mo Ryu, Key-Sun Choi, and Ronny Melz

 Taxonomy Learning using Term Specificity and Similarity

 724

 Stefan Schulz, Kornél Markó, Percy Nohama, Roosewelt Leite de Andrade, Edson José Pacheco, Udo Hahn, Martin Romacker

 The MORPHOSAURUS Medical Subword Lexicon

 725

 Marko Salmenkivi

 Finding representative sets of dialect words for geographical regions

 726

 Olga Uryupina

 Coreference resolution with and without linguistic knowledge

 727

 Keith Miller and Michelle Vanni

 Modern Standard Arabic v. Dialect: Comparing Arabic-English MT Performance

 729

 O Hamon, M Rajman

 X-Score : Automatic Evaluation of Machine Translation Grammaticality

 730

 Andrew Roberts, Latifa Al-Sulaiti and Eric Atwell

 A comparison of concordancers for Arabic: WordSmith, Mono-Conc, Xaira and aConCorde

 733

 Roberto Navigli

 Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory

 735

 Dafydd Gibbon, Flaviane Romani Fernandes, Thorsten Trippel

 A BLARK extension for temporal annotation mining

 736

 Michael Schiehlen and Kristina Spranger

 The Mass-Count Distinction: Acquisition and Disambiguation

 737

 Lars Nygaard & Stephan Oepen

 Identifying Transparent Compounds in a Large Computational Lexicon

 738

 Andrew W. Cole

 Corpus Development and Publication

 739

 Shu-Chuan Tseng, Dafydd Gibbon

 Discourse functions of duration in Mandarin

 740

 Leonardo Lesmo Livio Robaldo

 From Natural Language to Databases via Ontologies

 742

 Adeline Nazarenko, Erick Alphonse, Julien Derivière, Thierry Hamon, Guillaume Vauvert, Davy Weissenbacher

 The ALVIS Format for Linguistically Annotated Documents

 744

 Alexander Raake and Brian FG Katz

 SUS-based Method for Speech Reception Threshold Measurement in French

 745

 Stephanie Strassel, Christopher Cieri, Andrew Cole, Denise DiPersio, Mark Liberman, Xiaoyi Ma, Mohamed Maamouri

 Integrated Linguistic Resources for Language Exploitation Technologies

 746

 Xiaoyi Ma

 Champollion: A Robust Parallel Text Sentence Aligner

 748

 Fabian Behrens & Jan-Torsten Milde

 The Eclipse Annotator: an extensible system for multimodal corpus creation

 749

 Harris Papageorgiou, Elina Desipri, Maria Koutsombogera, Kanella Pouli, Andromaxi Griva, Prokopis Prokopidis

 Adding multi-layer semantics to the Greek Dependency Treebank

 750

 Cristina BOSCO, Vincenzo LOMBARDO

 Comparing linguistic information in treebank annotations

 752

 Nella Cucurullo, Simonetta Montemagni, Matilde Paoli, Eugenio Picchi, Eva Sassolini

 Dialectal resources on-line: the ALT-Web experience

 754

 Xiaoyi Ma, Chris Cieri

 Corpus Support for Machine Translation at LDC

 755

 Ann Bies, Stephanie Strassel, Haejoong Lee, Kazuaki Maeda, Seth Kulick, Yang Liu, Mary Harper, Matthew Lease

 Linguistic Resources for Speech Parsing

 756

 Tomasz Obrebski, Michal Stolarski

 UAM Text Tools - a flexible NLP architecture

 757

 Markus Geilfuss & Jan-Torsten Milde

 SAM - an annotation editor for parallel texts

 758

 Hans Uszkoreit, Feiyu Xu, Joerg Steffen, Ilhan Aslan

 The pragmatic combination of different crosslingual resources for multilingual information services

 759

 Nizar Habash, Clinton Mah, Sabiha Imran, Randy Calistri-Yeh and Paraic Sheridan

 Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

 760

 Grażyna Vetulani, Zygmunt Vetulani, Tomasz Obrębski

 Syntactic Lexicon of Polish Predicative Nouns

 762

 Alonge Antonietta

 The Italian Metaphor Database

 763

 Georgiana Puscasu

 How to identify temporal clauses?

 765

 Jose Iria, Fabio Ciravegna

 A Methodology and Tool for Representing Language Resources for Information Extraction

 767

 Harry Halpin

 Automatic Evaluation and Compositon of NLP Pipelines with Web Services

 768

 C. de Loupy

 The OuRAL Project

 769

 Harry Bunt and Amanda Schiffrin

 Methodological aspects of semantic annotation

 770

 Whitney Gegg-Harrison, Donna Byron

 PYCOT: An OT-Based Pronoun Resolution Tool

 772

 Emma Barker, Ryuichiro Higashinaka, François Mairesse, Robert Gaizauskas, Marilyn Walker, Jonathan Foster

 Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

 774

 Jeongwoo Ko, Laurie Hiyakumoto, Eric Nyberg

 Exploiting Multiple Semantic Resources for Answer Selection

 776

 Kazuaki Maeda, Christopher Cieri and Kevin Walker

 Low-cost Customized Speech Corpus Creation for Speech Technology Applications

 777

 John Niekrasz and Alexander Gruenstein

 NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Discourse

 778

 Christoph Benzmueller, Helmut Horacek, Henri Lesourd, Ivana Kruijff-Korbayova, Marvin Schiller, Magdalena Wolska

 A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material.

 779

 J.Laoudi, C. Tate, C. Voss

 Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding

 780

 Kazuaki Maeda, Haejoong Lee, Julie Medero, Stephanie Strassel

 A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

 782

 Hideki Shima, Mengqiu Wang, Frank Lin, Teruko Mitamura

 Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

 783

 Jeongwoo Ko, Fumihiko Murase, Teruko Mitamura, Eric Nyberg, Masahiko Tateishi, and Ichiro Akahori

 Analyzing the Effects of Spoken Dialog Systems on Driving Behavior

 787

 Magesh Balasubramanya, Peter Lucas, Michael Higgins and Dominic Widdows

 Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

 788

 Vasile Rus, Art Graesser

 The Look and Feel of a Textual Entailer

 789

 R.Gaizauskas, M. Hepple and H.Harkema

 Indirect Evaluation of Parsers Using Dependency Relations to Assess Semantic Output

 790

 Gregory Marton and Boris Katz

 Automatic TREC Relationship Answering and Nugget Evaluation

 792

 Finley Lacatusu, Andrew Hickl, Sanda Harabagiu

 Impact of Question Decomposition on the Quality of Answer Summaries

 793

 Sanda M. Harabagiu, Cristina Nicolae, Gabriel Nicolae

 Coreference by Consolidation

 794

 Sanda Harabagiu and Cosmin Adrian Bejan

 An Answer Bank for Temporal Inference

 795

 Paul C Morarescu

 Principles for annotating and reasoning with spatial information

 796

 Li Sujian, Lu Qin, Li Wenjie, Xu Ruifeng

 Interact between Lexical Base and Ontology with Formal Concept Analysis

 798

 Rachada Kongkachandra and Kosin Chamnongthai

 SEMANTIC-BASED KEYWORD RECOVERY FUNCTION FOR KEYWORD EXTRACTION SYSTEM

 799

 Ruifeng Xu, Qin Lu and Sujian Li

 The Design and Construction of Chinese Collocation Bank

 800

 Nilda Ruimy, Adriana Roventini

 Merging two Ontology-based Lexical Resources

 801

 Nimaan Abdillahi, Nocera Pascal

 Towards automatic transcription of Somali language

 802

 Susanne Burger, Zachary A. Sloane, Jie Yang

 Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages

 803

 Kornel Laskowski and Susanne Burger

 Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus

 805

 W. Black, S. Elkateb, H. Rodriguez, M. Alkhalifa, P. Vossen, A. Pease, C. Fellbaum

 Introducing the Arabic WordNet Project

 806

 Benoît Sagot and Pierre Boullier

 Deep non-probabilistic parsing of large corpora

 807

 Magnar Brekke, Kai Innselset, Marita Kristiansen & Kari Øvsthus

 KB-N: Automatic Term Extraction from Knowledge Bank of Economics

 808

 Alexander Klassmann, Freddy Offenga, Daan Broeder, Romuald Skiba, Peter Wittenburg

 Comparison of Resource Discovery Methods

 809

 Peter Lucas, Magesh Balasubramanya, and Dominic Widdows

 The Information Commons Gazetteer: A Public Resource of Populated Places and Worldwide Administrative Divisions

 810

 Benoît Sagot, Lionel Clément, Eric Villemonte de La Clergerie and Pierre Boullier

 The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

 811

 Nilda Ruimy

 Structuring a Domain Vocabulary in a General Knowledge Environment

 812

 Alexander Geyken, Norbert Schrader

 LexikoNet - a lexical database based on type and role hierarchies

 813

 Djamel Mostefa ; Olivier Hamon, Khalid CHOUKRI et. al.

 TC-STAR: Evaluation of speech to speech components

 814

 Djamel Mostefa ; Marie-Neige Garcia, Olivier Hamon, Khalid CHOUKRI et. al.

 CHIL: Evaluation of multimodal technology components

 815

 Carol Peters

 The Impact of Evaluation on Multilingual Information Retrieval System Development

 816

 Carol Peters , Bernardo Magnini, Christelle Ayache, et. al.

 CLEF Q-A special track

 817

 Marie-Neige Garcia , Christophe d’Alessandro , Philippe Boula de Mareüil

 A joint prosody evaluation of French text-to-speech systems: the EVASY Prosody campaign

Copyright © 2006 ELRA