AUTHORS: Browse articles of the conference sorted by authors

 

A - B - C - D - E - F - G - H - I - J - K - L - M - N - O - P - Q - R - S - T - U - V - W - X - Y - Z

Author Titles
A  
Abdelsapor, A. Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images
Abe, A. Features of Terms in Actual Nursing Activities
Abramowicz, W. Linguistic Suite for Polish Cadastral System
Abrilian, S. Annotation of Emotions in Real-Life Video Interviews: Variability between Coders

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Adly, N. Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images
Agić, Ž. Evaluating Morphosyntactic Tagging of Croatian Texts
Agirre, E. A methodology for the joint development of the Basque WordNet and Semcor

A Preliminary Study for Building the Basque PropBank
Ågren, M. CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling
Agüero, P. Spanish Synthesis Corpora
Ahmad, K. Ontological and Terminological Commitments and the Discourse of Specialist Communities

Visual Surveillance and Video Annotation and Description

Sentiments on a Grid: Analysis of Streaming News and Views
Aholab, I. Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Ahrenberg, L. KUNSTI - Knowledge Generation for Norwegian Language Technology
Aikawa, T. Detecting Inter-domain Semantic Shift using Syntactic Similarity

Predicting MT Quality as a Function of the Source Language
Ailomaa, M. Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems
Ajani, G. A Development Tool For Multilingual Ontology-based Conceptual
Ajot, J. Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech
Akahori, I. Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
Akam, H. Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network
Akiba, T. Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue
Al-kufaishi, A. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Alabau, V. Bilingual speech corpus in two phonetically similar languages
Alcina, A. Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology
Aldezabal, I. A methodology for the joint development of the Basque WordNet and Semcor

A Preliminary Study for Building the Basque PropBank
Alegria, I. Structure, Annotation and Tools in the Basque ZT Corpus
Alex, B. The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Alessandro, P. Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts
Alfonseca, E. Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

The wraetlic NLP suite
Alkhalifa, M. Building a WordNet for Arabic
Allison, B. A Closer Look at Skip-gram Modelling
Almeida, J. T2O - Recycling Thesauri into a Multilingual Ontology
Alonge, A. The Italian Metaphor Database
Alonso, L. The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level
Alphonse, E. The ALVIS Format for Linguistically Annotated Documents
Amdal, I. FonDat1: A Speech Synthesis Corpus for Norwegian
Amigó, E. Iqmt: A Framework for Automatic Machine Translation Evaluation
Amsalu, S. Data-driven Amharic-English Bilingual Lexicon Acquisition
Ananiadou, S. Clustering acronyms in biomedical text for disambiguation

Towards a terminological resource for biomedical text mining
Anderson, W. Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho
Ando, R. Analysis of TimeBank as a Resource for TimeML Parsing
Andrade, R. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Andrassy, B. Human and machine recognition as a function of SNR
Andre, E. Improving Automatic Emotion Recognition from Speech via Gender Differentiaion
Andreevskaia, A. Semantic Tag Extraction from WordNet Glosses
Andrew, G. Tregex and Tsurgeon: tools for querying and manipulating tree data structures
Andrews, W. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Anstein, S. Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Anthony, C. Methods for Creating Semantic Orientation Dictionaries
Antoine, J. Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
Antunes, S. COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Apresjan, J. A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
Areta, N. Structure, Annotation and Tools in the Basque ZT Corpus
Aribau, E. Building a WordNet for Arabic
Armstrong, S. Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems
Arnaiz, A. A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
Arntzen, P. Identifying Named Entities in Text Databases from the Natural History Domain
Artola, X. Structure, Annotation and Tools in the Basque ZT Corpus
Asahara, M. An Annotated Corpus Management Tool: ChaKi
Aslan, I. The pragmatic combination of different crosslingual resources
Assem, M. Conversion of WordNet to a standard RDF/OWL representation
Athanassia-lida, D. Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations.
Atserias, J. FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Attia, M. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Aue, A. Detecting Inter-domain Semantic Shift using Syntactic Similarity
Aunimo, L. The Multilingual Question Answering Track at CLEF
Ávila, C. Usability evaluation of 3G multimodal services in Telefónica Móviles España
Awab, S. Open Source Corpus Analysis Tools for Malay
Ayache, C. Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.

Results of the French Evalda-Media evaluation campaign for literal understanding

The Multilingual Question Answering Track at CLEF

EQueR: the French Evaluation campaign of Question-Answering Systems
B  
Babych, B. Using collocations from comparable corpora to find translation equivalents
Badia, T. METIS-II: Machine Translation for Low Resource Languages
Bae , H. A Methodology for Developing Multilingual Resources for Terminology
Bahrani, M. Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems
Bailly, G. A joint prosody evaluation of French text-to-speech synthesis systems

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign

Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?
Balasubramanya, M. Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

The Information Commons Gazetteer
Baldwin, T. Reconsidering Language Identification for Written Language Resources

Open Source Corpus Analysis Tools for Malay
Balfourier, J. Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Banchs, R. Acceptance Testing of a Spoken Language Translation System
Barbu, A. Romanian Valence Dictionary in XML Format
Barker, E. Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives
Barreto, F. Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Bartolini, R. Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Basili, R. A Tree Kernel approach to Question and Answer Classification in Question Answering Systems
Baude, O. Interoperability of audio corpora : the case of the French corpora
Bechet, F. Results of the French Evalda-Media evaluation campaign for literal understanding
Beck, D. Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components
Becker, T. The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Beermann, D. Word Sense Disambiguation and Semantic Disambiguation for Construction Types in Deep Processing Grammars
Behrens, F. The Eclipse Annotator: an extensible system for multimodal corpus creation
Bel, N. Lexical Markup Framework (LMF)

New tools for the encoding of lexical data extracted from corpus
Bellot, P. Question Answering Evaluation Survey

Tools and methods for objective or contextual evaluation of topic segmentation
Bendahman, C. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Bendazzoli, C. Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC)
Benedí, J. Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
Bennett, C. Visual Surveillance and Video Annotation and Description
Benzmüller, C. A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material
Berck, P. ANNEX - a web-based Framework for Exploiting Annotated Media Resources

Ontology-based Language Archive Utilization
Bergler, S. Semantic Tag Extraction from WordNet Glosses
Berglund, A. Extraction of Temporal Information from Texts in Swedish
Bergmann, S. Set-up of a Unit-Selection Synthesis with a Prominent Voice
Berlocher, I. Morphological annotation of Korean with Directly Maintainable Resources
Bernardi, R. Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

POS tagset design for Italian
Bernsen, N. H. C. Andersen Conversation Corpus

Field Evaluation of a Single-Word Pronunciation Training System
Bertagna, F. Moving to dynamic computational lexicons with LeXFlow

Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons

Next Generation Language Resources using Grid
Besacier, L. A French Non-Native Corpus for Automatic Speech Recognition
Best, C. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
Betteridge, J. A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation
Bibiko, H. Ontology-based Language Archive Utilization
Bick, E. Turning a Dependency Treebank into a PSG-style Constituent Treebank
Biemann, C. Corpus Portal for Search in Monolingual Corpora
Bies, A. Developing and Using a Pilot Dialectal Arabic Treebank

Linguistic Resources for Speech Parsing
Bilotti, M. Evaluation for Scenario Question Answering Systems
Bindi, R. Proper Names and Linguistic Dynamics
Bird, S. Reconsidering Language Identification for Written Language Resources
Bisetto, A. MORBO/COMP: a multilingual database of compound words
Bishop, E. An Anaphora Resolution-Based Anonymization Module
Blache, P. Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Black, W. Building a WordNet for Arabic
Blackler, K. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
Blaylock, N. The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Blitz, R. CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project
Boekestein, M. Functioning of the Centre for Dutch Language and Speech Technology
Boelhouwer, B. Exploring opportunities for Comparability and Enrichment by Linking lexical databases
Boella, G. A Development Tool For Multilingual Ontology-based Conceptual
Bogacka, A. General and Task-Specific Corpus Resources for Polish Adult Learners of English
Boguraev, B. Analysis of TimeBank as a Resource for TimeML Parsing
Boguslavsky, I. A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
Bojar, O. Czech-English Word Alignment
Bolognesi, A. POS tagset design for Italian
Bonafonte, A. ECESS Inter-Module Interface Specification for Speech Synthesis

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Acceptance Testing of a Spoken Language Translation System

Spanish Synthesis Corpora

GAIA: Common Framework for the Development of Speech Translation Technologies
Bonastre, J. Towards automatic transcription of Somali language

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
Bonneau-maynard, H. Results of the French Evalda-Media evaluation campaign for literal understanding
Bontcheva, K. User-friendly ontology authoring using a controlled language
Bordoni, L. Towards an Ontology for Art and Colours
Boril, H. Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
Boril, T. Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
Bosch, A. Identifying Named Entities in Text Databases from the Natural History Domain

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development
Bosch, S. Towards machine-readable lexicons for South African Bantu languages
Bosco, C. Comparing linguistic information in treebank annotations
Bouamrane, M. Gathering a corpus of multimodal computer-mediated meetings
Bouillon, P. REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Boullier, P. The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

Deep non-probabilistic parsing of large corpora
Bouquet, P. The role of lexical resources in matching classification schemas
Boves, L. User requirements analysis for the design of a reference corpus of written Dutch

Data for question answering: The case of why

A Unified Structure for Dutch Dialect Dictionary Data
Braffort, A. Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data
Braga, D. Progmatica: A Prosodic Database for European Portuguese
Branco, A. Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Brekke, M. Automatic Term Extraction from Knowledge Bank of Economics
Breuer, S. Set-up of a Unit-Selection Synthesis with a Prominent Voice

ECESS Inter-Module Interface Specification for Speech Synthesis
Brew, C. A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources
Brewster, C. An Incremental Tri-Partite Approach To Ontology Learning
Briscoe, T. A Large Subcategorization Lexicon for Natural Language Processing Applications
Broeder, D. Comparison of Resource Discovery Methods

LAMUS: the Language Archive Management and Upload System

Foundations of Modern Language Resource Archives

Metadata Profile in the ISO Data Category Registry

Technologies for a Federation of Language Resource Archives
Brugman, H. A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs

ELAN: a Professional Framework for Multimodality Research
Brännström, A. SYMBERED - a Symbol-Concept Editing Tool
Buckwalter, T. Developing and Using a Pilot Dialectal Arabic Treebank

Lexicon Development for Varieties of Spoken Colloquial Arabic
Buehler, D. Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
Buitelaar, P. Ontology-based Information Extraction with SOBA
Bungeroth, J. A German Sign Language Corpus of the Domain Weather Report
Bunt, H. Methodological Aspects of Semantic Annotation

Dimensions in Dialogue Act Annotation
Burchardt, A. SALTO - A Versatile Multi-Level Annotation Tool

The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Burger, S. Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages

Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
Buscaldi, D. Mining Knowledge fromWikipedia for the Question Answering task
Bustamante, F. A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications
Byron, D. PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
C  
Cabral, L. Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
Cabré, M. SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations
Calistri-yeh, R. Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Calvanese, D. Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano
Calzolari, F. Next Generation Language Resources using Grid
Calzolari, N. Moving to dynamic computational lexicons with LeXFlow

Lexical Markup Framework (LMF)

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Next Generation Language Resources using Grid
Campbell, J. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Campbell, N. Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
Capilla, J. The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level
Caracciolo, C. Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Cardoso, N. HAREM: An Advanced NER Evaluation Contest for Portuguese
Caridakis, G. Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Carl, M. METIS-II: Machine Translation for Low Resource Languages
Carletta, J. Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus
Cartoni, B. Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.
Casas, B. FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Caselli, T. Annotating Bridging Anaphors in Italian: in Search of Reliability
Castagnoli, S. The LOIS Project
Casteleiro, J. COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Castellón, I. The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level
Castro, M. Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
Catizone, R. Evaluating Automatically Generated Timelines from the Web
Cavalli-sforza, V. IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler
Ceauşu, A. Dependency-Based Phrase Alignment

Acquis Communautaire Sentence Alignment using Support Vector Machines
Cerezo, P. MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España

Usability evaluation of 3G multimodal services in Telefónica Móviles España
Chalamandaris, A. All Greek to me! An automatic Greeklish to Greek transliteration system

Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations.
Chalvin, A. Dictionary Building with the Jibiki Platform: the GDEF case
Chamnongthai, K. Semantic-Based Keyword Recovery Function for Keyword Extraction System
Charniak, E. SParseval: Evaluation Metrics for Parsing Speech
Charoenporn, T. Word Knowledge Acquisition for Computational Lexicon Construction
Chateau, N. Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction
Chatzichrisafis, N. REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Chaudiron, S. Techno-langue: The French National Initiative for Human Language Technologies (HLT)
Chen, H. Constructing a Named Entity Ontology from Web Corpora

Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks
Chen, L. An Open Source Prosodic Feature Extraction Tool
Chenfour, N. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Cheng, D. Sentiments on a Grid: Analysis of Streaming News and Views
Chesley, P. Automatic extraction of subcategorization frames for French
Cheung, L. Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort
Chiao, Y. Evaluation of multilingual text alignment systems: the ARCADE II project

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model
Chiari, I. Slips and errors in spoken data transcription
Choi, K. Compiling large language resources using lexical similarity metrics for domain taxonomy learning
Chou, Y. Hantology-A Linguistic Resource for Chinese Language Processing and Studying
Choukri, K. Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

The BLARK concept and BLARK for Arabic

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Evaluation of multimodal components within CHIL: The evaluation packages and results

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

TC-STAR: New language resources for ASR and SLT purposes
Chvatalova, K. Annotation Guidelines for Czech-English Word Alignment
Cieri, C. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Integrated Linguistic Resources for Language Exploitation Technologies

Low-cost Customized Speech Corpus Creation for Speech Technology Applications

More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities

Corpus Support for Machine Translation at LDC
Cimiano, P. Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

Ontology-based Information Extraction with SOBA
Cinkova, S. Semi-automatic Building of Swedish Collocation Lexicon
Cinková, S. From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description
Ciobanu, D. Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
Ciravegna, F. A Methodology and Tool for Representing Language Resources for Information Extraction

An Incremental Tri-Partite Approach To Ontology Learning
Civera, J. Bilingual Machine-Aided Indexing
Claus, A. LAMUS: the Language Archive Management and Upload System
Clavel, C. Fear-type emotions of the SAFE Corpus: annotation issues
Clément, L. The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
Coelho, L. Progmatica: A Prosodic Database for European Portuguese
Cole, A. Integrated Linguistic Resources for Language Exploitation Technologies

Corpus Development and Publication
Colletta, J. Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction
Comelles, E. FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Compernolle, D. A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Copestake, A. Preprocessing and Tokenisation Standards in DELPH-IN Tools
Coppen, P. Data for question answering: The case of why
Corti, L. An Anaphora Resolution-Based Anonymization Module
Costa, L. Component Evaluation in a Question Answering System
Coussé, E. Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Couturier, J. Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
Cowie, R. Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
Cramer, I. Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
Cristea, D. Temporality in relation with discourse structure

Transferring Coreference Chains through Word Alignment
Crocco, C. Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
Crysmann, B. Exploring HPSG-based Treebanks for Probabilistic Parsing HPSG grammar extraction
Csirik, J. A highly accurate Named Entity corpus for Hungarian
Cucchiarini, C. JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources
Cucurullo, N. Dialectal resources on-line: the ALT-Web experience
Cucurullo, S. Next Generation Language Resources using Grid
Cunningham, H. User-friendly ontology authoring using a controlled language

Creating Tools for Morphological Analysis of Sumerian
Cutugno, F. Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).

An observatory on Spoken Italian linguistic resources and descriptive standards.
Cvetko-orešnik, V. SI-PRON: A Pronunciation Lexicon for Slovenian
Cyrus, L. Building a resource for studying translation shifts
D  
D'halleweyn, E. The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources
D'haro, L. Error Analysis of Statistical Machine Translation Output
Dabbadie, M. Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
Daelemans, W. A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Dale, R. Named Entity Extraction with Conjunction Disambiguation
Dalianis, H. Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST's Lemmatiser
Dalli, A. Evaluating Automatically Generated Timelines from the Web
Dannélls, D. Recognizing Acronyms and their Definitions in Swedish Medical Texts
Darwish, K. Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images
Daumke, P. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Debili, F. Exploiting text for extracting image processing resources
Declerck, T. SynAF: Towards a Standard for Syntactic Annotation

Multilingual Lexical Semantic Resources for Ontology Translation

Generic NLP Tools for Supporting Shallow Ontology Building
Dell’orletta, F. Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
Deléglise, P. Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Deng, X. Towards Unified Chinese Segmentation Algorithm

Evaluation of Stop Word Lists in Chinese Language
Denis, A. A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

Results of the French Evalda-Media evaluation campaign for literal understanding
Depoorter, G. Functioning of the Centre for Dutch Language and Speech Technology
Derivière, J. The ALVIS Format for Linguistically Annotated Documents
Derouin, M. Lemma-oriented dictionaries, concept-oriented terminology and translation memories
Desipri, E. Adding multi-layer semantics to the Greek Dependency Treebank
Desypri, E. Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue
Devillers, L. Annotation of Emotions in Real-Life Video Interviews: Variability between Coders

Fear-type emotions of the SAFE Corpus: annotation issues

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Dhonnchadha, E. A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation
Diab, M. Developing and Using a Pilot Dialectal Arabic Treebank
Diamantaras, K. Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime
Diaz de Ilarraza, A. Structure, Annotation and Tools in the Basque ZT Corpus
Dini, L. Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano
Dinu, A. On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
Dinu, L. On the data base of Romanian syllables and some of its quantitative and cryptographic aspects
Dipersio, D. Integrated Linguistic Resources for Language Exploitation Technologies
Dobrov, B. Development of Linguistic Ontology on Natural Sciences and Technology
Doddington, G. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Dorow, B. Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain
Dorr, B. SParseval: Evaluation Metrics for Parsing Speech

Parallel Syntactic Annotation of Multiple Languages
Douglas-cowie, E. Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
Douxchamps, D. Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
Dragon, R. Set-up of a Unit-Selection Synthesis with a Prominent Voice
Draxler, C. Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation
Dreuw, P. A German Sign Language Corpus of the Domain Weather Report
Drouin, P. Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
Ducret, J. Metadata Profile in the ISO Data Category Registry

An API for accessing the Data Category Registry
Dutoit, D. Alexandria: A Powerful Multilingual Resource for Web
Dybkjær, H. Act-Topic Patterns for Automatically Checking Dialogue Models
Dybkjær, L. H. C. Andersen Conversation Corpus

Act-Topic Patterns for Automatically Checking Dialogue Models
Dziubalska-kołaczyk, K. General and Task-Specific Corpus Resources for Polish Adult Learners of English
D’alessandro, C. A joint prosody evaluation of French text-to-speech synthesis systems

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign
Džeroski, S. Towards a Slovene Dependency Treebank
Díaz, E. Spelling Error Patterns in Spanish for Word Processing Applications
E  
Ehrette, T. Fear-type emotions of the SAFE Corpus: annotation issues
Eisele, A. Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries
El-bèze, M. Question Answering Evaluation Survey
Elisei, F. Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?
Elkateb, S. Building a WordNet for Arabic
Elson, D. CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project
Emam, O. Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images
Enea, A. Next Generation Language Resources using Grid
Erjavec, T. Towards a Slovene Dependency Treebank

The English-Slovene ACQUIS corpus

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

Building Slovene WordNet
Erk, K. SALTO - A Versatile Multi-Level Annotation Tool

Shalmaneser - A Toolchain For Shallow Semantic Parsing

The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Erp, M. Identifying Named Entities in Text Databases from the Natural History Domain
Espeja, S. New tools for the encoding of lexical data extracted from corpus
Estellés, A. Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology
Estrela, A. The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Estrella, P. A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
Estève, Y. Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Esuli, A. SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining
Etxeberria, J. A methodology for the joint development of the Basque WordNet and Semcor

A Preliminary Study for Building the Basque PropBank
Eugenio, B. Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Evans, G. Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
Ezeiza, N. Structure, Annotation and Tools in the Basque ZT Corpus
F  
Fairon, C. A translated corpus of 30,000 French SMS

A framework for real-time dictionary updating
Fakotakis, N. Dealing with Imbalanced Data using Bayesian Techniques
Farkas, R. A highly accurate Named Entity corpus for Hungarian
Farre, B. SYMBERED - a Symbol-Concept Editing Tool
Farwell, D. Parallel Syntactic Annotation of Multiple Languages
Fayyaz, S. What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic
Febrer, A. Generation of Language Resources for the Development of Speech Technologies in Catalan
Feldman, A. A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources
Felföldi, L. A highly accurate Named Entity corpus for Hungarian
Feliu, J. SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations
Fellbaum, C. Building a WordNet for Arabic
Fenstad, J. KUNSTI - Knowledge Generation for Norwegian Language Technology
Fernandes, F. A BLARK extension for temporal annotation mining
Fernández-montraveta, A. The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level
Ferreira, E. Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Ferret, O. Building a network of topical relations from a corpus
Fersøe, H. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Filipe, P. A Framework to Integrate Ubiquitous Knowledge Modeling
Filipowska, A. Linguistic Suite for Polish Cadastral System
Fiscus, J. Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech
Fišer, T. Building Slovene WordNet
Fliedner, G. Towards Natural Interactive Question Answering
Fluart, F. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
Fluhr, C. Exploiting text for extracting image processing resources

A Deep Linguistic Analysis for Cross-language Information Retrieval
Forăscu, C. Temporality in relation with discourse structure
Forslund, A. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
Forst, M. The importance of precise tokenizing for deep grammars

Improving coverage and parsing quality of a large-scale LFG for German
Fosler-lussier, E. The OSU Quake 2004 corpus of two-party situated problem-solving dialogs
Foster, J. Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives
Francopoulo, G. Lexical Markup Framework (LMF)
Frank, A. SALTO - A Versatile Multi-Level Annotation Tool

The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Frasnelli, E. Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano
Freitas, D. Progmatica: A Prosodic Database for European Portuguese
Friedrich, J. Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Frota, S. FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text
Frère, Y. Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime
Fujii, A. Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIRWorkshop

Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus
Fujita, A. Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences
Fukumoto, J. Automated Summarization Evaluation with Basic Elements.
G  
Gaizauskas, R. Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

Language Resources for Background Gathering
Galiano, I. Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
Galliano, S. Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
Gangemi, A. Conversion of WordNet to a standard RDF/OWL representation
Gantner, Z. Multilingual Lexical Semantic Resources for Ontology Translation
Garbis, A. Dealing with Imbalanced Data using Bayesian Techniques
Garcia, M. A joint prosody evaluation of French text-to-speech synthesis systems

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign

Evaluation of multimodal components within CHIL: The evaluation packages and results
Gavrilidou, M. Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Gazendam, L. A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
Gegg-harrison, W. PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit
Geilfuss, M. SAM - an annotation editor for parallel texts
Genabith, J. A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation
Geoffrois, E. Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
George, M. Lexical Markup Framework (LMF)
Georgescul, M. TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface
Gerstenberger, C. The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Geyken, A. LexikoNet - a lexical database based on type and role hierarchies
Giampicollo, D. The Multilingual Question Answering Track at CLEF
Gibbon, D. A BLARK extension for temporal annotation mining

Discourse functions of duration in Mandarin: resource design and implementation

Feature-based Encoding and Querying Language Resources with Character Semantics
Gillam, L. Sentiments on a Grid: Analysis of Streaming News and Views
Gillard, L. Question Answering Evaluation Survey
Gillis, S. Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus
Giménez, J. Iqmt: A Framework for Automatic Machine Translation Evaluation
Ginés, M. A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker
Giouli, V. Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Giovannetti, E. Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Giral, A. CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project
Girardi, C. I-CAB: the Italian Content Annotation Bank
Githinji, P. A Grapheme-Based Approach for Accent Restoration in Gikuyu
Godfrey, J. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Goecke, D. Exploiting logical document structure for anaphora resolution
Goggi, S. Natural Language Processing: A Terminological and Statistical Approach
Gollan, C. TC-STAR: New language resources for ASR and SLT purposes
González, M. FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Gonçalves, J. The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Gotti, F. MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
Goux, P. An Anaphora Resolution-Based Anonymization Module
Graesser, A. The Look and Feel of a Confident Entailer
Graff, D. Lexicon Development for Varieties of Spoken Colloquial Arabic
Granfeldt, J. CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling
Grau, B. FRASQUES: A Question Answering system in the EQueR evaluation campaign

EQueR: the French Evaluation campaign of Question-Answering Systems
Gravier, G. Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News
Green, A. Developing a ContextualizedMultimodal Corpus for Human-Robot Interaction
Green, R. Parallel Syntactic Annotation of Multiple Languages
Grefenstette, G. Exploiting text for extracting image processing resources
Grishina, E. Spoken Russian in the Russian National Corpus (RNC)
Gros, J. SI-PRON: A Pronunciation Lexicon for Slovenian
Grover, C. The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Rule-Based Chunking and Reusability
Gruenstein, J. NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
Grégoire, N. Elaborating the parameterized Equivalence Class Method for Dutch
Grønnum, N. DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus
Guevara, E. MORBO/COMP: a multilingual database of compound words
Guirao, J. Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

The wraetlic NLP suite
Gurrutxaga, A. Structure, Annotation and Tools in the Basque ZT Corpus
Guthrie, D. A Closer Look at Skip-gram Modelling
Guthrie, L. A Closer Look at Skip-gram Modelling
Göteborgs, D. Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience
H  
Haamid, S. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Habash, N. Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Developing and Using a Pilot Dialectal Arabic Treebank

Parallel Syntactic Annotation of Multiple Languages

Inter-annotator Agreement on a Multilingual Semantic Annotation Task
Haddad, B. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Hafezi, N. Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems
Hahn, U. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Hain, H. ECESS Inter-Module Interface Specification for Speech Synthesis

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis
Hajic, J. Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
Hajicova, E. Corpus Annotation as a Test of a Linguistic Theory
Hale, J. SParseval: Evaluation Metrics for Parsing Speech
Hall, J. Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Halpin, H. Automatic Evaluation and Composition of NLP Pipelines with Web Services
Halácsy, P. Morphdb.hu: Hungarian lexical database and morphological grammar

Using a morphological analyzer in high precision POS tagging of Hungarian
Hamabe, R. Dependency-structure Annotation to Corpus of Spontaneous Japanese
Hamme, H. JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
Hamon, O. Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model
Hamon, T. The ALVIS Format for Linguistically Annotated Documents
Han, S. Evaluation of Stop Word Lists in Chinese Language
Hana, J. A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources
Hansen, T. Field Evaluation of a Single-Word Pronunciation Training System
Hanser, S. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Harabagiu, S. An Answer Bank for Temporal Inference

Impact of Question Decomposition on the Quality of Answer Summaries
Harper, M. SParseval: Evaluation Metrics for Parsing Speech

Linguistic Resources for Speech Parsing

An Open Source Prosodic Feature Extraction Tool
Hartley, A. CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Using collocations from comparable corpora to find translation equivalents
Hartley, T. Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
Hartrumpf, S. The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
Hartung, M. Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
Hasan, S. Creating a Large-Scale Arabic to French Statistical MachineTranslation System
Hashimoto, K. An Annotated Corpus Management Tool: ChaKi
Hasler, L. NPs for Events: Experiments in Coreference Annotation

Computer-aided summarisation -- what the user really wants
Hassel, M. Towards Holistic Summarization -- Selecting Summaries, Not Sentences
Haton, J. Linguistic features modeling based on Partial New Cache
Havasi, C. Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

BULB: A Unified Lexical Browser
Hayashi, Y. A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons
Hayashida, N. Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Heid, B. KUNSTI - Knowledge Generation for Norwegian Language Technology
Heid, U. Extraction tools for collocations and their morphosyntactic specificities

Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho
Hein, A. Building a Swedish-Turkish Parallel Corpus
Helbig, H. The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
Hellan, L. Word Sense Disambiguation and Semantic Disambiguation for Construction Types in Deep Processing Grammars
Helmreich, S. Parallel Syntactic Annotation of Multiple Languages
Henriksen, L. EuroTermBank - a Terminology Resource based on Best Practice

The MULINCO corpus and corpus platform
Hernáez, I. Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Herwijnen, O. JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
Heuvel, H. Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

A Unified Structure for Dutch Dialect Dictionary Data

TC-STAR: New language resources for ASR and SLT purposes
Heylen, D. Annotating Emotions in Meetings
Hickl, A. What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic

Impact of Question Decomposition on the Quality of Answer Summaries
Higashinaka, R. Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives
Higgins, M. Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

The Information Commons Gazetteer
Hirano, T. Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences
Hiroshi, I. A new approach to syntactic annotation
Hiyakumoto, L. Exploiting Multiple Semantic Resources for Answer Selection
Hjelm, H. Extraction of Cross Language Term Correspondences
Hlaváèová, J. New Approach to Frequency Dictionaries - Czech Example
Hobbs, J. An Annotated Corpus of Typical Durations of Events
Hockey, B. REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Hodász, G. Evaluation Methods of a Linguistically Enriched Translation Memory System
Hoege, H. Human and machine recognition as a function of SNR
Hoffmann, H. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Hofland, K. The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language
Hoidekr, J. Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
Horacek, H. A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material
Hoste, V. KNACK-2002: a Richly Annotated Corpus of Dutch Written Text
Hout, R. A Unified Structure for Dutch Dialect Dictionary Data
Hovy, E. Parallel Syntactic Annotation of Multiple Languages

Summarizing Answers for Complicated Questions

Automated Summarization Evaluation with Basic Elements.
Huang, C. Hantology-A Linguistic Resource for Chinese Language Processing and Studying

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Huang, S. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Huang, Z. An Open Source Prosodic Feature Extraction Tool
Hughes, B. Reconsidering Language Identification for Written Language Resources

Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

Feature-based Encoding and Querying Language Resources with Character Semantics
Huh, K. Morphological annotation of Korean with Directly Maintainable Resources
Human, C. An Answer Bank for Temporal Inference
Höge, H. TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis
Hüttenrauch, H. Developing a ContextualizedMultimodal Corpus for Human-Robot InteractionKeywords: Multimodal corpus, Human-Robot Interaction, Spatiality,Dialogue systems, Gesture
I  
Ibekwe-sanjuan, F. A task-oriented framework for evaluating theme detection systems: A discussion paper
Ide, N. Representing Linguistic Corpora and Their Annotations

Integrating Linguistic Resources: The American National Corpus Model
Ignat, C. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages
Iida, R. Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences
Ikehara, S. Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Imran, S. Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Imura, M. Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
Inagaki, Y. Layered Speech-Act Annotation for Spoken Dialogue Corpus

A Syntactically Annotated Corpus of Japanese Spoken Monologue

A Corpus Search System Utilizing Lexical Dependency Structure
Inkpen, D. Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
Innselset, K. Automatic Term Extraction from Knowledge Bank of Economics
Inui, K. Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences
Inácio, S. Annotating COMPARA, a Grammar-aware Parallel Corpus
Iomdin, B. A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
Iomdin, L. A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
Ion, R. Dependency-Based Phrase Alignment
Ionescu, E. Romanian Valence Dictionary in XML Format
Ircing, P. Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
Iria, J. A Methodology and Tool for Representing Language Resources for Information Extraction

An Incremental Tri-Partite Approach To Ontology Learning
Irie, Y. Layered Speech-Act Annotation for Spoken Dialogue Corpus
Irimia, E. RoCo-News: A Hand Validated Journalistic Corpus of Romanian
Isahara, H. Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Blind Evaluation for Thai Search Engines

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning

Word Knowledge Acquisition for Computational Lexicon Construction

Detection of inconsistencies in concept classifications in a large dictionary - Toward an improvement of the EDR electronic dictionary

A Conditional Random Field Framework for Thai Morphological Analysis

Getting Deeper Semantics than Berkeley FrameNet with MSFA
Isbihani, A. Creating a Large-Scale Arabic to French Statistical MachineTranslation System
Ishida, T. A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Islam, M. Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words
Itagaki, M. Detecting Inter-domain Semantic Shift using Syntactic Similarity
Itahashi, S. Oriental COCOSDA: Past, Present and Future
Itai, A. A Computational Lexicon of Contemporary Hebrew
Itou, K. Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus
Ivanovska-naskova, R. Development of the First LRs for Macedonian: Current Projects
Iwahashi, N. Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
Iwayama, M. Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIRWorkshop
Izagirre, E. A methodology for the joint development of the Basque WordNet and Semcor
J  
Jacobson, M. Interoperability of audio corpora : the case of the French corpora
Jakopin, P. SI-PRON: A Pronunciation Lexicon for Slovenian
Jansen, H. The MULINCO corpus and corpus platform
Jekosch, U. “Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus
Jellinghaus, M. Multilingual parallel treebanking: a lean and flexible approach
Jin, H. Lexicon Development for Varieties of Spoken Colloquial Arabic
Johanson, É. Building a Swedish-Turkish Parallel Corpus
Johansson, C. Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Johansson, R. Extraction of Temporal Information from Texts in Swedish

Construction of a FrameNet Labeler for Swedish Text
Johnson, M. SParseval: Evaluation Metrics for Parsing Speech
Jones, J. Towards machine-readable lexicons for South African Bantu languages
Jones, R. Tools and resources for speech synthesis arising from a Welsh TTS project
Jongejan, B. Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST's Lemmatiser
Jouis, C. Hierarchical Relationships "is-a": Distinguishing Belonging, Inclusion and Part/of Relationships.
Juan, A. Bilingual Machine-Aided Indexing
Jung, J. Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network
Justo, R. Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
Jänsch, K. Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation
Jørgensen, L. The BLARK concept and BLARK for Arabic
K  
Kabadjov, M. An Anaphora Resolution-Based Anonymization Module
Kacic, Z. SINOD - Slovenian non-native speech database
Kacmarcik, G. RefRef: A Tool for Viewing and Exploring Coreference Space
Kageura, K. A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC)
Kahn, J. SParseval: Evaluation Metrics for Parsing Speech
Kaiser, M. Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Kaisser, M. The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Kaji, H. Automatic Construction of Japanese WordNet
Kakihana, K. On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language
Kamp, P. Technologies for a Federation of Language Resource Archives
Kanamaru, T. Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning
Kando, N. Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIRWorkshop
Kanzaki, K. Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives

Detection of inconsistencies in concept classifications in a large dictionary - Toward an improvement of the EDR electronic dictionary
Kaplan, R. The importance of precise tokenizing for deep grammars
Karpouzis, K. Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Kashioka, H. A Syntactically Annotated Corpus of Japanese Spoken Monologue
Kato, N. A Syntactically Annotated Corpus of Japanese Spoken Monologue
Kato, T. Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria
Kato, Y. A Corpus Search System Utilizing Lexical Dependency Structure
Katz, B. Using Semantic Overlap Scoring in Answering TREC Relationship Questions

US-based Method for Speech Reception Threshold Measurement in French
Kawaguchi, N. Layered Speech-Act Annotation for Spoken Dialogue Corpus
Kawahara, D. Case Frame Compilation from the Web using High-Performance Computing
Kawahara, T. Dependency-structure Annotation to Corpus of Spontaneous Japanese
Kawanami, H. Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
Kawtrakul, A. Ontology Driven K-Portal Construction and K-Service Provision
Keller, E. ECESS Inter-Module Interface Specification for Speech Synthesis
Kemps-snijders, M. LEXUS, a web-based tool for manipulating lexical resources lexicon

Ontology-based Language Archive Utilization

An API for accessing the Data Category Registry
Kepler, F. A novel Textual Encoding paradigm based on Semantic Web tools and semantics
Kermanidis, K. Dealing with Imbalanced Data using Bayesian Techniques
Kiefer, B. Preprocessing and Tokenisation Standards in DELPH-IN Tools
Kiilerich, S. H. C. Andersen Conversation Corpus

Field Evaluation of a Single-Word Pronunciation Training System
Kikui, G. A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC)
Kim, J. Linguistic and Biological Annotations of Biological Interaction Events
Kimler, M. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
King, M. A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

Evaluating Symbiotic Systems: the challenge
Kinoshita, J. CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus
Kipper, K. Extending VerbNet with Novel Verb Classes
Kiss, I. TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis
Kitamura, T. Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
Kiyota, Y. A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Klakow, D. Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
Klassmann, A. Comparison of Resource Discovery Methods

ELAN: a Professional Framework for Multimodality Research
Klatt, S. A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German
Klavans, J. CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project
Klein, W. Foundations of Modern Language Resource Archives
Knippen, R. Annotation of Temporal Relations with Tango
Ko, J. Exploiting Multiple Semantic Resources for Answer Selection

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
Kocsor, A. A highly accurate Named Entity corpus for Hungarian
Kogure, K. Features of Terms in Actual Nursing Activities
Kokkinakis, D. Recognizing Acronyms and their Definitions in Swedish Medical Texts
Kongkachandra, R. Semantic-Based Keyword Recovery Function for Keyword Extraction System
Konings, N. Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names
Konstandinidis, A. Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue
Kordoni, V. Automated Deep Lexical Acquisition for Robust Open Texts Processing
Korhonen, A. A Large Subcategorization Lexicon for Natural Language Processing Applications

Extending VerbNet with Novel Verb Classes
Kornai, A. Using a morphological analyzer in high precision POS tagging of Hungarian
Kotzé, P. Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho
Koutsombogera, M. Adding multi-layer semantics to the Greek Dependency Treebank
Kouylekov, M. Building a Large-Scale Repository of Textual Entailment Rules
Kowalski, A. SALTO - A Versatile Multi-Level Annotation Tool

The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Kraif, O. Evaluation of multilingual text alignment systems: the ARCADE II project
Krasnyanskaya, A. SParseval: Evaluation Metrics for Parsing Speech
Krauwer, S. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

The BLARK concept and BLARK for Arabic
Kremer, G. Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Kristiansen, M. Automatic Term Extraction from Knowledge Bank of Economics
Krstev, C. WS4LR: A Workstation for Lexical Resources
Kruengkrai, C. Word Knowledge Acquisition for Computational Lexicon Construction

A Conditional Random Field Framework for Thai Morphological Analysis
Kruijff-korbayova, I. A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Kruijff-korbayová, I. Annotation Guidelines for Czech-English Word Alignment
Kruijsen, J. A Unified Structure for Dutch Dialect Dictionary Data
Kruschwitz, U. An Anaphora Resolution-Based Anonymization Module
Krymolowski, Y. A Large Subcategorization Lexicon for Natural Language Processing Applications
Krynicki, G. General and Task-Specific Corpus Resources for Polish Adult Learners of English
Ku, L. Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks
Kugler, U. Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano
Kuhn, A. Results of the French Evalda-Media evaluation campaign for literal understanding
Kuhn, J. Multilingual parallel treebanking: a lean and flexible approach
Kulick, S. Linguistic Resources for Speech Parsing
Kuroda, K. Getting Deeper Semantics than Berkeley FrameNet with MSFA
Kurohashi, S. Case Frame Compilation from the Web using High-Performance Computing
Kuwahara, N. Features of Terms in Actual Nursing Activities
Kvale, K. KUNSTI - Knowledge Generation for Norwegian Language Technology
Kwong, O. Toward a Pan-Chinese Thesaurus
L  
Labropoulou, P. Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Lacatusu, F. Impact of Question Decomposition on the Quality of Answer Summaries
Lafourcade, M. Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence
Lai, T. Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort
Laib, M. A Deep Linguistic Analysis for Cross-language Information Retrieval
Lamel, L. Experimental detection of vowel pronunciation variants in Amharic
Langkilde-geary, I. A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation
Langlais, P. MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
Laoudi, J. Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding
Laporte, E. Morphological annotation of Korean with Directly Maintainable Resources
Laprun, C. Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech
Laskowski, K. Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus
Laurent, D. Evaluation of multilingual text alignment systems: the ARCADE II project
Lavecchia, C. Linguistic features modeling based on Partial New Cache
Le, A. Edit Distance: A Metric for Machine Translation Evaluation
Le, P. A Lexicalized Tree-Adjoining Grammar for Vietnamese
Lease, M. SParseval: Evaluation Metrics for Parsing Speech

Linguistic Resources for Speech Parsing
Lechenadec, G. Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction
Ledinek, N. Towards a Slovene Dependency Treebank
Lee, H. A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Linguistic Resources for Speech Parsing
Lefevre, F. Results of the French Evalda-Media evaluation campaign for literal understanding
Lehireche, A. Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain
Lehmann, J. What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic
Leidner, J. Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia
Lenci, A. Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
Lenzi, V. I-CAB: the Italian Content Annotation Bank
Lepetit, X. The MULINCO corpus and corpus platform
Lesmo, L. From Natural Language to Databases via Ontologies

A Development Tool For Multilingual Ontology-based Conceptual
Lesourd, H. A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material
Leturia, I. Structure, Annotation and Tools in the Basque ZT Corpus
Levin, L. Parallel Syntactic Annotation of Multiple Languages
Levinson, S. Foundations of Modern Language Resource Archives
Levy, R. Tregex and Tsurgeon: tools for querying and manipulating tree data structures
Li, S. Interaction between Lexical Base and Ontology with Formal Concept Analysis

The Design and Construction of A Chinese Collocation Bank
Li, Wei. Mining Implicit Entities in Queries

Mining Implicit Entities in Queries

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
Li, Wen. Interaction between Lexical Base and Ontology with Formal Concept Analysis

Mining Implicit Entities in Queries
Liang, Y. Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks
Liberman, M. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Integrated Linguistic Resources for Language Exploitation Technologies

More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities
Ligozat, A. FRASQUES: A Question Answering system in the EQueR evaluation campaign
Lin, C. Summarizing Answers for Complicated Questions

Automated Summarization Evaluation with Basic Elements.
Lin, F. Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering
Lin, M. Constructing a Named Entity Ontology from Web Corpora
Lisowska, A. ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

The Evolution of an Evaluation Framework for a Text Mining System

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems
Littman, J. Towards a Generative Lexical Resource: The Brandeis Semantic Ontology
Liu, W. A Closer Look at Skip-gram Modelling
Liu, Y. SParseval: Evaluation Metrics for Parsing Speech

Linguistic Resources for Speech Parsing
Lleida, E. Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
Lombardo, V. Comparing linguistic information in treebank annotations
Lopatkova, M. Valency Lexicon of Czech Verbs: Alternation-Based Model
López de Letona, I. Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
Loukachevitch, N. Development of Linguistic Ontology on Natural Sciences and Technology
Lowery, K. A Hebrew Tree Bank Based on Cantillation Marks
Lu, Q. Interaction between Lexical Base and Ontology with Formal Concept Analysis

Mining Implicit Entities in Queries

The Design and Construction of A Chinese Collocation Bank
Luca, E. Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods
Lucas, P. Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

The Information Commons Gazetteer
Lundälv, M. SYMBERED - a Symbol-Concept Editing Tool
Luning, J. A Study on Terminology Extraction Based on Classified Corpora
Luz, S. Gathering a corpus of multimodal computer-mediated meetings
L’homme, M. A Methodology for Developing Multilingual Resources for Terminology
Lønning, J. Discriminant-Based MRS Banking
M  
Ma, Q. Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives
Ma, W. Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus
Ma, X. Integrated Linguistic Resources for Language Exploitation Technologies

Champollion: A Robust Parallel Text Sentence Aligner

Corpus Support for Machine Translation at LDC
Maamouri, M. Integrated Linguistic Resources for Language Exploitation Technologies

Developing and Using a Pilot Dialectal Arabic Treebank

Lexicon Development for Varieties of Spoken Colloquial Arabic
Maas, J. BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction
Maccartney, B. Generating Typed Dependency Parses from Phrase Structure Parses
Mackinlay, A. Reconsidering Language Identification for Written Language Resources
Macklovitch, E. TransType2 : The Last Word
Madsen, T. Field Evaluation of a Single-Word Pronunciation Training System
Maeda, K. Integrated Linguistic Resources for Language Exploitation Technologies

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Linguistic Resources for Speech Parsing

Low-cost Customized Speech Corpus Creation for Speech Technology Applications

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Maegaard, B. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

KUNSTI - Knowledge Generation for Norwegian Language Technology

The BLARK concept and BLARK for Arabic

The MULINCO corpus and corpus platform
Maffiolo, V. Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction
Magdy, W. Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images
Magnini, B. I-CAB: the Italian Content Annotation Bank

The Multilingual Question Answering Track at CLEF

Building a Large-Scale Repository of Textual Entailment Rules
Mah, C. Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Maia, B. Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
Mairesse, F. Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives
Maks, I. Exploring opportunities for Comparability and Enrichment by Linking lexical databases
Malaisé, V. A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs
Maltese, G. Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Mamede, N. A Framework to Integrate Ubiquitous Knowledge Modeling
Mammini, M. Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Mangeot, M. Dictionary Building with the Jibiki Platform: the GDEF case
Mani, I. Annotation of Temporal Relations with Tango
Manning, C. Generating Typed Dependency Parses from Phrase Structure Parses
Manurung, R. Building a Lexical Database for an Interactive Joke-Generator
Manzano-macho, D. Multilingual Lexical Semantic Resources for Ontology Translation
Maragoudakis, M. Dealing with Imbalanced Data using Bayesian Techniques
Marchetti, A. Moving to dynamic computational lexicons with LeXFlow
Marchi, S. Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Marco, F. Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts
Mareüil, P. A joint prosody evaluation of French text-to-speech synthesis systems

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign
Mariani, J. Techno-langue: The French National Initiative for Human Language Technologies (HLT)
Marimon, M. New tools for the encoding of lexical data extracted from corpus
Marinelli, R. Using Core Ontology for Domain Lexicon Structuring

Proper Names and Linguistic Dynamics
Markantonatou, S. METIS-II: Machine Translation for Low Resource Languages
Markó, K. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Language Specific and Topic Focused Web Crawling
Markus, A. SINOD - Slovenian non-native speech database
Marneffe, M. Generating Typed Dependency Parses from Phrase Structure Parses
Martens, J. Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names
Martin, A. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Martin, E. Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data
Martin, F. FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text
Martin, J. Annotation of Emotions in Real-Life Video Interviews: Variability between Coders

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews
Martin, M. A Development Tool For Multilingual Ontology-based Conceptual
Martinez, C. Bilingual speech corpus in two phonetically similar languages
Marton, G. Using Semantic Overlap Scoring in Answering TREC Relationship Questions
Maruyama, T. Dependency-structure Annotation to Corpus of Spontaneous Japanese
Masaki, N. A new approach to syntactic annotation
Masoodian, M. Gathering a corpus of multimodal computer-mediated meetings
Massimo, M. Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts
Matsubara, S. Layered Speech-Act Annotation for Spoken Dialogue Corpus

A Syntactically Annotated Corpus of Japanese Spoken Monologue

A Corpus Search System Utilizing Lexical Dependency Structure

Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus
Matsumoto, Y. An Annotated Corpus Management Tool: ChaKi

Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences
Matusov, E. Training a Statistical Machine Translation System without GIZA++
Mauclair, J. Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Mauser, A. Training a Statistical Machine Translation System without GIZA++
Maynard, D. Creating Tools for Morphological Analysis of Sumerian
Mazur, P. Named Entity Extraction with Conjunction Disambiguation
Mazze, A. A Development Tool For Multilingual Ontology-based Conceptual
Mazzoli, T. Towards an Ontology for Art and Colours
Mcrorie, M. Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches
Medelyan, O. Language Specific and Topic Focused Web Crawling
Medero, J. A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Medlock, B. An Introduction to NLP-based Textual Anonymisation
Megyesi, B. Building a Swedish-Turkish Parallel Corpus
Melichar, M. Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems
Melloni, C. MORBO/COMP: a multilingual database of compound words
Melz, R. Compiling large language resources using lexical similarity metrics for domain taxonomy learning
Mendes, A. Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Mendizabal, K. A methodology for the joint development of the Basque WordNet and Semcor
Menezes, C. CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus
Merino, D. MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España

Usability evaluation of 3G multimodal services in Telefónica Móviles España
Meur, A. Lemma-oriented dictionaries, concept-oriented terminology and translation memories
Meurer, P. The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language
Michailidis, I. Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime
Mieskes, M. Part-of-Speech Tagging of Transcribed Speech
Miguel, A. Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
Mihelic, A. SI-PRON: A Pronunciation Lexicon for Slovenian
Miháltz, M. Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation
Milde, J. The Eclipse Annotator: an extensible system for multimodal corpus creation

SAM - an annotation editor for parallel texts
Miller, K. Parallel Syntactic Annotation of Multiple Languages

Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm
Minker, W. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign
Mitamura, T. Parallel Syntactic Annotation of Multiple Languages

Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
Mititelu, V. Romanian Valence Dictionary in XML Format
Mitkov, R. If “it” were “then”, then when was “it”? Establishing the anaphoric role of “then”
Miyake, M. Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network
Mohamed, Z. Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
Monachesi, P. A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS)
Monachini, M. Moving to dynamic computational lexicons with LeXFlow

Lexical Markup Framework (LMF)

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Next Generation Language Resources using Grid
Monceau, L. FRASQUES: A Question Answering system in the EQueR evaluation campaign
Montemagni, S. Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Dialectal resources on-line: the ALT-Web experience
Morărescu, P. Principles for annotating and reasoning with spatial information
Morbidoni, C. A novel Textual Encoding paradigm based on Semantic Web tools and semantics
Morel, M. A joint prosody evaluation of French text-to-speech synthesis systems

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign
Moreno, A. TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Generation of Language Resources for the Development of Speech Technologies in Catalan

Spanish Synthesis Corpora

TC-STAR: New language resources for ASR and SLT purposes
Moreno-sandoval, A. The wraetlic NLP suite
Moriceau, V. Language Challenges for Data Fusion in Question-Answering
Morita, T. An Annotated Corpus Management Tool: ChaKi
Moschitti, A. A Tree Kernel approach to Question and Answer Classification in Question Answering Systems
Mostefa, D. Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

Results of the French Evalda-Media evaluation campaign for literal understanding

Evaluation of multimodal components within CHIL: The evaluation packages and results

TC-STAR: New language resources for ASR and SLT purposes
Mouradi, A. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Movassagh, H. Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems
Mukbel, C. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Mulkar, R. An Annotated Corpus of Typical Durations of Events
Mulloni, A. Automatic Detection of Orthographic Cues for Cognate Recognition
Murase, F. Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
Murata, M. Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning
Musacchio, M. Ontological and Terminological Commitments and the Discourse of Specialist Communities
Mustafa el Hadi, W. Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
Márquez, L. Generation of Language Resources for the Development of Speech Technologies in Catalan
Mögele, H. Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Möller, S. Set-up of a Unit-Selection Synthesis with a Prominent Voice
Mühlenbock, K. SYMBERED - a Symbol-Concept Editing Tool

KUNSTI - Knowledge Generation for Norwegian Language Technology
N  
Nagi, M. Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images
Nakagawa, H. A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames
Nakamura, S. Oriental COCOSDA: Past, Present and Future
Nakasone, H. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research
Nam, J. Morphological annotation of Korean with Directly Maintainable Resources
Nascimento, M. Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Nathan, D. Technologies for a Federation of Language Resource Archives
Naumann, K. NPs for Events: Experiments in Coreference Annotation
Navarretta, C. The MULINCO corpus and corpus platform
Navas, E. Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Navigli, R. Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory
Nazarenko, A. The ALVIS Format for Linguistically Annotated Documents
Nederhof, M. LEXUS, a web-based tool for manipulating lexical resources lexicon, standards, tool
Negri, M. I-CAB: the Italian Content Annotation Bank
Nemec, P. Tree Searching/Rewriting Formalism
Nenadic, G. Towards a terminological resource for biomedical text mining
Neumann, G. Exploring HPSG-based Treebanks for Probabilistic Parsing HPSG grammar extraction
Neumann, H. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Neuvel, S. Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts
Ney, H. POS-based Word Reorderings for Statistical Machine Translation

Error Analysis of Statistical Machine Translation Output

Training a Statistical Machine Translation System without GIZA++

A German Sign Language Corpus of the Domain Weather Report

Creating a Large-Scale Arabic to French Statistical MachineTranslation System
Nezda, L. What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic
Nguyen, T. Evaluation of multilingual text alignment systems: the ARCADE II project

A Lexicalized Tree-Adjoining Grammar for Vietnamese
Nicholson, J. Reconsidering Language Identification for Written Language Resources
Niekrasz, J. NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora
Nilsson, J. Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Nimaan, A. Towards automatic transcription of Somali language
Nimb, S. LEXADV - a multilingual semantic Lexicon for Adverbs
Nissim, M. The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Nitta, Y. Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Nivre, J. Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing
Nocera, P. Towards automatic transcription of Somali language
Nohama, P. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Noord, G. Syntactic Annotation of Large Corpora in STEVIN
Noriko, S. Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
Novak, V. Perspectives of Turning Prague Dependency Treebank into a Knowledge Base
Nováček, V. Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
Novák, A. Morphological Tools for Six Small Uralic Languages
Nugues, P. Extraction of Temporal Information from Texts in Swedish

CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

Construction of a FrameNet Labeler for Swedish Text
Nunes, F. Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Nyberg, E. Evaluation for Scenario Question Answering Systems

Exploiting Multiple Semantic Resources for Answer Selection

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
Nøklestad, A. Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Nürnberger, A. Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods
O  
Obrębski, T. Syntactic Lexicon of Polish Predicative Nouns
Obradović, I. WS4LR: A Workstation for Lexical Resources
Obrebski, T. UAM Text Tools - a flexible NLP architecture
Odijk, J. The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Oellinger, T. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation
Oepen, S. Discriminant-Based MRS Banking
Offenga, F. Comparison of Resource Discovery Methods

LAMUS: the Language Archive Management and Upload System

Metadata Profile in the ISO Data Category Registry

Technologies for a Federation of Language Resource Archives
Offersgaard, L. The MULINCO corpus and corpus platform
Ohishi, Y. Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus
Ohno, T. A Syntactically Annotated Corpus of Japanese Spoken Monologue
Ohta, T. Linguistic and Biological Annotations of Biological Interaction Events
Ohtani, A. An Annotated Corpus Management Tool: ChaKi
Okazaki, N. Clustering acronyms in biomedical text for disambiguation

Towards a terminological resource for biomedical text mining
Okumura, M. Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Oliveira, S. The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Oliver, D. Creation and analysis of a Polish speech database for use in unit selection synthesis
Oliver, T. Visual Surveillance and Video Annotation and Description
Oltramari, A. LexiPass methodology: a conceptual path from frames to senses and back
Onelli, C. The DiaCORIS project: a diachronic corpus of written Italian
Oostdijk, N. User requirements analysis for the design of a reference corpus of written Dutch

Data for question answering: The case of why
Orasan, C. NPs for Events: Experiments in Coreference Annotation

Computer-aided summarisation -- what the user really wants

Transferring Coreference Chains through Word Alignment
Oravecz, C. Using a morphological analyzer in high precision POS tagging of Hungarian
Ordelman, R. Annotating Emotions in Meetings
Orsolini, P. Natural Language Processing: A Terminological and Statistical Approach
Osenova, P. The Multilingual Question Answering Track at CLEF

Shallow Semantic Annotation of Bulgarian
Osswald, R. The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon
Ostendorf, M. SParseval: Evaluation Metrics for Parsing Speech
Øvsthus, K. Automatic Term Extraction from Knowledge Bank of Economics
Ozaku, H. Features of Terms in Actual Nursing Activities
O’mara, D. Building a Lexical Database for an Interactive Joke-Generator
P  
Pacheco, E. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Pado, S. SALTO - A Versatile Multi-Level Annotation Tool

Shalmaneser - A Toolchain For Shallow Semantic Parsing

The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Padró, L. FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Padró, M. FreeLing 1.3: Syntactic and semantic services in an open-source NLP library
Paetzold, J. Language Specific and Topic Focused Web Crawling
Paggio, P. Annotating Information Structure in a Corpus of Spoken Danish
Pain, H. Building a Lexical Database for an Interactive Joke-Generator
Pajas, P. Towards a Slovene Dependency Treebank
Palm, G. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Palmer, M. Extending VerbNet with Novel Verb Classes
Palumbo, G. Ontological and Terminological Commitments and the Discourse of Specialist Communities
Pan, F. An Annotated Corpus of Typical Durations of Events
Paoli, M. Dialectal resources on-line: the ALT-Web experience
Papageorgiou, H. Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue

Adding multi-layer semantics to the Greek Dependency Treebank
Pardelli, G. Natural Language Processing: A Terminological and Statistical Approach
Paroubek, P. Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.
Passonneau, R. Measuring Agreement on Set-valued Items (MASI) for Semantic and Pragmatic Annotation

Inter-annotator Agreement on a Multilingual Semantic Annotation Task

CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project
Pastra, K. Beyond Multimedia Integration: corpora and annotations for cross-media decision mechanisms
Patel, A. Lexical similarity can distinguish between automatic and manual translations
Patry, A. MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation
Paulsson, N. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Paumier, S. A translated corpus of 30,000 French SMS

A framework for real-time dictionary updating
Pauw, G. KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

A Grapheme-Based Approach for Accent Restoration in Gikuyu

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Pazienza, M. Mixing WordNet, VerbNet and PropBank for studying verb relations
Pease, A. Building a WordNet for Arabic
Pechsiri, C. Ontology Driven K-Portal Construction and K-Service Provision
Pecina, P. Semi-automatic Building of Swedish Collocation Lexicon
Pedersen, B. Query Expansion on Compounds
Pekar, V. Automatic Detection of Orthographic Cues for Cognate Recognition
Pellegrini, T. Experimental detection of vowel pronunciation variants in Amharic
Pennacchiotti, M. Mixing WordNet, VerbNet and PropBank for studying verb relations
Pereira, L. COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Pereira, A. The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Permpool, T. Ontology Driven K-Portal Construction and K-Service Provision
Persson, E. CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling
Pet, M. Lexical Markup Framework (LMF)
Petel, S. The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus
Peters, C. The Impact of Evaluation on Multilingual Information Retrieval System Development
Peters, W. Creating Tools for Morphological Analysis of Sumerian

The LOIS Project
Petersen, U. Querying Both Parallel And Treebank Corpora: Evaluation Of A Corpus Query System
Petit-renaud, S. Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions
Peñas, A. The Multilingual Question Answering Track at CLEF
Pianta, E. I-CAB: the Italian Content Annotation Bank
Piazza, F. A novel Textual Encoding paradigm based on Semantic Web tools and semantics
Picchi, E. Dialectal resources on-line: the ALT-Web experience

Next Generation Language Resources using Grid
Pietrala, D. General and Task-Specific Corpus Resources for Polish Adult Learners of English
Pinto, A. Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering
Piperidis, S. Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Pirrelli, V. Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment
Piskorski, J. Linguistic Suite for Polish Cadastral System
Pistol, I. Temporality in relation with discourse structure
Pitel, G. A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation
Plank, B. Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano
Pociello, E. A methodology for the joint development of the Basque WordNet and Semcor

A Preliminary Study for Building the Basque PropBank
Podvesky, P. Semi-automatic Building of Swedish Collocation Lexicon
Poesio, M. An Anaphora Resolution-Based Anonymization Module
Pohl, G. Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation
Poietti, D. The DiaCORIS project: a diachronic corpus of written Italian
Polajnar, T. User-friendly ontology authoring using a controlled language
Polifroni, J. Learning Database Content for Spoken Dialogue System Design
Polin, Z. Structure, Annotation and Tools in the Basque ZT Corpus
Poller, P. The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Pollák, P. Methodology of Lombard Speech Database Acquisition: Experiences with CLSD
Pomikálek, J. Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
Popescu-belis, A. CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation
Popovic, M. POS-based Word Reorderings for Statistical Machine Translation
Poprat, M. Language Specific and Topic Focused Web Crawling
Porcelijn, T. Identifying Named Entities in Text Databases from the Natural History Domain
Postolache, O. Annotation Guidelines for Czech-English Word Alignment

Transferring Coreference Chains through Word Alignment
Pouli, K. Adding multi-layer semantics to the Greek Dependency Treebank
Pouliquen, B. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages
Povlsen, C. EuroTermBank - a Terminology Resource based on Best Practice

The MULINCO corpus and corpus platform
Pražák, A. Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries
Pretorius, L. Towards machine-readable lexicons for South African Bantu languages
Prinsloo, D. Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho
Prodanof, I. Annotating Bridging Anaphors in Italian: in Search of Reliability
Prokopidis, P. Adding multi-layer semantics to the Greek Dependency Treebank
Prokopova, M. Czech-English Word Alignment
Protopapas, A. All Greek to me! An automatic Greeklish to Greek transliteration system
Przybocki, M. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Edit Distance: A Metric for Machine Translation Evaluation
Psutka, J. Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech
Puliti, P. A novel Textual Encoding paradigm based on Semantic Web tools and semantics
Puscasu, G. If “it” were “then”, then when was “it”? Establishing the anaphoric role of “then”
Pustejovsk, J. SlinkET: A Partial Modal Parser for Events
Pustejovsky, J. Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

Annotation of Temporal Relations with Tango

BULB: A Unified Lexical Browser
Pérez, A. Multilingual Lexical Semantic Resources for Ontology Translation
Pérez, J. ECESS Inter-Module Interface Specification for Speech Synthesis

Acceptance Testing of a Spoken Language Translation System

GAIA: Common Framework for the Development of Speech Translation Technologies
Pîrvan, F. Tagset Mapping and Statistical Training Data Cleaning-up
Q  
Qin, L. A Study on Terminology Extraction Based on Classified Corpora
Quasthoff, U. Corpus Portal for Search in Monolingual Corpora
Quignard, M. A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

Results of the French Evalda-Media evaluation campaign for literal understanding
Quintian, M. A methodology for the joint development of the Basque WordNet and Semcor
R  
Raake, A. US-based Method for Speech Reception Threshold Measurement in French

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign
Racioppa, S. Ontology-based Information Extraction with SOBA
Radde, N. Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech
Radev, D. Lexical similarity can distinguish between automatic and manual translations
Ragheb, A. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Raidt, S. Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?
Rajman, M. X-Score: Automatic Evaluation of Machine Translation Grammaticality

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems
Rambow, O. Developing and Using a Pilot Dialectal Arabic Treebank

Parallel Syntactic Annotation of Multiple Languages

Inter-annotator Agreement on a Multilingual Semantic Annotation Task
Ramos, M. Local Document Relevance Clustering in IR Using Collocation Information
Rapp R. Example-Based Machine Translation Using a Dictionary of Word Pairs
Rapti, S. All Greek to me! An automatic Greeklish to Greek transliteration system
Rashwan, M. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Ratsch, E. Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus
Rayner, M. REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications
Rebrus, P. Morphdb.hu: Hungarian lexical database and morphological grammar
Reeder, F. Parallel Syntactic Annotation of Multiple Languages
Reidsma, D. Annotating Emotions in Meetings
Reigem, Ø. Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains
Renso, C. Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Reyle, U. Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology
Reynaert, M. Corpus-Induced Corpus Clean-up
Richard, G. Fear-type emotions of the SAFE Corpus: annotation issues
Richter, M. Corpus Portal for Search in Monolingual Corpora
Rieser, V. The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Rijke, M. The Multilingual Question Answering Track at CLEF
Ritchie, G. Building a Lexical Database for an Interactive Joke-Generator
Ritz, J. Extraction tools for collocations and their morphosyntactic specificities
Roark, B. SParseval: Evaluation Metrics for Parsing Speech
Robaldo, L. From Natural Language to Databases via Ontologies
Robba, I. Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.

FRASQUES: A Question Answering system in the EQueR evaluation campaign
Rodríguez, H. Building a WordNet for Arabic
Rohrer, C. Improving coverage and parsing quality of a large-scale LFG for German
Rojas, D. Predicting MT Quality as a Function of the Source Language
Rojc, M. Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems
Romacker, M. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon
Romanelli, M. A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Romano, L. I-CAB: the Italian Content Annotation Bank
Romary, L. Representing Linguistic Corpora and Their Annotations

Foundations of Modern Language Resource Archives

Metadata Profile in the ISO Data Category Registry

An API for accessing the Data Category Registry

A Lexicalized Tree-Adjoining Grammar for Vietnamese
Rosset, S. Results of the French Evalda-Media evaluation campaign for literal understanding

The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus
Rosso, P. Mining Knowledge fromWikipedia for the Question Answering task
Roussanaly, A. A Lexicalized Tree-Adjoining Grammar for Vietnamese
Roventini, A. Using Core Ontology for Domain Lexicon Structuring

Linking Verbal Entries of Different Lexical Resources
Ruimy, N. Merging two Ontology-based Lexical Resources

Structuring a Domain Vocabulary in a General Knowledge Environment
Ruiz, N. Tangible Objects for the Acquisition of Multimodal Interaction Patterns
Ruiz-casado, M. The wraetlic NLP suite
Rumshisky, A. Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology
Rung, A. Morphdb.hu: Hungarian lexical database and morphological grammar
Rus, V. The Look and Feel of a Confident Entailer
Russel, A. ANNEX - a web-based Framework for Exploiting Annotated Media Resources

ELAN: a Professional Framework for Multimodality Research

Ontology-based Language Archive Utilization
Ryant, N. Extending VerbNet with Novel Verb Classes
Ryu, P. Compiling large language resources using lexical similarity metrics for domain taxonomy learning
S  
Sacaleanu, B. The Multilingual Question Answering Track at CLEF
Sadanobu, T. Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow
Sagara, K. Features of Terms in Actual Nursing Activities
Saggion, H. Multilingual Multidocument Summarization Tools and Evaluation

Language Resources for Background Gathering
Sagot, B. The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

Deep non-probabilistic parsing of large corpora
Sagri, M. The LOIS Project
Sahlgren, M. Towards pertinent evaluation methodologies for word-space models
Sailer, M. The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research
Saint-dizier, P. PrepNet: a Multilingual Lexical Description of Prepositions
Saiz, R. Structure, Annotation and Tools in the Basque ZT Corpus
Salmenkivi, M. Finding representative sets of dialect words for geographical regions
Salmon-alt, S. Automatic extraction of subcategorization frames for French
Salvador, L. CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus
Sameti, H. Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems
Samy, D. Building a Parallel Multilingual Corpus (Arabic-Spanish-English)
Sanders, G. Edit Distance: A Metric for Machine Translation Evaluation
Sandoval, A. Building a Parallel Multilingual Corpus (Arabic-Spanish-English)
Sandrelli, A. Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC)
Sannikov, A. A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
Sansò, A. Documenting variation across Europe and the Mediterranean: the Pavia Typological Database
Santos, D. Annotating COMPARA, a Grammar-aware Parallel Corpus

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

HAREM: An Advanced NER Evaluation Contest for Portuguese

The Multilingual Question Answering Track at CLEF
Santos, R. The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon
Saraki, M. Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy
Saratxaga, I. Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque
Sarmento, L. Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

Component Evaluation in a Question Answering System

BACO - A large database of text and co-occurrences
Saruwatari, H. Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria
Sasaki, F. Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources
Sassi, M. Next Generation Language Resources using Grid

Natural Language Processing: A Terminological and Statistical Approach
Sassolini, E. Dialectal resources on-line: the ALT-Web experience

Next Generation Language Resources using Grid
Sauri, R. SlinkET: A Partial Modal Parser for Events
Savy, R. Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).
Scalise, S. MORBO/COMP: a multilingual database of compound words
Schaden, S. “Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure
Schehl, J. The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player
Scherer, S. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Schiehlen, M. The Mass-Count Distinction: Acquisition and Disambiguation
Schiel, F. Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus
Schiffrin, A. Methodological Aspects of Semantic Annotation
Schiller, M. A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material
Schlesinger, P. Semi-automatic Building of Swedish Collocation Lexicon
Schlyter, S. CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling
Schrader, B. Non-probabilistic alignment of rare German and English nominal expressions
Schrader, N. LexikoNet - a lexical database based on type and role hierarchies
Schreiber, G. Conversion of WordNet to a standard RDF/OWL representation
Schulte, S. Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet
Schulz, S. Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Language Specific and Topic Focused Web Crawling
Schuurman, I. Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

Syntactic Annotation of Large Corpora in STEVIN
Schuurman, I. METIS-II: Machine Translation for Low Resource Languages
Schwenker, F. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Schäfer, U. Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components

OntoNERdIE -- Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources

Preprocessing and Tokenisation Standards in DELPH-IN Tools
Sebastiani, F. SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining
Seco, N. HAREM: An Advanced NER Evaluation Contest for Portuguese
Segouat, J. Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data
Seidenari, C. The DiaCORIS project: a diachronic corpus of written Italian

POS tagset design for Italian
Semecký, J. On Automatic Assignment of Verb Valency Frames in Czech
Semmar, N. Evaluation of multilingual text alignment systems: the ARCADE II project

A Deep Linguistic Analysis for Cross-language Information Retrieval
Senda, Y. Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Senn, J. Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources
Serafini, L. The role of lexical resources in matching classification schemas
Servan, C. Results of the French Evalda-Media evaluation campaign for literal understanding
Severinson, K. Developing a ContextualizedMultimodal Corpus for Human-Robot InteractionKeywords: Multimodal corpus, Human-Robot Interaction, Spatiality,Dialogue systems, Gesture
Sgall, P. Corpus Annotation as a Test of a Linguistic Theory
Shafran, I. SParseval: Evaluation Metrics for Parsing Speech
Shahin, M. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Sharoff, S. Using collocations from comparable corpora to find translation equivalents

A Uniform Interface to Large-Scale Linguistic Resources

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language
Shehata, A. A Dependency-based Algorithm for Grammar Conversion
Sheridan, P. Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval
Shikano, K. Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech
Shima, H. Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering
Siddharthan, A. Parallel Syntactic Annotation of Multiple Languages
Siegel, M. Ontology-based Information Extraction with SOBA
Silva, J. Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project
Simon, E. Morphdb.hu: Hungarian lexical database and morphological grammar
Simov, K. Shallow Semantic Annotation of Bulgarian
Simões, A. T2O - Recycling Thesauri into a Multilingual Ontology
Sin, K. Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort
Sinohara, Y. Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing
Sinopalnikova, A. Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements
Sitbon, L. Tools and methods for objective or contextual evaluation of topic segmentation
Sizov, V. A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects
Sjöbergh, J. Towards Holistic Summarization -- Selecting Summaries, Not Sentences
Skiba, R. Comparison of Resource Discovery Methods

LAMUS: the Language Archive Management and Upload System
Skwarska, K. Valency Lexicon of Czech Verbs: Alternation-Based Model
Slavcheva, M. Semantic Descriptors: The Case of Reflexive Verbs
Sloane, Z. Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages
Sloetjes, H. ELAN: a Professional Framework for Multimodality Research
Smaïli, K. Linguistic features modeling based on Partial New Cache
Smits, F. JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality
Smrž, P. Automatic Acquisition of Semantics-Extraction Patterns

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements
Snover, M. SParseval: Evaluation Metrics for Parsing Speech
Soler, J. MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España

Usability evaluation of 3G multimodal services in Telefónica Móviles España
Soler, V. Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology
Sologaistoa, A. Structure, Annotation and Tools in the Basque ZT Corpus
Somers, H. Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
Sonntag, D. A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications
Soria, C. Moving to dynamic computational lexicons with LeXFlow

Lexical Markup Framework (LMF)

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Next Generation Language Resources using Grid
Sornlertlamvanich, V. Blind Evaluation for Thai Search Engines

Word Knowledge Acquisition for Computational Lexicon Construction

A Conditional Random Field Framework for Thai Morphological Analysis
Sornprasert, P. Ontology Driven K-Portal Construction and K-Service Provision
Soroa, A. Structure, Annotation and Tools in the Basque ZT Corpus
Soudi, A. IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler
Sousa, M. Building a historical corpus for Classical Portuguese: some technological aspects
Spadoni, G. Using Core Ontology for Domain Lexicon Structuring
Speranza, M. I-CAB: the Italian Content Annotation Bank
Spinsanti, L. Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases
Sporleder, C. Identifying Named Entities in Text Databases from the Natural History Domain
Spranger, K. The Mass-Count Distinction: Acquisition and Disambiguation
Sprugnoli, R. I-CAB: the Italian Content Annotation Bank
Srichaivattana, P. Blind Evaluation for Thai Search Engines
Stanković, R. WS4LR: A Workstation for Lexical Resources
Ştefănescu, D. Acquis Communautaire Sentence Alignment using Support Vector Machines

Aligning Multilingual Thesauri
Steffen, J. The pragmatic combination of different crosslingual resources
Stein, D. A German Sign Language Corpus of the Domain Weather Report
Steinberger, R. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages
Stewart, R. SParseval: Evaluation Metrics for Parsing Speech
Stock, O. The Affective Weight of Lexicon
Stolarski, M. UAM Text Tools - a flexible NLP architecture
Storrer, A. Automated detection and annotation of term definitions in German text corpora
Strapparava, C. The Affective Weight of Lexicon
Strassel, S. Integrated Linguistic Resources for Language Exploitation Technologies

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Linguistic Resources for Speech Parsing

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Strauß, P. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Strube, M. Part-of-Speech Tagging of Transcribed Speech
Strömqvist, S. Technologies for a Federation of Language Resource Archives
Stuck, F. Evaluation of multilingual text alignment systems: the ARCADE II project
Subba, R. Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Suderman, K. Integrating Linguistic Resources: The American National Corpus Model
Suktarachan, M. Ontology Driven K-Portal Construction and K-Service Provision
Sutcliffe, R. The Multilingual Question Answering Track at CLEF
Suzuki, E. On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language
Suzuki, H. RefRef: A Tool for Viewing and Exploring Coreference Space
Suzuki, T. On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language
Svendsen, T. FonDat1: A Speech Synthesis Corpus for Norwegian
Swanenberg, J. A Unified Structure for Dutch Dialect Dictionary Data
Szarvas, G. A highly accurate Named Entity corpus for Hungarian
Szklanny, K. Creation and analysis of a Polish speech database for use in unit selection synthesis
Sá, T. COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions
Sánchez, A. MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España
T  
Tabessi, D. Developing and Using a Pilot Dialectal Arabic Treebank
Tablan, V. User-friendly ontology authoring using a controlled language

Creating Tools for Morphological Analysis of Sumerian
Taboada, M. Methods for Creating Semantic Orientation Dictionaries
Tadić, M. Evaluating Morphosyntactic Tagging of Croatian Texts
Taib, R. Tangible Objects for the Acquisition of Multimodal Interaction Patterns
Taiichi, H. A new approach to syntactic annotation
Takanashi, K. Dependency-structure Annotation to Corpus of Spontaneous Japanese
Takeda, K. Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus
Takenobu, T. A new approach to syntactic annotation
Taljard, E. Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho
Talley, J. Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints
Tamburini, F. The DiaCORIS project: a diachronic corpus of written Italian

POS tagset design for Italian
Tan, T. A French Non-Native Corpus for Automatic Speech Recognition
Tate, C. Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding
Tateishi, M. Analyzing the Effects of Spoken Dialog Systems on Driving Behavior
Tateisi, Y. Linguistic and Biological Annotations of Biological Interaction Events
Tchobanov, A. Interoperability of audio corpora : the case of the French corpora
Teixeira, J. Progmatica: A Prosodic Database for European Portuguese
Tenfjord, K. The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language
Terenzi, E. Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations
Tesconi, M. Moving to dynamic computational lexicons with LeXFlow
Teunissen, L. The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources
Thamvijit, D. Ontology Driven K-Portal Construction and K-Service Provision
Theeramunkong, T. Word Knowledge Acquisition for Computational Lexicon Construction
Thulin, J. CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling
Tian, J. ECESS Inter-Module Interface Specification for Speech Synthesis
Tiedemann, J. ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts alignment of parallel texts
Timimi, I. Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign
Tiscornia, D. The LOIS Project
Tobin, R. Rule-Based Chunking and Reusability
Toda, T. Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria
Tohyama, H. Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus
Tomaso, V. Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano
Tongchim, S. Blind Evaluation for Thai Search Engines
Tono, Y. An Annotated Corpus Management Tool: ChaKi
Topp, E. Developing a ContextualizedMultimodal Corpus for Human-Robot InteractionKeywords: Multimodal corpus, Human-Robot Interaction, Spatiality,Dialogue systems, Gesture
Toumouh, A. Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain
Traue, H. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Trawinski, B. The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research
Trilsbeek, P. LAMUS: the Language Archive Management and Upload System
Trippel, T. A BLARK extension for temporal annotation mining

Building a historical corpus for Classical Portuguese: some technological aspects

Feature-based Encoding and Querying Language Resources with Character Semantics
Trón, V. Morphdb.hu: Hungarian lexical database and morphological grammar

Using a morphological analyzer in high precision POS tagging of Hungarian
Tseng, C. Oriental COCOSDA: Past, Present and Future
Tseng, S. Discourse functions of duration in Mandarin: resource design and implementation
Tsiakoulis, P. All Greek to me! An automatic Greeklish to Greek transliteration system
Tsou, B. Toward a Pan-Chinese Thesaurus

Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort
Tsujii, J. Linguistic and Biological Annotations of Biological Interaction Events
Tufiş, D. Dependency-Based Phrase Alignment

Acquis Communautaire Sentence Alignment using Support Vector Machines

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

RoCo-News: A Hand Validated Journalistic Corpus of Romanian
Tufi, D. Tagset Mapping and Statistical Training Data Cleaning-up

Aligning Multilingual Thesauri
Tummarello, G. A novel Textual Encoding paradigm based on Semantic Web tools and semantics
U  
Uchimoto, K. Dependency-structure Annotation to Corpus of Spontaneous Japanese

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence
Uemlianin, I. Tools and resources for speech synthesis arising from a Welsh TTS project
Ueyama, M. Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval
Ulivieri, M. Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian
Umbert, M. Spanish Synthesis Corpora
Underwood, N. A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

Evaluating Symbiotic Systems: the challenge

The Evolution of an Evaluation Framework for a Text Mining System
Uryupina, O. Coreference Resolution with and without Linguistic Knowledge
Uszkoreit, H. The pragmatic combination of different crosslingual resources
Utiyama, M. Getting Deeper Semantics than Berkeley FrameNet with MSFA
V  
Vajda, P. Morphdb.hu: Hungarian lexical database and morphological grammar
Valitutti, A. The Affective Weight of Lexicon
Valverde, A. Structure, Annotation and Tools in the Basque ZT Corpus
Vandeghinste, V. Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

Syntactic Annotation of Large Corpora in STEVIN
Vandeghinste, V. METIS-II: Machine Translation for Low Resource Languages
Vaneyghen, J. A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora
Vanni, M. Parallel Syntactic Annotation of Multiple Languages

Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm
Vanrullen, T. Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy
Varga, D. The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

Using a morphological analyzer in high precision POS tagging of Hungarian
Varona, A. Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
Vasileiadis, S. Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime
Vasilescu, I. Fear-type emotions of the SAFE Corpus: annotation issues
Vasiljevs, A. EuroTermBank - a Terminology Resource based on Best Practice
Vauvert, G. The ALVIS Format for Linguistically Annotated Documents
Veenendaal, R. Functioning of the Centre for Dutch Language and Speech Technology
Vela, M. Generic NLP Tools for Supporting Shallow Ontology Building
Vela, O. Multilingual Lexical Semantic Resources for Ontology Translation
Verberne, S. Data for question answering: The case of why
Verdonik, D. SINOD - Slovenian non-native speech database

Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems
Verhagen, M. SlinkET: A Partial Modal Parser for Events

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

Annotation of Temporal Relations with Tango

BULB: A Unified Lexical Browser
Vetulani, G. Syntactic Lexicon of Polish Predicative Nouns
Vetulani, Z. Syntactic Lexicon of Polish Predicative Nouns
Vide, C. Example-Based Machine Translation Using a Dictionary of Word Pairs
Vigário, M. FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text
Vilain, M. Leveraging Machine Readable Dictionaries in Discriminative Sequence Models
Vilar, D. Error Analysis of Statistical Machine Translation Output
Vilela, R. HAREM: An Advanced NER Evaluation Contest for Portuguese
Villemonte de la Clergerie, E. The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
Villaneau, J. Results of the French Evalda-Media evaluation campaign for literal understanding
Vilnat, A. Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.

FRASQUES: A Question Answering system in the EQueR evaluation campaign

EQueR: the French Evaluation campaign of Question-Answering Systems
Vitas, D. WS4LR: A Workstation for Lexical Resources
Vivaldi, J. SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations
Voghera, M. An observatory on Spoken Italian linguistic resources and descriptive standards.
Vogt, T. Improving Automatic Emotion Recognition from Speech via Gender Differentiaion
Voll, K. Methods for Creating Semantic Orientation Dictionaries
Voss, C. Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding
Vossen, P. Building a WordNet for Arabic
Vriend, F. A Unified Structure for Dutch Dialect Dictionary Data
Vázquez, G. The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level
Véronis, J. Evaluation of multilingual text alignment systems: the ARCADE II project
W  
Węcel, K. Linguistic Suite for Polish Cadastral System
Wagacha, P. A Grapheme-Based Approach for Accent Restoration in Gikuyu
Waldron, B. Preprocessing and Tokenisation Standards in DELPH-IN Tools
Walker, C. An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks
Walker, K. The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Low-cost Customized Speech Corpus Creation for Speech Technology Applications
Walker, M. Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

Learning Database Content for Spoken Dialogue System Design
Waller, A. Building a Lexical Database for an Interactive Joke-Generator
Walter, R. Interoperability of audio corpora : the case of the French corpora
Walter, W. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Wandmacher, T. Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People
Wang, F. Towards Unified Chinese Segmentation Algorithm

Evaluation of Stop Word Lists in Chinese Language
Wang, M. Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering
Wang, X. TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis
Wanner, L. Local Document Relevance Clustering in IR Using Collocation Information
Watanabe, M. Automatic Construction of Japanese WordNet
Weidenbacher, U. Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments
Weissenbacher, D. The ALVIS Format for Linguistically Annotated Documents
Wellinghoff, S. Automated detection and annotation of term definitions in German text corpora
Wellner, B. Leveraging Machine Readable Dictionaries in Discriminative Sequence Models
Wenjie, L. A Study on Terminology Extraction Based on Classified Corpora
Westerhout, E. A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS)
Widdows, D. Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

The Information Commons Gazetteer

Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain
Widiger, A. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages
Wieloc, K. Linguistic Suite for Polish Cadastral System
Wilks, Y. An Incremental Tri-Partite Approach To Ontology Learning

A Closer Look at Skip-gram Modelling

Evaluating Automatically Generated Timelines from the Web
Williams, B. Tools and resources for speech synthesis arising from a Welsh TTS project
Wintner, S. A Computational Lexicon of Contemporary Hebrew
Witt, A. Exploiting logical document structure for anaphora resolution
Wittenburg, P. Comparison of Resource Discovery Methods

ELAN: a Professional Framework for Multimodality Research

LAMUS: the Language Archive Management and Upload System

Foundations of Modern Language Resource Archives

LEXUS, a web-based tool for manipulating lexical resources lexicon, standards, tool

Metadata Profile in the ISO Data Category Registry

Technologies for a Federation of Language Resource Archives

Ontology-based Language Archive Utilization

An API for accessing the Data Category Registry
Wolska, M. A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material
Wong, M. Skeleton Parsing in Chinese: Annotation Scheme and Guidelines
Wong., K. Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
Wrede, B. BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction
Wu, A. A Hebrew Tree Bank Based on Cantillation Marks
Wypych, M. General and Task-Specific Corpus Resources for Polish Adult Learners of English
X  
Xia, Y. Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach
Xu, F. The pragmatic combination of different crosslingual resources
Xu, J. Error Analysis of Statistical Machine Translation Output
Xu, R. Interaction between Lexical Base and Ontology with Formal Concept Analysis

The Design and Construction of A Chinese Collocation Bank
Xue, N. Annotating the Predicate-Argument Structure of Chinese Nominalizations
Y  
Yakushiji, A. Linguistic and Biological Annotations of Biological Interaction Events
Yamaguchi, Y. Layered Speech-Act Annotation for Spoken Dialogue Corpus
Yamamoto, E. Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives

Detection of inconsistencies in concept classifications in a large dictionary - Toward an improvement of the EDR electronic dictionary
Yang, J. Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages
Yang, Q. Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names
Yaseen, M. Building Annotated Written and Spoken Arabic LRs in NEMLAR Project
Yingsaeree, C. Ontology Driven K-Portal Construction and K-Service Provision
Yirong, C. A Study on Terminology Extraction Based on Classified Corpora
Yona, S. A Computational Lexicon of Contemporary Hebrew
Yung, L. SParseval: Evaluation Metrics for Parsing Speech
Z  
Žabokrtsky, Z. Valency Lexicon of Czech Verbs: Alternation-Based Model

Towards a Slovene Dependency Treebank
Zaghouani, W. Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

Evaluation of multilingual text alignment systems: the ARCADE II project
Zahedi, M. A German Sign Language Corpus of the Domain Weather Report
Zanobini, S. The role of lexical resources in matching classification schemas
Zanzotto, F. Mixing WordNet, VerbNet and PropBank for studying verb relations

A Dependency-based Algorithm for Grammar Conversion
Žele, A. Towards a Slovene Dependency Treebank
Zgank, A. SINOD - Slovenian non-native speech database
Zhang, Y. Automated Deep Lexical Acquisition for Robust Open Texts Processing
Zhifang, S. A Study on Terminology Extraction Based on Classified Corpora
Zhou, L. Summarizing Answers for Complicated Questions

Automated Summarization Evaluation with Basic Elements.
Ziegenhain, U. TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis
Zinger, S. Exploiting text for extracting image processing resources
Zou, F. Towards Unified Chinese Segmentation Algorithm

Evaluation of Stop Word Lists in Chinese Language