LREC 2006 - Proceedings sorted by authors

Abramowicz, W.

Annotation of Emotions in Real-Life Video Interviews: Variability between Coders

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Abrilian, S.

Adly, N.

Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images

Agić, Ž.

Evaluating Morphosyntactic Tagging of Croatian Texts

Agirre, E.

A methodology for the joint development of the Basque WordNet and Semcor

A Preliminary Study for Building the Basque PropBank

Ågren, M.

CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

Agüero, P.

Spanish Synthesis Corpora

Ahmad, K.

Ontological and Terminological Commitments and the Discourse of Specialist Communities

Visual Surveillance and Video Annotation and Description

Sentiments on a Grid: Analysis of Streaming News and Views

Aholab, I.

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Ahrenberg, L.

KUNSTI - Knowledge Generation for Norwegian Language Technology

Aikawa, T.

Detecting Inter-domain Semantic Shift using Syntactic Similarity

Predicting MT Quality as a Function of the Source Language

Ailomaa, M.

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems

Ajani, G.

A Development Tool For Multilingual Ontology-based Conceptual

Ajot, J.

Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech

Akahori, I.

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior

Akam, H.

Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network

Akiba, T.

Exploiting Dynamic Passage Retrieval for Spoken Question Recognition and Context Processing towards Speech-driven Information Access Dialogue

Al-kufaishi, A.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Alabau, V.

Bilingual speech corpus in two phonetically similar languages

Alcina, A.

Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology

Aldezabal, I.

A methodology for the joint development of the Basque WordNet and Semcor

A Preliminary Study for Building the Basque PropBank

Alegria, I.

Structure, Annotation and Tools in the Basque ZT Corpus

Alex, B.

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Alessandro, P.

Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts

Alfonseca, E.

Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

The wraetlic NLP suite

Alkhalifa, M.

Allison, B.

T2O - Recycling Thesauri into a Multilingual Ontology

Almeida, J.

Alonge, A.

The Italian Metaphor Database

Alonso, L.

The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level

Alphonse, E.

The ALVIS Format for Linguistically Annotated Documents

Amdal, I.

FonDat1: A Speech Synthesis Corpus for Norwegian

Amigó, E.

Iqmt: A Framework for Automatic Machine Translation Evaluation

Amsalu, S.

Data-driven Amharic-English Bilingual Lexicon Acquisition

Ananiadou, S.

Clustering acronyms in biomedical text for disambiguation

Towards a terminological resource for biomedical text mining

Anderson, W.

Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho

Ando, R.

Analysis of TimeBank as a Resource for TimeML Parsing

Andrade, R.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Andrassy, B.

Human and machine recognition as a function of SNR

Andre, E.

Improving Automatic Emotion Recognition from Speech via Gender Differentiaion

Andreevskaia, A.

Semantic Tag Extraction from WordNet Glosses

Andrew, G.

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

Andrews, W.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Anstein, S.

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Anthony, C.

Methods for Creating Semantic Orientation Dictionaries

Antoine, J.

Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People

Antunes, S.

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Apresjan, J.

A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects

Areta, N.

Structure, Annotation and Tools in the Basque ZT Corpus

Aribau, E.

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems

Armstrong, S.

Arnaiz, A.

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Arntzen, P.

Identifying Named Entities in Text Databases from the Natural History Domain

Artola, X.

Structure, Annotation and Tools in the Basque ZT Corpus

Asahara, M.

The pragmatic combination of different crosslingual resources

Aslan, I.

Assem, M.

Conversion of WordNet to a standard RDF/OWL representation

Athanassia-lida, D.

Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations.

Atserias, J.

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Attia, M.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Aue, A.

Detecting Inter-domain Semantic Shift using Syntactic Similarity

Aunimo, L.

Usability evaluation of 3G multimodal services in Telefónica Móviles España

Ávila, C.

Awab, S.

Open Source Corpus Analysis Tools for Malay

Ayache, C.

Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.

Results of the French Evalda-Media evaluation campaign for literal understanding

The Multilingual Question Answering Track at CLEF

EQueR: the French Evaluation campaign of Question-Answering Systems

Babych, B.

Using collocations from comparable corpora to find translation equivalents

Badia, T.

METIS-II: Machine Translation for Low Resource Languages

Bae , H.

A Methodology for Developing Multilingual Resources for Terminology

Bahrani, M.

Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems

Bailly, G.

A joint prosody evaluation of French text-to-speech synthesis systems

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign

Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?

Balasubramanya, M.

Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

The Information Commons Gazetteer

Baldwin, T.

Reconsidering Language Identification for Written Language Resources

Open Source Corpus Analysis Tools for Malay

Balfourier, J.

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Banchs, R.

Acceptance Testing of a Spoken Language Translation System

Barbu, A.

Romanian Valence Dictionary in XML Format

Barker, E.

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

Barreto, F.

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Bartolini, R.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Basili, R.

A Tree Kernel approach to Question and Answer Classification in Question Answering Systems

Baude, O.

Interoperability of audio corpora : the case of the French corpora

Bechet, F.

Results of the French Evalda-Media evaluation campaign for literal understanding

Beck, D.

Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components

Becker, T.

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Beermann, D.

Word Sense Disambiguation and Semantic Disambiguation for Construction Types in Deep Processing Grammars

Behrens, F.

The Eclipse Annotator: an extensible system for multimodal corpus creation

Bel, N.

Lexical Markup Framework (LMF)

New tools for the encoding of lexical data extracted from corpus

Bellot, P.

Question Answering Evaluation Survey

Tools and methods for objective or contextual evaluation of topic segmentation

Bendahman, C.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Bendazzoli, C.

Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC)

Benedí, J.

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Bennett, C.

Visual Surveillance and Video Annotation and Description

Benzmüller, C.

A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material

Berck, P.

ANNEX - a web-based Framework for Exploiting Annotated Media Resources

Ontology-based Language Archive Utilization

Bergler, S.

Semantic Tag Extraction from WordNet Glosses

Berglund, A.

Extraction of Temporal Information from Texts in Swedish

Bergmann, S.

Set-up of a Unit-Selection Synthesis with a Prominent Voice

Berlocher, I.

Morphological annotation of Korean with Directly Maintainable Resources

Bernardi, R.

Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

POS tagset design for Italian

Bernsen, N.

H. C. Andersen Conversation Corpus

Field Evaluation of a Single-Word Pronunciation Training System

Bertagna, F.

Moving to dynamic computational lexicons with LeXFlow

Representation and Inference for Open-Domain QA: Strength and Limits of two Italian Semantic Lexicons

Next Generation Language Resources using Grid

Besacier, L.

A French Non-Native Corpus for Automatic Speech Recognition

Best, C.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

Betteridge, J.

A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation

Bibiko, H.

Ontology-based Language Archive Utilization

Bick, E.

Turning a Dependency Treebank into a PSG-style Constituent Treebank

Biemann, C.

Corpus Portal for Search in Monolingual Corpora

Bies, A.

Developing and Using a Pilot Dialectal Arabic Treebank

Linguistic Resources for Speech Parsing

Bilotti, M.

Evaluation for Scenario Question Answering Systems

Bindi, R.

Proper Names and Linguistic Dynamics

Bird, S.

Reconsidering Language Identification for Written Language Resources

Bisetto, A.

MORBO/COMP: a multilingual database of compound words

Bishop, E.

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Blache, P.

Black, W.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

Blackler, K.

Blaylock, N.

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Blitz, R.

CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project

Boekestein, M.

Functioning of the Centre for Dutch Language and Speech Technology

Boelhouwer, B.

Exploring opportunities for Comparability and Enrichment by Linking lexical databases

Boella, G.

A Development Tool For Multilingual Ontology-based Conceptual

Bogacka, A.

General and Task-Specific Corpus Resources for Polish Adult Learners of English

Boguraev, B.

Analysis of TimeBank as a Resource for TimeML Parsing

Boguslavsky, I.

A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects

Bojar, O.

Czech-English Word Alignment

Bolognesi, A.

POS tagset design for Italian

Bonafonte, A.

ECESS Inter-Module Interface Specification for Speech Synthesis

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Acceptance Testing of a Spoken Language Translation System

Spanish Synthesis Corpora

GAIA: Common Framework for the Development of Speech Translation Technologies

Bonastre, J.

Towards automatic transcription of Somali language

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Bonneau-maynard, H.

Results of the French Evalda-Media evaluation campaign for literal understanding

Bontcheva, K.

User-friendly ontology authoring using a controlled language

Bordoni, L.

Towards an Ontology for Art and Colours

Boril, H.

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

Boril, T.

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

Bosch, A.

Identifying Named Entities in Text Databases from the Natural History Domain

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

Bosch, S.

Towards machine-readable lexicons for South African Bantu languages

Bosco, C.

Comparing linguistic information in treebank annotations

Bouamrane, M.

Gathering a corpus of multimodal computer-mediated meetings

Bouillon, P.

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

Boullier, P.

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

Deep non-probabilistic parsing of large corpora

Bouquet, P.

The role of lexical resources in matching classification schemas

Boves, L.

User requirements analysis for the design of a reference corpus of written Dutch

Data for question answering: The case of why

A Unified Structure for Dutch Dialect Dictionary Data

Braffort, A.

Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data

Braga, D.

Progmatica: A Prosodic Database for European Portuguese

Branco, A.

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Brekke, M.

Automatic Term Extraction from Knowledge Bank of Economics

Breuer, S.

Set-up of a Unit-Selection Synthesis with a Prominent Voice

ECESS Inter-Module Interface Specification for Speech Synthesis

Brew, C.

A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources

Brewster, C.

An Incremental Tri-Partite Approach To Ontology Learning

Briscoe, T.

A Large Subcategorization Lexicon for Natural Language Processing Applications

Broeder, D.

Comparison of Resource Discovery Methods

LAMUS: the Language Archive Management and Upload System

Foundations of Modern Language Resource Archives

Metadata Profile in the ISO Data Category Registry

Technologies for a Federation of Language Resource Archives

Brugman, H.

A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs

ELAN: a Professional Framework for Multimodality Research

Brännström, A.

SYMBERED - a Symbol-Concept Editing Tool

Buckwalter, T.

Developing and Using a Pilot Dialectal Arabic Treebank

Lexicon Development for Varieties of Spoken Colloquial Arabic

Buehler, D.

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign

Buitelaar, P.

Ontology-based Information Extraction with SOBA

Bungeroth, J.

A German Sign Language Corpus of the Domain Weather Report

Bunt, H.

Methodological Aspects of Semantic Annotation

Dimensions in Dialogue Act Annotation

Burchardt, A.

SALTO - A Versatile Multi-Level Annotation Tool

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Burger, S.

Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages

Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus

Buscaldi, D.

Mining Knowledge fromWikipedia for the Question Answering task

Bustamante, F.

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Spelling Error Patterns in Spanish for Word Processing Applications

Byron, D.

PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

The OSU Quake 2004 corpus of two-party situated problem-solving dialogs

Cabral, L.

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

Cabré, M.

SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations

Calistri-yeh, R.

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Calvanese, D.

Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

Calzolari, F.

Next Generation Language Resources using Grid

Calzolari, N.

Moving to dynamic computational lexicons with LeXFlow

Lexical Markup Framework (LMF)

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Next Generation Language Resources using Grid

Campbell, J.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Campbell, N.

Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow

Capilla, J.

The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level

Caracciolo, C.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Cardoso, N.

HAREM: An Advanced NER Evaluation Contest for Portuguese

Caridakis, G.

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Carl, M.

METIS-II: Machine Translation for Low Resource Languages

Carletta, J.

Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus

Cartoni, B.

Dealing with unknown words by simple decomposition: feasibility studies with Italian prefixes.

Casas, B.

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Caselli, T.

Annotating Bridging Anaphors in Italian: in Search of Reliability

Castagnoli, S.

The LOIS Project

Casteleiro, J.

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Castellón, I.

The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level

Castro, M.

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Catizone, R.

Evaluating Automatically Generated Timelines from the Web

Cavalli-sforza, V.

IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler

Ceauşu, A.

Dependency-Based Phrase Alignment

Acquis Communautaire Sentence Alignment using Support Vector Machines

Cerezo, P.

MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España

Usability evaluation of 3G multimodal services in Telefónica Móviles España

Chalamandaris, A.

All Greek to me! An automatic Greeklish to Greek transliteration system

Language identification from suprasegmental cues: Speech synthesis of Greek utterances from different dialectal variations.

Chalvin, A.

Dictionary Building with the Jibiki Platform: the GDEF case

Chamnongthai, K.

Semantic-Based Keyword Recovery Function for Keyword Extraction System

Charniak, E.

Word Knowledge Acquisition for Computational Lexicon Construction

Charoenporn, T.

Chateau, N.

Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction

Chatzichrisafis, N.

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

Chaudiron, S.

Techno-langue: The French National Initiative for Human Language Technologies (HLT)

Chen, H.

Constructing a Named Entity Ontology from Web Corpora

Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks

Chen, L.

An Open Source Prosodic Feature Extraction Tool

Chenfour, N.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Cheng, D.

Sentiments on a Grid: Analysis of Streaming News and Views

Chesley, P.

Automatic extraction of subcategorization frames for French

Cheung, L.

Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort

Chiao, Y.

Evaluation of multilingual text alignment systems: the ARCADE II project

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

Chiari, I.

Slips and errors in spoken data transcription

Choi, K.

Compiling large language resources using lexical similarity metrics for domain taxonomy learning

Chou, Y.

Hantology-A Linguistic Resource for Chinese Language Processing and Studying

Choukri, K.

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

The BLARK concept and BLARK for Arabic

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Evaluation of multimodal components within CHIL: The evaluation packages and results

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

TC-STAR: New language resources for ASR and SLT purposes

Chvatalova, K.

Annotation Guidelines for Czech-English Word Alignment

Cieri, C.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Integrated Linguistic Resources for Language Exploitation Technologies

Low-cost Customized Speech Corpus Creation for Speech Technology Applications

More Data and Tools for More Languages and Research Areas: A Progress Report on LDC Activities

Corpus Support for Machine Translation at LDC

Cimiano, P.

Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

Ontology-based Information Extraction with SOBA

Cinkova, S.

Semi-automatic Building of Swedish Collocation Lexicon

Cinková, S.

From PropBank to EngValLex: Adapting the PropBank-Lexicon to the Valency Theory of the Functional Generative Description

Ciobanu, D.

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language

Ciravegna, F.

A Methodology and Tool for Representing Language Resources for Information Extraction

An Incremental Tri-Partite Approach To Ontology Learning

Civera, J.

Bilingual Machine-Aided Indexing

Claus, A.

LAMUS: the Language Archive Management and Upload System

Clavel, C.

Fear-type emotions of the SAFE Corpus: annotation issues

Clément, L.

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

Coelho, L.

Progmatica: A Prosodic Database for European Portuguese

Cole, A.

Integrated Linguistic Resources for Language Exploitation Technologies

Corpus Development and Publication

Colletta, J.

Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction

Comelles, E.

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Compernolle, D.

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Copestake, A.

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Coppen, P.

Data for question answering: The case of why

Corti, L.

Component Evaluation in a Question Answering System

Costa, L.

Coussé, E.

Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus

Couturier, J.

Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

Cowie, R.

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Cramer, I.

Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia

Cristea, D.

Temporality in relation with discourse structure

Transferring Coreference Chains through Word Alignment

Crocco, C.

Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).

Crysmann, B.

Exploring HPSG-based Treebanks for Probabilistic Parsing HPSG grammar extraction

Csirik, J.

A highly accurate Named Entity corpus for Hungarian

Cucchiarini, C.

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources

Cucurullo, N.

Dialectal resources on-line: the ALT-Web experience

Cucurullo, S.

Next Generation Language Resources using Grid

Cunningham, H.

User-friendly ontology authoring using a controlled language

Creating Tools for Morphological Analysis of Sumerian

Cutugno, F.

Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).

An observatory on Spoken Italian linguistic resources and descriptive standards.

Cvetko-orešnik, V.

Building a resource for studying translation shifts

Cyrus, L.

D'halleweyn, E.

The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources

D'haro, L.

Error Analysis of Statistical Machine Translation Output

Dabbadie, M.

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Daelemans, W.

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Dale, R.

Named Entity Extraction with Conjunction Disambiguation

Dalianis, H.

Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST's Lemmatiser

Dalli, A.

Evaluating Automatically Generated Timelines from the Web

Dannélls, D.

Recognizing Acronyms and their Definitions in Swedish Medical Texts

Darwish, K.

Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images

Daumke, P.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Debili, F.

Exploiting text for extracting image processing resources

Declerck, T.

SynAF: Towards a Standard for Syntactic Annotation

Multilingual Lexical Semantic Resources for Ontology Translation

Generic NLP Tools for Supporting Shallow Ontology Building

Dell’orletta, F.

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Deléglise, P.

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

Deng, X.

Towards Unified Chinese Segmentation Algorithm

Evaluation of Stop Word Lists in Chinese Language

Denis, A.

A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

Results of the French Evalda-Media evaluation campaign for literal understanding

Depoorter, G.

Functioning of the Centre for Dutch Language and Speech Technology

Derivière, J.

The ALVIS Format for Linguistically Annotated Documents

Derouin, M.

Lemma-oriented dictionaries, concept-oriented terminology and translation memories

Desipri, E.

Adding multi-layer semantics to the Greek Dependency Treebank

Desypri, E.

Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue

Devillers, L.

Annotation of Emotions in Real-Life Video Interviews: Variability between Coders

Fear-type emotions of the SAFE Corpus: annotation issues

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Dhonnchadha, E.

A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation

Diab, M.

Developing and Using a Pilot Dialectal Arabic Treebank

Diamantaras, K.

Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime

Diaz de Ilarraza, A.

Structure, Annotation and Tools in the Basque ZT Corpus

Dini, L.

Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

Dinu, A.

On the data base of Romanian syllables and some of its quantitative and cryptographic aspects

Dinu, L.

On the data base of Romanian syllables and some of its quantitative and cryptographic aspects

Dipersio, D.

Integrated Linguistic Resources for Language Exploitation Technologies

Dobrov, B.

Development of Linguistic Ontology on Natural Sciences and Technology

Doddington, G.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Dorow, B.

Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain

Dorr, B.

SParseval: Evaluation Metrics for Parsing Speech

Parallel Syntactic Annotation of Multiple Languages

Douglas-cowie, E.

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Douxchamps, D.

Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow

Dragon, R.

Set-up of a Unit-Selection Synthesis with a Prominent Voice

Draxler, C.

Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation

Dreuw, P.

A German Sign Language Corpus of the Domain Weather Report

Drouin, P.

Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

Ducret, J.

Metadata Profile in the ISO Data Category Registry

An API for accessing the Data Category Registry

Dutoit, D.

Alexandria: A Powerful Multilingual Resource for Web

Dybkjær, H.

Act-Topic Patterns for Automatically Checking Dialogue Models

Dybkjær, L.

H. C. Andersen Conversation Corpus

Act-Topic Patterns for Automatically Checking Dialogue Models

Dziubalska-kołaczyk, K.

General and Task-Specific Corpus Resources for Polish Adult Learners of English

D’alessandro, C.

A joint prosody evaluation of French text-to-speech synthesis systems

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign

Džeroski, S.

Spelling Error Patterns in Spanish for Word Processing Applications

Díaz, E.

Ehrette, T.

Fear-type emotions of the SAFE Corpus: annotation issues

Eisele, A.

Parallel Corpora and Phrase-Based Statistical Machine Translation for New Language Pairs via Multiple Intermediaries

El-bèze, M.

Question Answering Evaluation Survey

Elisei, F.

Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?

Elkateb, S.

CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project

Elson, D.

Emam, O.

Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images

Enea, A.

Next Generation Language Resources using Grid

Erjavec, T.

Towards a Slovene Dependency Treebank

The English-Slovene ACQUIS corpus

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

Building Slovene WordNet

Erk, K.

SALTO - A Versatile Multi-Level Annotation Tool

Shalmaneser - A Toolchain For Shallow Semantic Parsing

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Erp, M.

Identifying Named Entities in Text Databases from the Natural History Domain

Espeja, S.

New tools for the encoding of lexical data extracted from corpus

Estellés, A.

Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology

Estrela, A.

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Estrella, P.

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

Estève, Y.

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

Esuli, A.

SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining

Etxeberria, J.

A methodology for the joint development of the Basque WordNet and Semcor

A Preliminary Study for Building the Basque PropBank

Eugenio, B.

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

Evans, G.

Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

Ezeiza, N.

Structure, Annotation and Tools in the Basque ZT Corpus

Fairon, C.

A translated corpus of 30,000 French SMS

A framework for real-time dictionary updating

Fakotakis, N.

Dealing with Imbalanced Data using Bayesian Techniques

Farkas, R.

A highly accurate Named Entity corpus for Hungarian

Farre, B.

SYMBERED - a Symbol-Concept Editing Tool

Farwell, D.

Parallel Syntactic Annotation of Multiple Languages

Fayyaz, S.

What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic

Febrer, A.

Generation of Language Resources for the Development of Speech Technologies in Catalan

Feldman, A.

A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources

Felföldi, L.

A highly accurate Named Entity corpus for Hungarian

Feliu, J.

SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations

Fellbaum, C.

KUNSTI - Knowledge Generation for Norwegian Language Technology

Fenstad, J.

Fernandes, F.

A BLARK extension for temporal annotation mining

Fernández-montraveta, A.

The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level

Ferreira, E.

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Ferret, O.

Building a network of topical relations from a corpus

Fersøe, H.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Filipe, P.

A Framework to Integrate Ubiquitous Knowledge Modeling

Filipowska, A.

Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech

Fiscus, J.

Fišer, T.

Building Slovene WordNet

Fliedner, G.

Towards Natural Interactive Question Answering

Fluart, F.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

Fluhr, C.

Exploiting text for extracting image processing resources

A Deep Linguistic Analysis for Cross-language Information Retrieval

Forăscu, C.

Temporality in relation with discourse structure

Forslund, A.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

Forst, M.

The importance of precise tokenizing for deep grammars

Improving coverage and parsing quality of a large-scale LFG for German

Fosler-lussier, E.

The OSU Quake 2004 corpus of two-party situated problem-solving dialogs

Foster, J.

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

Francopoulo, G.

Lexical Markup Framework (LMF)

Frank, A.

SALTO - A Versatile Multi-Level Annotation Tool

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Frasnelli, E.

Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

Freitas, D.

Progmatica: A Prosodic Database for European Portuguese

Friedrich, J.

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Frota, S.

FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text

Frère, Y.

Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime

Fujii, A.

Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIRWorkshop

Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus

Fujita, A.

Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences

Fukumoto, J.

Automated Summarization Evaluation with Basic Elements.

Gaizauskas, R.

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

Language Resources for Background Gathering

Galiano, I.

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Galliano, S.

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Gangemi, A.

Conversion of WordNet to a standard RDF/OWL representation

Gantner, Z.

Multilingual Lexical Semantic Resources for Ontology Translation

Garbis, A.

Dealing with Imbalanced Data using Bayesian Techniques

Garcia, M.

A joint prosody evaluation of French text-to-speech synthesis systems

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign

Evaluation of multimodal components within CHIL: The evaluation packages and results

Gavrilidou, M.

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Gazendam, L.

A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs

Gegg-harrison, W.

PYCOT: An Optimality Theory-based Pronoun Resolution Toolkit

Geilfuss, M.

SAM - an annotation editor for parallel texts

Genabith, J.

A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation

Geoffrois, E.

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

George, M.

Lexical Markup Framework (LMF)

Georgescul, M.

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface

Gerstenberger, C.

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Geyken, A.

LexikoNet - a lexical database based on type and role hierarchies

Giampicollo, D.

A BLARK extension for temporal annotation mining

Discourse functions of duration in Mandarin: resource design and implementation

Feature-based Encoding and Querying Language Resources with Character Semantics

Gibbon, D.

Gillam, L.

Sentiments on a Grid: Analysis of Streaming News and Views

Gillard, L.

Question Answering Evaluation Survey

Gillis, S.

Regional Bias in the Broad Phonetic Transcriptions of the Spoken Dutch Corpus

Giménez, J.

Iqmt: A Framework for Automatic Machine Translation Evaluation

Ginés, M.

A Spell Checker for a World Language: The New Microsoft’s Spanish Spell Checker

Giouli, V.

Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Giovannetti, E.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Giral, A.

CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project

Girardi, C.

A Grapheme-Based Approach for Accent Restoration in Gikuyu

Githinji, P.

Godfrey, J.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Goecke, D.

Exploiting logical document structure for anaphora resolution

Goggi, S.

Natural Language Processing: A Terminological and Statistical Approach

Gollan, C.

TC-STAR: New language resources for ASR and SLT purposes

González, M.

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Gonçalves, J.

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Gotti, F.

MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

Goux, P.

The Look and Feel of a Confident Entailer

Graesser, A.

Graff, D.

Lexicon Development for Varieties of Spoken Colloquial Arabic

Granfeldt, J.

CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

Grau, B.

FRASQUES: A Question Answering system in the EQueR evaluation campaign

EQueR: the French Evaluation campaign of Question-Answering Systems

Gravier, G.

Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News

Green, A.

Developing a ContextualizedMultimodal Corpus for Human-Robot Interaction

Green, R.

Parallel Syntactic Annotation of Multiple Languages

Grefenstette, G.

Exploiting text for extracting image processing resources

Grishina, E.

Spoken Russian in the Russian National Corpus (RNC)

Gros, J.

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Rule-Based Chunking and Reusability

Grover, C.

Gruenstein, J.

NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

Grégoire, N.

Elaborating the parameterized Equivalence Class Method for Dutch

Grønnum, N.

DanPASS - A Danish Phonetically Annotated Spontaneous Speech Corpus

Guevara, E.

MORBO/COMP: a multilingual database of compound words

Guirao, J.

Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

The wraetlic NLP suite

Gurrutxaga, A.

Structure, Annotation and Tools in the Basque ZT Corpus

Guthrie, D.

Guthrie, L.

Collection, Encoding and Linguistic Processing of a Swedish Medical Corpus - The MEDLEX Experience

Göteborgs, D.

Haamid, S.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Habash, N.

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Developing and Using a Pilot Dialectal Arabic Treebank

Parallel Syntactic Annotation of Multiple Languages

Inter-annotator Agreement on a Multilingual Semantic Annotation Task

Haddad, B.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Hafezi, N.

Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems

Hahn, U.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Hain, H.

ECESS Inter-Module Interface Specification for Speech Synthesis

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Hajic, J.

Perspectives of Turning Prague Dependency Treebank into a Knowledge Base

Hajicova, E.

Corpus Annotation as a Test of a Linguistic Theory

Hale, J.

Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

Hall, J.

Halpin, H.

Automatic Evaluation and Composition of NLP Pipelines with Web Services

Halácsy, P.

Morphdb.hu: Hungarian lexical database and morphological grammar

Using a morphological analyzer in high precision POS tagging of Hungarian

Hamabe, R.

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Hamme, H.

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

Hamon, O.

Evaluation of Automatic Speech Recognition and Speech Language Translation within TC-STAR:Results from the first evaluation campaign

X-Score: Automatic Evaluation of Machine Translation Grammaticality

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

Hamon, T.

The ALVIS Format for Linguistically Annotated Documents

Han, S.

Evaluation of Stop Word Lists in Chinese Language

Hana, J.

A Cross-language Approach to Rapid Creation of New Morpho-syntactically Annotated Resources

Hansen, T.

Field Evaluation of a Single-Word Pronunciation Training System

Hanser, S.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Harabagiu, S.

An Answer Bank for Temporal Inference

Impact of Question Decomposition on the Quality of Answer Summaries

Harper, M.

SParseval: Evaluation Metrics for Parsing Speech

Linguistic Resources for Speech Parsing

An Open Source Prosodic Feature Extraction Tool

Hartley, A.

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Using collocations from comparable corpora to find translation equivalents

Hartley, T.

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language

Hartrumpf, S.

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

Hartung, M.

Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

Hasan, S.

Creating a Large-Scale Arabic to French Statistical MachineTranslation System

Hashimoto, K.

NPs for Events: Experiments in Coreference Annotation

Computer-aided summarisation -- what the user really wants

Hasler, L.

Hassel, M.

Towards Holistic Summarization -- Selecting Summaries, Not Sentences

Haton, J.

Linguistic features modeling based on Partial New Cache

Havasi, C.

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

BULB: A Unified Lexical Browser

Hayashi, Y.

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

Hayashida, N.

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Heid, B.

KUNSTI - Knowledge Generation for Norwegian Language Technology

Heid, U.

Extraction tools for collocations and their morphosyntactic specificities

Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho

Hein, A.

Building a Swedish-Turkish Parallel Corpus

Helbig, H.

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

Hellan, L.

Word Sense Disambiguation and Semantic Disambiguation for Construction Types in Deep Processing Grammars

Helmreich, S.

Parallel Syntactic Annotation of Multiple Languages

Henriksen, L.

EuroTermBank - a Terminology Resource based on Best Practice

The MULINCO corpus and corpus platform

Hernáez, I.

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Herwijnen, O.

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

Heuvel, H.

Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

A Unified Structure for Dutch Dialect Dictionary Data

TC-STAR: New language resources for ASR and SLT purposes

Heylen, D.

Annotating Emotions in Meetings

Hickl, A.

What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic

Impact of Question Decomposition on the Quality of Answer Summaries

Higashinaka, R.

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

Higgins, M.

Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

The Information Commons Gazetteer

Hirano, T.

Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences

Hiroshi, I.

Exploiting Multiple Semantic Resources for Answer Selection

Hiyakumoto, L.

Hjelm, H.

Extraction of Cross Language Term Correspondences

Hlaváèová, J.

New Approach to Frequency Dictionaries - Czech Example

Hobbs, J.

An Annotated Corpus of Typical Durations of Events

Hockey, B.

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

Hodász, G.

Evaluation Methods of a Linguistically Enriched Translation Memory System

Hoege, H.

Human and machine recognition as a function of SNR

Hoffmann, H.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Hofland, K.

The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language

Hoidekr, J.

Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

Horacek, H.

A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material

Hoste, V.

KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

Hout, R.

A Unified Structure for Dutch Dialect Dictionary Data

Hovy, E.

Parallel Syntactic Annotation of Multiple Languages

Summarizing Answers for Complicated Questions

Automated Summarization Evaluation with Basic Elements.

Huang, C.

Hantology-A Linguistic Resource for Chinese Language Processing and Studying

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

Huang, S.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Huang, Z.

An Open Source Prosodic Feature Extraction Tool

Hughes, B.

Reconsidering Language Identification for Written Language Resources

Searching for Language Resources on the Web: User Behaviour in the Open Language Archives Community

Feature-based Encoding and Querying Language Resources with Character Semantics

Huh, K.

Morphological annotation of Korean with Directly Maintainable Resources

Human, C.

An Answer Bank for Temporal Inference

Höge, H.

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Hüttenrauch, H.

Developing a ContextualizedMultimodal Corpus for Human-Robot InteractionKeywords: Multimodal corpus, Human-Robot Interaction, Spatiality,Dialogue systems, Gesture

Ibekwe-sanjuan, F.

A task-oriented framework for evaluating theme detection systems: A discussion paper

Ide, N.

Representing Linguistic Corpora and Their Annotations

Integrating Linguistic Resources: The American National Corpus Model

Ignat, C.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

Iida, R.

Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences

Ikehara, S.

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Imran, S.

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Imura, M.

Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow

Inagaki, Y.

Layered Speech-Act Annotation for Spoken Dialogue Corpus

A Syntactically Annotated Corpus of Japanese Spoken Monologue

A Corpus Search System Utilizing Lexical Dependency Structure

Inkpen, D.

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

Innselset, K.

Automatic Term Extraction from Knowledge Bank of Economics

Inui, K.

Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences

Inácio, S.

Annotating COMPARA, a Grammar-aware Parallel Corpus

Iomdin, B.

A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects

Iomdin, L.

A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects

Ion, R.

Dependency-Based Phrase Alignment

Ionescu, E.

Romanian Valence Dictionary in XML Format

Ircing, P.

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

Iria, J.

A Methodology and Tool for Representing Language Resources for Information Extraction

An Incremental Tri-Partite Approach To Ontology Learning

Irie, Y.

Layered Speech-Act Annotation for Spoken Dialogue Corpus

Irimia, E.

RoCo-News: A Hand Validated Journalistic Corpus of Romanian

Isahara, H.

Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Blind Evaluation for Thai Search Engines

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning

Word Knowledge Acquisition for Computational Lexicon Construction

Detection of inconsistencies in concept classifications in a large dictionary - Toward an improvement of the EDR electronic dictionary

A Conditional Random Field Framework for Thai Morphological Analysis

Getting Deeper Semantics than Berkeley FrameNet with MSFA

Isbihani, A.

Creating a Large-Scale Arabic to French Statistical MachineTranslation System

Ishida, T.

A Dictionary Model for Unifying Machine Readable Dictionaries and Computational Concept Lexicons

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Islam, M.

Second Order Co-occurrence PMI for Determining the Semantic Similarity of Words

Itagaki, M.

Detecting Inter-domain Semantic Shift using Syntactic Similarity

Itahashi, S.

Oriental COCOSDA: Past, Present and Future

Itai, A.

A Computational Lexicon of Contemporary Hebrew

Itou, K.

Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus

Ivanovska-naskova, R.

Development of the First LRs for Macedonian: Current Projects

Iwahashi, N.

Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow

Iwayama, M.

Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIRWorkshop

Izagirre, E.

A methodology for the joint development of the Basque WordNet and Semcor

Jacobson, M.

Interoperability of audio corpora : the case of the French corpora

Jakopin, P.

Jansen, H.

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus

Jekosch, U.

Jellinghaus, M.

Multilingual parallel treebanking: a lean and flexible approach

Jin, H.

Lexicon Development for Varieties of Spoken Colloquial Arabic

Johanson, É.

Building a Swedish-Turkish Parallel Corpus

Johansson, C.

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

Johansson, R.

Extraction of Temporal Information from Texts in Swedish

Construction of a FrameNet Labeler for Swedish Text

Johnson, M.

Towards machine-readable lexicons for South African Bantu languages

Jones, J.

Jones, R.

Tools and resources for speech synthesis arising from a Welsh TTS project

Jongejan, B.

Hand-crafted versus Machine-learned Inflectional Rules: The Euroling-SiteSeeker Stemmer and CST's Lemmatiser

Jouis, C.

Hierarchical Relationships "is-a": Distinguishing Belonging, Inclusion and Part/of Relationships.

Juan, A.

Bilingual Machine-Aided Indexing

Jung, J.

Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network

Justo, R.

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Jänsch, K.

Speech Recordings in Public Schools in Germany - the Perfect Show Case for Web-based Recordings and Annotation

Jørgensen, L.

The BLARK concept and BLARK for Arabic

Kabadjov, M.

SINOD - Slovenian non-native speech database

Kacic, Z.

Kacmarcik, G.

RefRef: A Tool for Viewing and Exploring Coreference Space

Kageura, K.

A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC)

Kahn, J.

Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus

Kaiser, M.

Kaisser, M.

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Kaji, H.

Automatic Construction of Japanese WordNet

Kakihana, K.

On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language

Kamp, P.

Technologies for a Federation of Language Resource Archives

Kanamaru, T.

Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning

Kando, N.

Test Collections for Patent Retrieval and Patent Classification in the Fifth NTCIRWorkshop

Kanzaki, K.

Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives

Detection of inconsistencies in concept classifications in a large dictionary - Toward an improvement of the EDR electronic dictionary

Kaplan, R.

The importance of precise tokenizing for deep grammars

Karpouzis, K.

Manual Annotation and Automatic Image Processing of Multimodal Emotional Behaviours: Validating the Annotation of TV Interviews

Kashioka, H.

A Syntactically Annotated Corpus of Japanese Spoken Monologue

Kato, N.

A Syntactically Annotated Corpus of Japanese Spoken Monologue

Kato, T.

Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria

Kato, Y.

A Corpus Search System Utilizing Lexical Dependency Structure

Katz, B.

Using Semantic Overlap Scoring in Answering TREC Relationship Questions

US-based Method for Speech Reception Threshold Measurement in French

Kawaguchi, N.

Layered Speech-Act Annotation for Spoken Dialogue Corpus

Kawahara, D.

Case Frame Compilation from the Web using High-Performance Computing

Kawahara, T.

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Kawanami, H.

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

Kawtrakul, A.

Ontology Driven K-Portal Construction and K-Service Provision

Keller, E.

ECESS Inter-Module Interface Specification for Speech Synthesis

Kemps-snijders, M.

LEXUS, a web-based tool for manipulating lexical resources lexicon

Ontology-based Language Archive Utilization

An API for accessing the Data Category Registry

Kepler, F.

A novel Textual Encoding paradigm based on Semantic Web tools and semantics

Kermanidis, K.

Dealing with Imbalanced Data using Bayesian Techniques

Kiefer, B.

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Kiilerich, S.

H. C. Andersen Conversation Corpus

Field Evaluation of a Single-Word Pronunciation Training System

Kikui, G.

A Self-Referring Quantitative Evaluation of the ATR Basic Travel Expression Corpus (BTEC)

Kim, J.

Linguistic and Biological Annotations of Biological Interaction Events

Kimler, M.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

King, M.

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

Evaluating Symbiotic Systems: the challenge

Kinoshita, J.

CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus

Kipper, K.

Extending VerbNet with Novel Verb Classes

Kiss, I.

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Kitamura, T.

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

Kiyota, Y.

A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

Klakow, D.

Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia

Klassmann, A.

Comparison of Resource Discovery Methods

ELAN: a Professional Framework for Multimodality Research

Klatt, S.

A Corpus-based Approach to the Interpretation of Unknown Words with an Application to German

Klavans, J.

CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project

Klein, W.

Foundations of Modern Language Resource Archives

Knippen, R.

Annotation of Temporal Relations with Tango

Ko, J.

Exploiting Multiple Semantic Resources for Answer Selection

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior

Kocsor, A.

A highly accurate Named Entity corpus for Hungarian

Kogure, K.

Recognizing Acronyms and their Definitions in Swedish Medical Texts

Kokkinakis, D.

Kongkachandra, R.

Semantic-Based Keyword Recovery Function for Keyword Extraction System

Konings, N.

Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names

Konstandinidis, A.

Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue

Kordoni, V.

Automated Deep Lexical Acquisition for Robust Open Texts Processing

Korhonen, A.

A Large Subcategorization Lexicon for Natural Language Processing Applications

Extending VerbNet with Novel Verb Classes

Kornai, A.

Using a morphological analyzer in high precision POS tagging of Hungarian

Kotzé, P.

Finite state tokenisation of an orthographical disjunctive agglutinative language: The verbal segment of Northern Sotho

Koutsombogera, M.

Adding multi-layer semantics to the Greek Dependency Treebank

Kouylekov, M.

Building a Large-Scale Repository of Textual Entailment Rules

Kowalski, A.

SALTO - A Versatile Multi-Level Annotation Tool

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Kraif, O.

Evaluation of multilingual text alignment systems: the ARCADE II project

Krasnyanskaya, A.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

The BLARK concept and BLARK for Arabic

Krauwer, S.

Kremer, G.

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Kristiansen, M.

Automatic Term Extraction from Knowledge Bank of Economics

Krstev, C.

Word Knowledge Acquisition for Computational Lexicon Construction

A Conditional Random Field Framework for Thai Morphological Analysis

Kruengkrai, C.

Kruijff-korbayova, I.

A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Kruijff-korbayová, I.

Annotation Guidelines for Czech-English Word Alignment

Kruijsen, J.

A Unified Structure for Dutch Dialect Dictionary Data

Kruschwitz, U.

A Large Subcategorization Lexicon for Natural Language Processing Applications

Krymolowski, Y.

Krynicki, G.

General and Task-Specific Corpus Resources for Polish Adult Learners of English

Ku, L.

Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks

Kugler, U.

Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

Kuhn, A.

Results of the French Evalda-Media evaluation campaign for literal understanding

Kuhn, J.

Multilingual parallel treebanking: a lean and flexible approach

Kulick, S.

Linguistic Resources for Speech Parsing

Kuroda, K.

Getting Deeper Semantics than Berkeley FrameNet with MSFA

Kurohashi, S.

Case Frame Compilation from the Web using High-Performance Computing

Kuwahara, N.

KUNSTI - Knowledge Generation for Norwegian Language Technology

Kvale, K.

Kwong, O.

Toward a Pan-Chinese Thesaurus

Labropoulou, P.

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Lacatusu, F.

Impact of Question Decomposition on the Quality of Answer Summaries

Lafourcade, M.

Conceptual Vector Learning - Comparing Bootstrapping from a Thesaurus or Induction by Emergence

Lai, T.

Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort

Laib, M.

A Deep Linguistic Analysis for Cross-language Information Retrieval

Lamel, L.

Experimental detection of vowel pronunciation variants in Amharic

Langkilde-geary, I.

A Factored Functional Dependency Transformation of the English Penn Treebank for Probabilistic Surface Generation

Langlais, P.

MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

Laoudi, J.

Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding

Laporte, E.

Morphological annotation of Korean with Directly Maintainable Resources

Laprun, C.

Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech

Laskowski, K.

Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus

Laurent, D.

Evaluation of multilingual text alignment systems: the ARCADE II project

Lavecchia, C.

Linguistic features modeling based on Partial New Cache

Le, A.

Edit Distance: A Metric for Machine Translation Evaluation

Le, P.

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Lease, M.

SParseval: Evaluation Metrics for Parsing Speech

Linguistic Resources for Speech Parsing

Lechenadec, G.

Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction

Ledinek, N.

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Linguistic Resources for Speech Parsing

Lee, H.

Lefevre, F.

Results of the French Evalda-Media evaluation campaign for literal understanding

Lehireche, A.

Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain

Lehmann, J.

What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic

Leidner, J.

Building an Evaluation Corpus for German Question Answering by Harvesting Wikipedia

Lenci, A.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Lenzi, V.

Lepetit, X.

From Natural Language to Databases via Ontologies

A Development Tool For Multilingual Ontology-based Conceptual

Lesmo, L.

Lesourd, H.

A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material

Leturia, I.

Structure, Annotation and Tools in the Basque ZT Corpus

Levin, L.

Parallel Syntactic Annotation of Multiple Languages

Levinson, S.

Foundations of Modern Language Resource Archives

Levy, R.

Tregex and Tsurgeon: tools for querying and manipulating tree data structures

Li, S.

Interaction between Lexical Base and Ontology with Formal Concept Analysis

The Design and Construction of A Chinese Collocation Bank

Li, Wei.

Mining Implicit Entities in Queries

Mining Implicit Entities in Queries

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

Li, Wen.

Interaction between Lexical Base and Ontology with Formal Concept Analysis

Mining Implicit Entities in Queries

Liang, Y.

Tagging Heterogeneous Evaluation Corpora for Opinionated Tasks

Liberman, M.

Ligozat, A.

FRASQUES: A Question Answering system in the EQueR evaluation campaign

Lin, C.

Summarizing Answers for Complicated Questions

Automated Summarization Evaluation with Basic Elements.

Lin, F.

Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

Lin, M.

Constructing a Named Entity Ontology from Web Corpora

Lisowska, A.

ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

The Evolution of an Evaluation Framework for a Text Mining System

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems

Littman, J.

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

Liu, W.

SParseval: Evaluation Metrics for Parsing Speech

Linguistic Resources for Speech Parsing

Liu, Y.

Lleida, E.

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Lombardo, V.

Comparing linguistic information in treebank annotations

Lopatkova, M.

Valency Lexicon of Czech Verbs: Alternation-Based Model

López de Letona, I.

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Loukachevitch, N.

Development of Linguistic Ontology on Natural Sciences and Technology

Lowery, K.

A Hebrew Tree Bank Based on Cantillation Marks

Lu, Q.

Interaction between Lexical Base and Ontology with Formal Concept Analysis

Mining Implicit Entities in Queries

The Design and Construction of A Chinese Collocation Bank

Luca, E.

Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods

Lucas, P.

Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

The Information Commons Gazetteer

Lundälv, M.

SYMBERED - a Symbol-Concept Editing Tool

Luning, J.

A Study on Terminology Extraction Based on Classified Corpora

Luz, S.

Gathering a corpus of multimodal computer-mediated meetings

L’homme, M.

A Methodology for Developing Multilingual Resources for Terminology

Lønning, J.

Discriminant-Based MRS Banking

Ma, Q.

Semantic Analysis of Abstract Nouns to Compile a Thesaurus of Adjectives

Ma, W.

Uniform and Effective Tagging of a Heterogeneous Giga-word Corpus

Ma, X.

Integrated Linguistic Resources for Language Exploitation Technologies

Champollion: A Robust Parallel Text Sentence Aligner

Corpus Support for Machine Translation at LDC

Maamouri, M.

Integrated Linguistic Resources for Language Exploitation Technologies

Developing and Using a Pilot Dialectal Arabic Treebank

Lexicon Development for Varieties of Spoken Colloquial Arabic

Maas, J.

BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction

Maccartney, B.

Generating Typed Dependency Parses from Phrase Structure Parses

Mackinlay, A.

Reconsidering Language Identification for Written Language Resources

Macklovitch, E.

TransType2 : The Last Word

Madsen, T.

Field Evaluation of a Single-Word Pronunciation Training System

Maeda, K.

Integrated Linguistic Resources for Language Exploitation Technologies

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Linguistic Resources for Speech Parsing

Low-cost Customized Speech Corpus Creation for Speech Technology Applications

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks

Maegaard, B.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

KUNSTI - Knowledge Generation for Norwegian Language Technology

The BLARK concept and BLARK for Arabic

The MULINCO corpus and corpus platform

Maffiolo, V.

Creation of a corpus of multimodal spontaneous expressions of emotions in Human-Machine Interaction

Magdy, W.

Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images

Magnini, B.

I-CAB: the Italian Content Annotation Bank

The Multilingual Question Answering Track at CLEF

Building a Large-Scale Repository of Textual Entailment Rules

Mah, C.

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Maia, B.

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

Mairesse, F.

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

Maks, I.

Exploring opportunities for Comparability and Enrichment by Linking lexical databases

Malaisé, V.

A Web Based General Thesaurus Browser to Support Indexing of Television and Radio Programs

Maltese, G.

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Mamede, N.

A Framework to Integrate Ubiquitous Knowledge Modeling

Mammini, M.

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Mangeot, M.

Dictionary Building with the Jibiki Platform: the GDEF case

Mani, I.

Annotation of Temporal Relations with Tango

Manning, C.

Generating Typed Dependency Parses from Phrase Structure Parses

Manurung, R.

Building a Lexical Database for an Interactive Joke-Generator

Manzano-macho, D.

Multilingual Lexical Semantic Resources for Ontology Translation

Maragoudakis, M.

Dealing with Imbalanced Data using Bayesian Techniques

Marchetti, A.

Moving to dynamic computational lexicons with LeXFlow

Marchi, S.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Marco, F.

Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts

Mareüil, P.

A joint prosody evaluation of French text-to-speech synthesis systems

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign

Mariani, J.

Techno-langue: The French National Initiative for Human Language Technologies (HLT)

Marimon, M.

New tools for the encoding of lexical data extracted from corpus

Marinelli, R.

Using Core Ontology for Domain Lexicon Structuring

Proper Names and Linguistic Dynamics

Markantonatou, S.

METIS-II: Machine Translation for Low Resource Languages

Markó, K.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Language Specific and Topic Focused Web Crawling

Markus, A.

SINOD - Slovenian non-native speech database

Marneffe, M.

Generating Typed Dependency Parses from Phrase Structure Parses

Martens, J.

Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names

Martin, A.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Martin, E.

Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data

Martin, F.

FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text

Martin, J.

Martin, M.

A Development Tool For Multilingual Ontology-based Conceptual

Martinez, C.

Bilingual speech corpus in two phonetically similar languages

Marton, G.

Using Semantic Overlap Scoring in Answering TREC Relationship Questions

Maruyama, T.

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Masaki, N.

Gathering a corpus of multimodal computer-mediated meetings

Masoodian, M.

Massimo, M.

Integrating Methods and LRs for Automatic Keyword Extraction from Open Domain Texts

Matsubara, S.

Layered Speech-Act Annotation for Spoken Dialogue Corpus

A Syntactically Annotated Corpus of Japanese Spoken Monologue

A Corpus Search System Utilizing Lexical Dependency Structure

Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus

Matsumoto, Y.

An Annotated Corpus Management Tool: ChaKi

Augmenting a Semantic Verb Lexicon with a Large Scale Collection of Example Sentences

Matusov, E.

Training a Statistical Machine Translation System without GIZA++

Mauclair, J.

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

Mauser, A.

Training a Statistical Machine Translation System without GIZA++

Maynard, D.

Creating Tools for Morphological Analysis of Sumerian

Mazur, P.

Named Entity Extraction with Conjunction Disambiguation

Mazze, A.

A Development Tool For Multilingual Ontology-based Conceptual

Mazzoli, T.

Towards an Ontology for Art and Colours

Mcrorie, M.

Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches

Medelyan, O.

Language Specific and Topic Focused Web Crawling

Medero, J.

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks

Medlock, B.

An Introduction to NLP-based Textual Anonymisation

Megyesi, B.

Building a Swedish-Turkish Parallel Corpus

Melichar, M.

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems

Melloni, C.

MORBO/COMP: a multilingual database of compound words

Melz, R.

Compiling large language resources using lexical similarity metrics for domain taxonomy learning

Mendes, A.

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Mendizabal, K.

A methodology for the joint development of the Basque WordNet and Semcor

Menezes, C.

CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus

Merino, D.

Meur, A.

Lemma-oriented dictionaries, concept-oriented terminology and translation memories

Meurer, P.

The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language

Michailidis, I.

Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime

Mieskes, M.

Part-of-Speech Tagging of Transcribed Speech

Miguel, A.

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Mihelic, A.

Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

Miháltz, M.

Milde, J.

The Eclipse Annotator: an extensible system for multimodal corpus creation

SAM - an annotation editor for parallel texts

Miller, K.

Parallel Syntactic Annotation of Multiple Languages

Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm

Minker, W.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Stochastic Spoken Natural Language Parsing in the Framework of the French MEDIA Evaluation Campaign

Mitamura, T.

Parallel Syntactic Annotation of Multiple Languages

Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior

Mititelu, V.

Romanian Valence Dictionary in XML Format

Mitkov, R.

If “it” were “then”, then when was “it”? Establishing the anaphoric role of “then”

Miyake, M.

Recurrent Markov Cluster (RMCL) Algorithm for the Refinement of the Semantic Network

Mohamed, Z.

Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

Monachesi, P.

A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS)

Monachini, M.

Monceau, L.

FRASQUES: A Question Answering system in the EQueR evaluation campaign

Montemagni, S.

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Dialectal resources on-line: the ALT-Web experience

Morărescu, P.

Principles for annotating and reasoning with spatial information

Morbidoni, C.

A novel Textual Encoding paradigm based on Semantic Web tools and semantics

Morel, M.

A joint prosody evaluation of French text-to-speech synthesis systems

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign

Moreno, A.

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Generation of Language Resources for the Development of Speech Technologies in Catalan

Spanish Synthesis Corpora

TC-STAR: New language resources for ASR and SLT purposes

Moreno-sandoval, A.

The wraetlic NLP suite

Moriceau, V.

Language Challenges for Data Fusion in Question-Answering

Morita, T.

A Tree Kernel approach to Question and Answer Classification in Question Answering Systems

Moschitti, A.

Mostefa, D.

Mouradi, A.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Movassagh, H.

Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems

Mukbel, C.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Mulkar, R.

An Annotated Corpus of Typical Durations of Events

Mulloni, A.

Automatic Detection of Orthographic Cues for Cognate Recognition

Murase, F.

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior

Murata, M.

Creation of a Japanese Adverb Dictionary that Includes Information on the Speaker’s Communicative Intention Using Machine Learning

Musacchio, M.

Ontological and Terminological Commitments and the Discourse of Specialist Communities

Mustafa el Hadi, W.

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Márquez, L.

Generation of Language Resources for the Development of Speech Technologies in Catalan

Mögele, H.

Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus

Möller, S.

Set-up of a Unit-Selection Synthesis with a Prominent Voice

Mühlenbock, K.

SYMBERED - a Symbol-Concept Editing Tool

KUNSTI - Knowledge Generation for Norwegian Language Technology

Nagi, M.

Building a Heterogeneous Information Retrieval Test Collection of Arabic Document Images

Nakagawa, H.

A Domain Ontology Production Tool Kit Based on Automatically Constructed Case Frames

Nakamura, S.

Oriental COCOSDA: Past, Present and Future

Nakasone, H.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Nam, J.

Morphological annotation of Korean with Directly Maintainable Resources

Nascimento, M.

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Nathan, D.

Technologies for a Federation of Language Resource Archives

Naumann, K.

NPs for Events: Experiments in Coreference Annotation

Navarretta, C.

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Navas, E.

Navigli, R.

Reducing the Granularity of a Computational Lexicon via an Automatic Mapping to a Coarse-Grained Sense Inventory

Nazarenko, A.

The ALVIS Format for Linguistically Annotated Documents

Nederhof, M.

LEXUS, a web-based tool for manipulating lexical resources lexicon, standards, tool

Negri, M.

Tree Searching/Rewriting Formalism

Nemec, P.

Nenadic, G.

Towards a terminological resource for biomedical text mining

Neumann, G.

Exploring HPSG-based Treebanks for Probabilistic Parsing HPSG grammar extraction

Neumann, H.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Neuvel, S.

Applying Lexical Constraints on Morpho-Syntactic Patterns for the Identification of Conceptual-Relational Content in Specialized Texts

Ney, H.

POS-based Word Reorderings for Statistical Machine Translation

Error Analysis of Statistical Machine Translation Output

Training a Statistical Machine Translation System without GIZA++

A German Sign Language Corpus of the Domain Weather Report

Creating a Large-Scale Arabic to French Statistical MachineTranslation System

Nezda, L.

What in the world is a Shahab?: Wide Coverage Named Entity Recognition for Arabic

Nguyen, T.

Evaluation of multilingual text alignment systems: the ARCADE II project

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Nicholson, J.

Reconsidering Language Identification for Written Language Resources

Niekrasz, J.

NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

Nilsson, J.

Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

Nimaan, A.

Towards automatic transcription of Somali language

Nimb, S.

LEXADV - a multilingual semantic Lexicon for Adverbs

Nissim, M.

The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text

Nitta, Y.

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Nivre, J.

Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation

MaltParser: A Data-Driven Parser-Generator for Dependency Parsing

Nocera, P.

Towards automatic transcription of Somali language

Nohama, P.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Noord, G.

Syntactic Annotation of Large Corpora in STEVIN

Noriko, S.

Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow

Novak, V.

Perspectives of Turning Prague Dependency Treebank into a Knowledge Base

Nováček, V.

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

Novák, A.

Morphological Tools for Six Small Uralic Languages

Nugues, P.

Extraction of Temporal Information from Texts in Swedish

CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

Construction of a FrameNet Labeler for Swedish Text

Nunes, F.

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Nyberg, E.

Evaluation for Scenario Question Answering Systems

Exploiting Multiple Semantic Resources for Answer Selection

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior

Nøklestad, A.

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

Nürnberger, A.

Rebuilding Lexical Resources for Information Retrieval using Sense Folder Detection and Merging Methods

Obrębski, T.

Syntactic Lexicon of Polish Predicative Nouns

Obradović, I.

UAM Text Tools - a flexible NLP architecture

Obrebski, T.

Odijk, J.

The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Oellinger, T.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

Oepen, S.

Discriminant-Based MRS Banking

Offenga, F.

Comparison of Resource Discovery Methods

LAMUS: the Language Archive Management and Upload System

Metadata Profile in the ISO Data Category Registry

Technologies for a Federation of Language Resource Archives

Offersgaard, L.

Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus

Ohishi, Y.

Ohno, T.

A Syntactically Annotated Corpus of Japanese Spoken Monologue

Ohta, T.

Linguistic and Biological Annotations of Biological Interaction Events

Ohtani, A.

Clustering acronyms in biomedical text for disambiguation

Towards a terminological resource for biomedical text mining

Okazaki, N.

Okumura, M.

Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing

Oliveira, S.

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Oliver, D.

Creation and analysis of a Polish speech database for use in unit selection synthesis

Oliver, T.

Visual Surveillance and Video Annotation and Description

Oltramari, A.

LexiPass methodology: a conceptual path from frames to senses and back

Onelli, C.

The DiaCORIS project: a diachronic corpus of written Italian

Oostdijk, N.

User requirements analysis for the design of a reference corpus of written Dutch

Data for question answering: The case of why

Orasan, C.

NPs for Events: Experiments in Coreference Annotation

Computer-aided summarisation -- what the user really wants

Transferring Coreference Chains through Word Alignment

Oravecz, C.

Using a morphological analyzer in high precision POS tagging of Hungarian

Ordelman, R.

Annotating Emotions in Meetings

Orsolini, P.

Natural Language Processing: A Terminological and Statistical Approach

Osenova, P.

The Multilingual Question Answering Track at CLEF

Shallow Semantic Annotation of Bulgarian

Osswald, R.

The Representation of German Prepositional Verbs in a Semantically Based Computer Lexicon

Ostendorf, M.

Automatic Term Extraction from Knowledge Bank of Economics

Øvsthus, K.

Ozaku, H.

Building a Lexical Database for an Interactive Joke-Generator

O’mara, D.

Pacheco, E.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Pado, S.

SALTO - A Versatile Multi-Level Annotation Tool

Shalmaneser - A Toolchain For Shallow Semantic Parsing

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

Padró, L.

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Padró, M.

FreeLing 1.3: Syntactic and semantic services in an open-source NLP library

Paetzold, J.

Language Specific and Topic Focused Web Crawling

Paggio, P.

Annotating Information Structure in a Corpus of Spoken Danish

Pain, H.

Building a Lexical Database for an Interactive Joke-Generator

Pajas, P.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Palm, G.

Palmer, M.

Extending VerbNet with Novel Verb Classes

Palumbo, G.

Ontological and Terminological Commitments and the Discourse of Specialist Communities

Pan, F.

An Annotated Corpus of Typical Durations of Events

Paoli, M.

Dialectal resources on-line: the ALT-Web experience

Papageorgiou, H.

Multi-domain Multi-lingual Named Entity Recognition: Revisiting & Grounding the resources issue

Adding multi-layer semantics to the Greek Dependency Treebank

Pardelli, G.

Natural Language Processing: A Terminological and Statistical Approach

Paroubek, P.

Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.

Passonneau, R.

Measuring Agreement on Set-valued Items (MASI) for Semantic and Pragmatic Annotation

Inter-annotator Agreement on a Multilingual Semantic Annotation Task

CLiMB ToolKit: A Case Study of Iterative Evaluation in a Multidisciplinary Project

Pastra, K.

Beyond Multimedia Integration: corpora and annotations for cross-media decision mechanisms

Patel, A.

Patry, A.

MOOD: A Modular Object-Oriented Decoder for Statistical Machine Translation

Paulsson, N.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Paumier, S.

A translated corpus of 30,000 French SMS

A framework for real-time dictionary updating

Pauw, G.

KNACK-2002: a Richly Annotated Corpus of Dutch Written Text

A Grapheme-Based Approach for Accent Restoration in Gikuyu

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Pazienza, M.

Mixing WordNet, VerbNet and PropBank for studying verb relations

Pease, A.

Ontology Driven K-Portal Construction and K-Service Provision

Pechsiri, C.

Pecina, P.

Semi-automatic Building of Swedish Collocation Lexicon

Pedersen, B.

Query Expansion on Compounds

Pekar, V.

Automatic Detection of Orthographic Cues for Cognate Recognition

Pellegrini, T.

Experimental detection of vowel pronunciation variants in Amharic

Pennacchiotti, M.

Mixing WordNet, VerbNet and PropBank for studying verb relations

Pereira, L.

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Pereira, A.

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Permpool, T.

Ontology Driven K-Portal Construction and K-Service Provision

Persson, E.

CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

Pet, M.

Lexical Markup Framework (LMF)

Petel, S.

The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus

Peters, C.

The Impact of Evaluation on Multilingual Information Retrieval System Development

Peters, W.

Creating Tools for Morphological Analysis of Sumerian

The LOIS Project

Petersen, U.

Querying Both Parallel And Treebank Corpora: Evaluation Of A Corpus Query System

Petit-renaud, S.

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions

Peñas, A.

Pianta, E.

A novel Textual Encoding paradigm based on Semantic Web tools and semantics

Piazza, F.

Picchi, E.

Dialectal resources on-line: the ALT-Web experience

Next Generation Language Resources using Grid

Pietrala, D.

General and Task-Specific Corpus Resources for Polish Adult Learners of English

Pinto, A.

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

Piperidis, S.

Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology

Pirrelli, V.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Searching treebanks for functional constraints: cross-lingual experiments in grammatical relation assignment

Piskorski, J.

Temporality in relation with discourse structure

Pistol, I.

Pitel, G.

A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

Plank, B.

Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

Pociello, E.

A methodology for the joint development of the Basque WordNet and Semcor

A Preliminary Study for Building the Basque PropBank

Podvesky, P.

Semi-automatic Building of Swedish Collocation Lexicon

Poesio, M.

Exploiting Parallel Corpora for Supervised Word-Sense Disambiguation in English-Hungarian Machine Translation

Pohl, G.

Poietti, D.

The DiaCORIS project: a diachronic corpus of written Italian

Polajnar, T.

User-friendly ontology authoring using a controlled language

Polifroni, J.

Learning Database Content for Spoken Dialogue System Design

Polin, Z.

Structure, Annotation and Tools in the Basque ZT Corpus

Poller, P.

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Pollák, P.

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD

Pomikálek, J.

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

Popescu-belis, A.

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

TQB: Accessing Multimodal Data Using a Transcript-based Query and Browsing Interface

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

Popovic, M.

POS-based Word Reorderings for Statistical Machine Translation

Poprat, M.

Language Specific and Topic Focused Web Crawling

Porcelijn, T.

Identifying Named Entities in Text Databases from the Natural History Domain

Postolache, O.

Annotation Guidelines for Czech-English Word Alignment

Transferring Coreference Chains through Word Alignment

Pouli, K.

Adding multi-layer semantics to the Greek Dependency Treebank

Pouliquen, B.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

Povlsen, C.

EuroTermBank - a Terminology Resource based on Best Practice

The MULINCO corpus and corpus platform

Pražák, A.

Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

Pretorius, L.

Towards machine-readable lexicons for South African Bantu languages

Prinsloo, D.

Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho

Prodanof, I.

Annotating Bridging Anaphors in Italian: in Search of Reliability

Prokopidis, P.

Adding multi-layer semantics to the Greek Dependency Treebank

Prokopova, M.

Czech-English Word Alignment

Protopapas, A.

All Greek to me! An automatic Greeklish to Greek transliteration system

Przybocki, M.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Edit Distance: A Metric for Machine Translation Evaluation

Psutka, J.

Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

Puliti, P.

A novel Textual Encoding paradigm based on Semantic Web tools and semantics

Puscasu, G.

If “it” were “then”, then when was “it”? Establishing the anaphoric role of “then”

Pustejovsk, J.

SlinkET: A Partial Modal Parser for Events

Pustejovsky, J.

Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

Annotation of Temporal Relations with Tango

BULB: A Unified Lexical Browser

Pérez, A.

Multilingual Lexical Semantic Resources for Ontology Translation

Pérez, J.

ECESS Inter-Module Interface Specification for Speech Synthesis

Acceptance Testing of a Spoken Language Translation System

GAIA: Common Framework for the Development of Speech Translation Technologies

Pîrvan, F.

Tagset Mapping and Statistical Training Data Cleaning-up

Qin, L.

A Study on Terminology Extraction Based on Classified Corpora

Quasthoff, U.

Corpus Portal for Search in Monolingual Corpora

Quignard, M.

A Deep-Parsing Approach to Natural Language Understanding in Dialogue System: Results of a Corpus-Based Evaluation

Results of the French Evalda-Media evaluation campaign for literal understanding

Quintian, M.

A methodology for the joint development of the Basque WordNet and Semcor

Raake, A.

US-based Method for Speech Reception Threshold Measurement in French

A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign

Racioppa, S.

Ontology-based Information Extraction with SOBA

Radde, N.

Multiple Dimension Levenshtein Edit Distance Calculations for Evaluating Automatic Speech Recognition Systems During Simultaneous Speech

Radev, D.

Ragheb, A.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Raidt, S.

Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User’s Attention to Points of Interest?

Rajman, M.

X-Score: Automatic Evaluation of Machine Translation Grammaticality

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Extending the Wizard of Oz Methodologie for Multimodal Language-enabled Systems

Rambow, O.

Developing and Using a Pilot Dialectal Arabic Treebank

Parallel Syntactic Annotation of Multiple Languages

Inter-annotator Agreement on a Multilingual Semantic Annotation Task

Ramos, M.

Local Document Relevance Clustering in IR Using Collocation Information

Rapp R.

Example-Based Machine Translation Using a Dictionary of Word Pairs

Rapti, S.

All Greek to me! An automatic Greeklish to Greek transliteration system

Rashwan, M.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Ratsch, E.

Finding the Appropriate Generalization Level for Binary Ontological Relations Extracted from the Genia Corpus

Rayner, M.

REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

Rebrus, P.

Morphdb.hu: Hungarian lexical database and morphological grammar

Reeder, F.

Parallel Syntactic Annotation of Multiple Languages

Reidsma, D.

Annotating Emotions in Meetings

Reigem, Ø.

Developing a re-usable web-demonstrator for automatic anaphora resolution with support for manual editing of coreference chains

Renso, C.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Reyle, U.

Identifying and Classifying Terms in the Life Sciences: The Case of Chemical Terminology

Reynaert, M.

Corpus-Induced Corpus Clean-up

Richard, G.

Fear-type emotions of the SAFE Corpus: annotation issues

Richter, M.

Corpus Portal for Search in Monolingual Corpora

Rieser, V.

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Rijke, M.

Building a Lexical Database for an Interactive Joke-Generator

Ritchie, G.

Ritz, J.

Extraction tools for collocations and their morphosyntactic specificities

Roark, B.

From Natural Language to Databases via Ontologies

Robaldo, L.

Robba, I.

Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.

FRASQUES: A Question Answering system in the EQueR evaluation campaign

Rodríguez, H.

Improving coverage and parsing quality of a large-scale LFG for German

Rohrer, C.

Rojas, D.

Predicting MT Quality as a Function of the Source Language

Rojc, M.

Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems

Romacker, M.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Romanelli, M.

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Romano, L.

Representing Linguistic Corpora and Their Annotations

Foundations of Modern Language Resource Archives

Metadata Profile in the ISO Data Category Registry

An API for accessing the Data Category Registry

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Romary, L.

Rosset, S.

Results of the French Evalda-Media evaluation campaign for literal understanding

The Ritel Corpus - An annotated Human-Machine open-domain question answering spoken dialog corpus

Rosso, P.

Mining Knowledge fromWikipedia for the Question Answering task

Roussanaly, A.

A Lexicalized Tree-Adjoining Grammar for Vietnamese

Roventini, A.

Using Core Ontology for Domain Lexicon Structuring

Linking Verbal Entries of Different Lexical Resources

Ruimy, N.

Merging two Ontology-based Lexical Resources

Structuring a Domain Vocabulary in a General Knowledge Environment

Ruiz, N.

Tangible Objects for the Acquisition of Multimodal Interaction Patterns

Ruiz-casado, M.

The wraetlic NLP suite

Rumshisky, A.

Inducing Sense-Discriminating Context Patterns from Sense-Tagged Corpora

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

Rung, A.

Morphdb.hu: Hungarian lexical database and morphological grammar

Rus, V.

The Look and Feel of a Confident Entailer

Russel, A.

ANNEX - a web-based Framework for Exploiting Annotated Media Resources

ELAN: a Professional Framework for Multimodality Research

Ontology-based Language Archive Utilization

Ryant, N.

Extending VerbNet with Novel Verb Classes

Ryu, P.

Compiling large language resources using lexical similarity metrics for domain taxonomy learning

Sacaleanu, B.

Multimedia Database of Meetings and Informal Interactions for Tracking Participant Involvement and Discourse Flow

Sadanobu, T.

Sagara, K.

Multilingual Multidocument Summarization Tools and Evaluation

Language Resources for Background Gathering

Saggion, H.

Sagot, B.

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

Deep non-probabilistic parsing of large corpora

Sagri, M.

The LOIS Project

Sahlgren, M.

Towards pertinent evaluation methodologies for word-space models

Sailer, M.

The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research

Saint-dizier, P.

PrepNet: a Multilingual Lexical Description of Prepositions

Saiz, R.

Structure, Annotation and Tools in the Basque ZT Corpus

Salmenkivi, M.

Finding representative sets of dialect words for geographical regions

Salmon-alt, S.

Automatic extraction of subcategorization frames for French

Salvador, L.

CoGrOO: a Brazilian-Portuguese Grammar Checker based on the CETENFOLHA Corpus

Sameti, H.

Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems

Samy, D.

Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

Sanders, G.

Edit Distance: A Metric for Machine Translation Evaluation

Sandoval, A.

Building a Parallel Multilingual Corpus (Arabic-Spanish-English)

Sandrelli, A.

Tagging a Corpus of Interpreted Speeches: the European Parliament Interpreting Corpus (EPIC)

Sannikov, A.

A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects

Sansò, A.

Documenting variation across Europe and the Mediterranean: the Pavia Typological Database

Santos, D.

Annotating COMPARA, a Grammar-aware Parallel Corpus

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

HAREM: An Advanced NER Evaluation Contest for Portuguese

The Multilingual Question Answering Track at CLEF

Santos, R.

The African Varieties of Portuguese: Compiling Comparable Corpora and Analyzing Data-Derived Lexicon

Saraki, M.

Building Carefully Tagged Bilingual Corpora to Cope with Linguistic Idiosyncrasy

Saratxaga, I.

Designing and Recording an Emotional Speech Database for Corpus Based Synthesis in Basque

Sarmento, L.

Corpógrafo V3 - From Terminological Aid to Semi-automatic Knowledge Engineering

Component Evaluation in a Question Answering System

BACO - A large database of text and co-occurrences

Saruwatari, H.

Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria

Sasaki, F.

Work within the W3C Internationalization Activity and its Benefit for the Creation and Manipulation of Language Resources

Sassi, M.

Next Generation Language Resources using Grid

Natural Language Processing: A Terminological and Statistical Approach

Sassolini, E.

Dialectal resources on-line: the ALT-Web experience

Next Generation Language Resources using Grid

Sauri, R.

SlinkET: A Partial Modal Parser for Events

Savy, R.

Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).

Scalise, S.

MORBO/COMP: a multilingual database of compound words

Schaden, S.

“Casselberveetovallarga” and other Unpronounceable Places: The CrossTowns Corpus

Evaluation of Automatically Generated Transcriptions of Non-Native Pronunciations using a Phonetic Distance Measure

Schehl, J.

The SAMMIE Corpus of Multimodal Dialogues with an MP3 Player

Scherer, S.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Schiehlen, M.

The Mass-Count Distinction: Acquisition and Disambiguation

Schiel, F.

Bikers Accessing the Web: The SmartWeb Motorbike Corpus

SmartWeb UMTS Speech Data Collection: The SmartWeb Handheld Corpus

Schiffrin, A.

Methodological Aspects of Semantic Annotation

Schiller, M.

A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material

Schlesinger, P.

Semi-automatic Building of Swedish Collocation Lexicon

Schlyter, S.

CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

Schrader, B.

Non-probabilistic alignment of rare German and English nominal expressions

Schrader, N.

LexikoNet - a lexical database based on type and role hierarchies

Schreiber, G.

Conversion of WordNet to a standard RDF/OWL representation

Schulte, S.

Human Verb Associations as the Basis for Gold Standard Verb Classes: Validation against GermaNet and FrameNet

Schulz, S.

Semantic Atomicity and Multilinguality in the Medical Domain: Design Considerations for the MorphoSaurus Subword Lexicon

Language Specific and Topic Focused Web Crawling

Schuurman, I.

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

Syntactic Annotation of Large Corpora in STEVIN

Schuurman, I.

METIS-II: Machine Translation for Low Resource Languages

Schwenker, F.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Schäfer, U.

Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components

OntoNERdIE -- Mapping and Linking Ontologies to Named Entity Recognition and Information Extraction Resources

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Sebastiani, F.

SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining

Seco, N.

HAREM: An Advanced NER Evaluation Contest for Portuguese

Segouat, J.

Sign Language corpus analysis: Synchronisation of linguistic annotation and numerical data

Seidenari, C.

The DiaCORIS project: a diachronic corpus of written Italian

POS tagset design for Italian

Semecký, J.

On Automatic Assignment of Verb Valency Frames in Czech

Semmar, N.

Evaluation of multilingual text alignment systems: the ARCADE II project

A Deep Linguistic Analysis for Cross-language Information Retrieval

Senda, Y.

Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing

Senn, J.

Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

Serafini, L.

The role of lexical resources in matching classification schemas

Servan, C.

Results of the French Evalda-Media evaluation campaign for literal understanding

Severinson, K.

Developing a ContextualizedMultimodal Corpus for Human-Robot InteractionKeywords: Multimodal corpus, Human-Robot Interaction, Spatiality,Dialogue systems, Gesture

Sgall, P.

Corpus Annotation as a Test of a Linguistic Theory

Shafran, I.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Shahin, M.

Sharoff, S.

Using collocations from comparable corpora to find translation equivalents

A Uniform Interface to Large-Scale Linguistic Resources

Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language

Shehata, A.

A Dependency-based Algorithm for Grammar Conversion

Sheridan, P.

Design, Construction and Validation of an Arabic-English Conceptual Interlingua for Cross-lingual Information Retrieval

Shikano, K.

Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria

Long-term Analysis of Prosodic Features of Spoken Guidance System User Speech

Shima, H.

Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

Siddharthan, A.

Parallel Syntactic Annotation of Multiple Languages

Siegel, M.

Ontology-based Information Extraction with SOBA

Silva, J.

Open Resources and Tools for the Shallow Processing of Portuguese: The TagShare Project

Simon, E.

Morphdb.hu: Hungarian lexical database and morphological grammar

Simov, K.

Shallow Semantic Annotation of Bulgarian

Simões, A.

T2O - Recycling Thesauri into a Multilingual Ontology

Sin, K.

Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort

Sinohara, Y.

Automatic Terminology Intelligibility Estimation for Readership-oriented Technical Writing

Sinopalnikova, A.

Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements

Sitbon, L.

Tools and methods for objective or contextual evaluation of topic segmentation

Sizov, V.

A Syntactically and Semantically Tagged Corpus of Russian: State of the Art and Prospects

Sjöbergh, J.

Towards Holistic Summarization -- Selecting Summaries, Not Sentences

Skiba, R.

Comparison of Resource Discovery Methods

LAMUS: the Language Archive Management and Upload System

Skwarska, K.

Valency Lexicon of Czech Verbs: Alternation-Based Model

Slavcheva, M.

Semantic Descriptors: The Case of Reflexive Verbs

Sloane, Z.

Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages

Sloetjes, H.

ELAN: a Professional Framework for Multimodality Research

Smaïli, K.

Linguistic features modeling based on Partial New Cache

Smits, F.

JASMIN-CGN: Extension of the Spoken Dutch Corpus with Speech of Elderly People, Children and Non-natives in the Human-Machine Interaction Modality

Smrž, P.

Automatic Acquisition of Semantics-Extraction Patterns

Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator

Intelligent Dictionary Interfaces: Usability Evaluation of Access-Supporting Enhancements

Snover, M.

Retrieving Terminological Data from the TxtCeram Tagged Domain Corpus: A First Step towards a Terminological Ontology

Soler, J.

Soler, V.

Sologaistoa, A.

Structure, Annotation and Tools in the Basque ZT Corpus

Somers, H.

Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali

Sonntag, D.

A Multimodal Result Ontology for Integrated Semantic Web Dialogue Applications

Soria, C.

Sornlertlamvanich, V.

Blind Evaluation for Thai Search Engines

Word Knowledge Acquisition for Computational Lexicon Construction

A Conditional Random Field Framework for Thai Morphological Analysis

Sornprasert, P.

Ontology Driven K-Portal Construction and K-Service Provision

Soroa, A.

Structure, Annotation and Tools in the Basque ZT Corpus

Soudi, A.

IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler

Sousa, M.

Building a historical corpus for Classical Portuguese: some technological aspects

Spadoni, G.

Using Core Ontology for Domain Lexicon Structuring

Speranza, M.

Creation and Use of Lexicons and Ontologies for NL Interfaces to Databases

Spinsanti, L.

Sporleder, C.

Identifying Named Entities in Text Databases from the Natural History Domain

Spranger, K.

The Mass-Count Distinction: Acquisition and Disambiguation

Sprugnoli, R.

Blind Evaluation for Thai Search Engines

Srichaivattana, P.

Stanković, R.

Acquis Communautaire Sentence Alignment using Support Vector Machines

Aligning Multilingual Thesauri

Ştefănescu, D.

Steffen, J.

The pragmatic combination of different crosslingual resources

Stein, D.

A German Sign Language Corpus of the Domain Weather Report

Steinberger, R.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

Stewart, R.

The Affective Weight of Lexicon

Stock, O.

Stolarski, M.

UAM Text Tools - a flexible NLP architecture

Storrer, A.

Automated detection and annotation of term definitions in German text corpora

Strapparava, C.

The Affective Weight of Lexicon

Strassel, S.

Integrated Linguistic Resources for Language Exploitation Technologies

A New Phase in Annotation Tool Development at the Linguistic Data Consortium: The Evolution of the Annotation Graph Toolkit

Linguistic Resources for Speech Parsing

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks

Strauß, P.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Strube, M.

Part-of-Speech Tagging of Transcribed Speech

Strömqvist, S.

Technologies for a Federation of Language Resource Archives

Stuck, F.

Evaluation of multilingual text alignment systems: the ARCADE II project

Subba, R.

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

Suderman, K.

Integrating Linguistic Resources: The American National Corpus Model

Suktarachan, M.

Ontology Driven K-Portal Construction and K-Service Provision

Sutcliffe, R.

On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language

Suzuki, E.

Suzuki, H.

RefRef: A Tool for Viewing and Exploring Coreference Space

Suzuki, T.

On the Web Trilingual Sign Language Dictionary to Learn the foreign Sign Language without Learning a Target Spoken Language

Svendsen, T.

FonDat1: A Speech Synthesis Corpus for Norwegian

Swanenberg, J.

A Unified Structure for Dutch Dialect Dictionary Data

Szarvas, G.

A highly accurate Named Entity corpus for Hungarian

Szklanny, K.

Creation and analysis of a Polish speech database for use in unit selection synthesis

Sá, T.

COMBINA-PT: A Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

Sánchez, A.

MEDUSA: User-Centred Design and usability evaluation of Automatic Speech Recognition telephone services in Telefónica Móviles España

Tabessi, D.

Developing and Using a Pilot Dialectal Arabic Treebank

Tablan, V.

User-friendly ontology authoring using a controlled language

Creating Tools for Morphological Analysis of Sumerian

Taboada, M.

Methods for Creating Semantic Orientation Dictionaries

Tadić, M.

Evaluating Morphosyntactic Tagging of Croatian Texts

Taib, R.

Tangible Objects for the Acquisition of Multimodal Interaction Patterns

Taiichi, H.

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Takanashi, K.

Takeda, K.

Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus

Takenobu, T.

Grammar-based tools for the creation of tagging resources for an unresourced language: the case of Northern Sotho

Taljard, E.

Talley, J.

Bootstrapping New Language ASR Capabilities: Achieving Best Letter-to-Sound Performance under Resource Constraints

Tamburini, F.

The DiaCORIS project: a diachronic corpus of written Italian

POS tagset design for Italian

Tan, T.

A French Non-Native Corpus for Automatic Speech Recognition

Tate, C.

Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding

Tateishi, M.

Analyzing the Effects of Spoken Dialog Systems on Driving Behavior

Tateisi, Y.

Linguistic and Biological Annotations of Biological Interaction Events

Tchobanov, A.

Interoperability of audio corpora : the case of the French corpora

Teixeira, J.

Progmatica: A Prosodic Database for European Portuguese

Tenfjord, K.

The ASK Corpus - a Language Learner Corpus of Norwegian as a Second Language

Terenzi, E.

Building lexical resources for PrincPar, a large coverage parser that generates principled semantic representations

Tesconi, M.

Moving to dynamic computational lexicons with LeXFlow

Teunissen, L.

The Dutch-Flemish HLT Programme STEVIN: Essential Speech and Language Technology Resources

Thamvijit, D.

Ontology Driven K-Portal Construction and K-Service Provision

Theeramunkong, T.

Word Knowledge Acquisition for Computational Lexicon Construction

Thulin, J.

CEFLE and Direkt Profil: a New Computer Learner Corpus in French L2 and a System for Grammatical Profiling

Tian, J.

ECESS Inter-Module Interface Specification for Speech Synthesis

Tiedemann, J.

ISA & ICA - Two Web Interfaces for Interactive Alignment of Bitexts alignment of parallel texts

Timimi, I.

Terminological Resources Acquisition Tools: Toward a User-oriented Evaluation Model

CESTA: First Conclusions of the Technolangue MT Evaluation Campaign

Tiscornia, D.

The LOIS Project

Tobin, R.

Rule-Based Chunking and Reusability

Toda, T.

Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria

Tohyama, H.

Collection of Simultaneous Interpreting Patterns by Using Bilingual Spoken Monologue Corpus

Tomaso, V.

Multilingual Search in Libraries. The case-study of the Free University of Bozen-Bolzano

Tongchim, S.

Blind Evaluation for Thai Search Engines

Tono, Y.

Developing a ContextualizedMultimodal Corpus for Human-Robot InteractionKeywords: Multimodal corpus, Human-Robot Interaction, Spatiality,Dialogue systems, Gesture

Topp, E.

Toumouh, A.

Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain

Traue, H.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Trawinski, B.

The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research

Trilsbeek, P.

LAMUS: the Language Archive Management and Upload System

Trippel, T.

A BLARK extension for temporal annotation mining

Building a historical corpus for Classical Portuguese: some technological aspects

Feature-based Encoding and Querying Language Resources with Character Semantics

Trón, V.

Morphdb.hu: Hungarian lexical database and morphological grammar

Using a morphological analyzer in high precision POS tagging of Hungarian

Tseng, C.

Oriental COCOSDA: Past, Present and Future

Tseng, S.

Discourse functions of duration in Mandarin: resource design and implementation

Tsiakoulis, P.

All Greek to me! An automatic Greeklish to Greek transliteration system

Tsou, B.

Toward a Pan-Chinese Thesaurus

Court Stenography-To-Text (“STT”) in Hong Kong: A Jurilinguistic Engineering Effort

Tsujii, J.

Linguistic and Biological Annotations of Biological Interaction Events

Tufiş, D.

Dependency-Based Phrase Alignment

Acquis Communautaire Sentence Alignment using Support Vector Machines

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

RoCo-News: A Hand Validated Journalistic Corpus of Romanian

Tufi, D.

Tagset Mapping and Statistical Training Data Cleaning-up

Aligning Multilingual Thesauri

Tummarello, G.

A novel Textual Encoding paradigm based on Semantic Web tools and semantics

Uchimoto, K.

Dependency-structure Annotation to Corpus of Spontaneous Japanese

Automatic Detection and Semi-Automatic Revision of Non-Machine-Translatable Parts of a Sentence

Uemlianin, I.

Tools and resources for speech synthesis arising from a Welsh TTS project

Ueyama, M.

Evaluation of Web-based Corpora: Effects of Seed Selection and Time Interval

Ulivieri, M.

Unified Lexicon and Unified Morphosyntactic Specifications for Written and Spoken Italian

Umbert, M.

Spanish Synthesis Corpora

Underwood, N.

A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation

ROTE: A Tool to Support Users in Defining the Relative Importance of Quality Characteristics

Evaluating Symbiotic Systems: the challenge

The Evolution of an Evaluation Framework for a Text Mining System

Uryupina, O.

Coreference Resolution with and without Linguistic Knowledge

Uszkoreit, H.

The pragmatic combination of different crosslingual resources

Utiyama, M.

Getting Deeper Semantics than Berkeley FrameNet with MSFA

Vajda, P.

Morphdb.hu: Hungarian lexical database and morphological grammar

Valitutti, A.

The Affective Weight of Lexicon

Valverde, A.

Structure, Annotation and Tools in the Basque ZT Corpus

Vandeghinste, V.

Transferring PoS-tagging and lemmatization tools from spoken to written Dutch corpus development

Syntactic Annotation of Large Corpora in STEVIN

Vandeghinste, V.

METIS-II: Machine Translation for Low Resource Languages

Vaneyghen, J.

A mixed word / morphological approach for extending CELEX for high coverage on contemporary large corpora

Vanni, M.

Parallel Syntactic Annotation of Multiple Languages

Formal v. Informal: Register-Differentiated Arabic MT Evaluation in the PLATO Paradigm

Vanrullen, T.

Constraint-Based Parsing as an Efficient Solution: Results from the Parsing Evaluation Campaign EASy

Varga, D.

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

Using a morphological analyzer in high precision POS tagging of Hungarian

Varona, A.

Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Vasileiadis, S.

Greek Named Entity Recognition using Support Vector Machines, Maximum Entropy and Onetime

Vasilescu, I.

Fear-type emotions of the SAFE Corpus: annotation issues

Vasiljevs, A.

EuroTermBank - a Terminology Resource based on Best Practice

Vauvert, G.

The ALVIS Format for Linguistically Annotated Documents

Veenendaal, R.

Functioning of the Centre for Dutch Language and Speech Technology

Vela, M.

Generic NLP Tools for Supporting Shallow Ontology Building

Vela, O.

Multilingual Lexical Semantic Resources for Ontology Translation

Verberne, S.

Data for question answering: The case of why

Verdonik, D.

SINOD - Slovenian non-native speech database

Are you ready for a call? - Spontaneous conversations in tourism for speech-to-speech translation systems

Verhagen, M.

SlinkET: A Partial Modal Parser for Events

Towards a Generative Lexical Resource: The Brandeis Semantic Ontology

Annotation of Temporal Relations with Tango

BULB: A Unified Lexical Browser

Vetulani, G.

Syntactic Lexicon of Polish Predicative Nouns

Vetulani, Z.

Syntactic Lexicon of Polish Predicative Nouns

Vide, C.

Example-Based Machine Translation Using a Dictionary of Word Pairs

Vigário, M.

FreP: An electronic tool for extracting frequency information of phonological units from Portuguese written text

Vilain, M.

Leveraging Machine Readable Dictionaries in Discriminative Sequence Models

Vilar, D.

Error Analysis of Statistical Machine Translation Output

Vilela, R.

HAREM: An Advanced NER Evaluation Contest for Portuguese

Villemonte de la Clergerie, E.

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use

Villaneau, J.

Results of the French Evalda-Media evaluation campaign for literal understanding

Vilnat, A.

Data, Annotations and Measures in EASY the Evaluation Campaign for Parsers of French.

FRASQUES: A Question Answering system in the EQueR evaluation campaign

EQueR: the French Evaluation campaign of Question-Answering Systems

Vitas, D.

SKELETON: Specialised knowledge retrieval on the basis of terms and conceptual relations

Vivaldi, J.

Voghera, M.

An observatory on Spoken Italian linguistic resources and descriptive standards.

Vogt, T.

Improving Automatic Emotion Recognition from Speech via Gender Differentiaion

Voll, K.

Methods for Creating Semantic Orientation Dictionaries

Voss, C.

Task-based MT Evaluation: From Who/When/Where Extraction to Event Understanding

Vossen, P.

A Unified Structure for Dutch Dialect Dictionary Data

Vriend, F.

Vázquez, G.

The Sensem Corpus: a Corpus Annotated at the Syntactic and Semantic Level

Véronis, J.

Evaluation of multilingual text alignment systems: the ARCADE II project

Węcel, K.

A Grapheme-Based Approach for Accent Restoration in Gikuyu

Wagacha, P.

Waldron, B.

Preprocessing and Tokenisation Standards in DELPH-IN Tools

Walker, C.

An Efficient Approach to Gold-Standard Annotation: Decision Points for Complex Tasks

Walker, K.

The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research

Low-cost Customized Speech Corpus Creation for Speech Technology Applications

Walker, M.

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives

Learning Database Content for Spoken Dialogue System Design

Waller, A.

Building a Lexical Database for an Interactive Joke-Generator

Walter, R.

Interoperability of audio corpora : the case of the French corpora

Walter, W.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Wandmacher, T.

Training Language Models without Appropriate Language Resources: Experiments with an AAC System for Disabled People

Wang, F.

Towards Unified Chinese Segmentation Algorithm

Evaluation of Stop Word Lists in Chinese Language

Wang, M.

Modular Approach to Error Analysis and Evaluation for Multilingual Question Answering

Wang, X.

TC-STAR:Specifications of Language Resources and Evaluation for Speech Synthesis

Wanner, L.

Local Document Relevance Clustering in IR Using Collocation Information

Watanabe, M.

Automatic Construction of Japanese WordNet

Weidenbacher, U.

Wizard-of-oz Data Collection for Perception and Interaction in Multi-user Environments

Weissenbacher, D.

The ALVIS Format for Linguistically Annotated Documents

Wellinghoff, S.

Automated detection and annotation of term definitions in German text corpora

Wellner, B.

Leveraging Machine Readable Dictionaries in Discriminative Sequence Models

Wenjie, L.

A Study on Terminology Extraction Based on Classified Corpora

Westerhout, E.

A pilot study for a Corpus of Dutch Aphasic Speech (CoDAS)

Widdows, D.

Collaborative Annotation that Lasts Forever: Using Peer-to-Peer Technology for Disseminating Corpora and Language Resources

The Information Commons Gazetteer

Ongoing Developments in Automatically Adapting Lexical Resources to the Biomedical Domain

Widiger, A.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

The JRC-Acquis: A Multilingual Aligned Parallel Corpus with 20+ Languages

Wieloc, K.

An Incremental Tri-Partite Approach To Ontology Learning

A Closer Look at Skip-gram Modelling

Evaluating Automatically Generated Timelines from the Web

Wilks, Y.

Williams, B.

Tools and resources for speech synthesis arising from a Welsh TTS project

Wintner, S.

A Computational Lexicon of Contemporary Hebrew

Witt, A.

Exploiting logical document structure for anaphora resolution

Wittenburg, P.

Comparison of Resource Discovery Methods

ELAN: a Professional Framework for Multimodality Research

LAMUS: the Language Archive Management and Upload System

Foundations of Modern Language Resource Archives

LEXUS, a web-based tool for manipulating lexical resources lexicon, standards, tool

Metadata Profile in the ISO Data Category Registry

Technologies for a Federation of Language Resource Archives

Ontology-based Language Archive Utilization

An API for accessing the Data Category Registry

Wolska, M.

A corpus of tutorial dialogs on theorem proving; the influence of the presentation of the study-material

Wong, M.

Skeleton Parsing in Chinese: Annotation Scheme and Guidelines

Wong., K.

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

Wrede, B.

BITT: A Corpus for Topic Tracking Evaluation on Multimodal Human-Robot-Interaction

Wu, A.

A Hebrew Tree Bank Based on Cantillation Marks

Wypych, M.

General and Task-Specific Corpus Resources for Polish Adult Learners of English

Xia, Y.

Constructing A Chinese Chat Language Corpus with A Two-Stage Incremental Annotation Approach

Xu, F.

The pragmatic combination of different crosslingual resources

Xu, J.

Error Analysis of Statistical Machine Translation Output

Xu, R.

Interaction between Lexical Base and Ontology with Formal Concept Analysis

The Design and Construction of A Chinese Collocation Bank

Xue, N.

Annotating the Predicate-Argument Structure of Chinese Nominalizations

Yakushiji, A.

Linguistic and Biological Annotations of Biological Interaction Events

Yamaguchi, Y.

Layered Speech-Act Annotation for Spoken Dialogue Corpus

Yamamoto, E.

Yang, J.

Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages

Yang, Q.

Development of a phoneme-to-phoneme (p2p) converter to improve the grapheme-to-phoneme (g2p) conversion of names

Yaseen, M.

Building Annotated Written and Spoken Arabic LRs in NEMLAR Project

Yingsaeree, C.

Ontology Driven K-Portal Construction and K-Service Provision

Yirong, C.

A Study on Terminology Extraction Based on Classified Corpora

Yona, S.

A Computational Lexicon of Contemporary Hebrew

Yung, L.

Valency Lexicon of Czech Verbs: Alternation-Based Model

Towards a Slovene Dependency Treebank

Žabokrtsky, Z.

Zaghouani, W.

Geocoding Multilingual Texts: Recognition, Disambiguation and Visualisation

Evaluation of multilingual text alignment systems: the ARCADE II project

Zahedi, M.

A German Sign Language Corpus of the Domain Weather Report

Zanobini, S.

The role of lexical resources in matching classification schemas

Zanzotto, F.

Mixing WordNet, VerbNet and PropBank for studying verb relations

A Dependency-based Algorithm for Grammar Conversion

Žele, A.