Back to Main Conference 2006
LREC 2006main

A Syntactically Annotated Corpus of Japanese Spoken Monologue

Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)

DOI:10.63317/45zizet5kmcv

Abstract

Recently, monologue data such as lecture and commentary by professionals have been considered as valuable intellectual resources, and have been gathering attention. On the other hand, in order to use these monologue data effectively and efficiently, it is necessary for the monologue data not only just to be accumulated but also to be structured. This paper describes the construction of a Japanese spoken monologue corpus in which dependency structure is given to each utterance. Spontaneous monologue includes a lot of very long sentences composed of two or more clauses. In these sentences, there may exist the subject or the adverb common to multi-clauses, and it may be considered that the subject or adverb depend on multi-predicates. In order to give the dependency information in a real fashion, our research allows that a bunsetsu depends on multiple bunsetsus.

Details

Paper ID
lrec2006-main-056
Pages
N/A
BibKey
ohno-etal-2006-syntactically
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-2-4
Conference
Fifth International Conference on Language Resources and Evaluation
Location
Genoa, Italy
Date
24 May 2006 26 May 2006

Authors

  • TO

    Tomohiro Ohno

  • SM

    Shigeki Matsubara

  • HK

    Hideki Kashioka

  • NK

    Naoto Kato

  • YI

    Yasuyoshi Inagaki

Links