Back to Main Conference 2004
LREC 2004main

Language Modeling Using Dynamic Bayesian Networks

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/45b8hz5tzn26

Abstract

In this paper we propose a new approach to language modeling based on dynamic Bayesian networks. The principle idea of our approach is to find the dependence relations between variables that represent different linguistic units (word, class, concept, ...) that constitutes a language model. In the context of this paper the linguistic units that we consider are syntactic classes and words. Our approach shou ld not be considered as a model combination technique. Rather, it is an original and coherent methodology that processes words and classes in the same model. We attempt to identify and model the dependence of words and classes on their linguistic context. Our ultimate goal is to devise an automatic mechanism that extracts the best dependence relations between a word and its context, i.e., lexical and syntactic. Preliminary results are very encouraging, in particular the model obtained with a Bayesian network where a word depends not only on previous word but also on syntactic classes of two previous words outperforms the bi-gram model.

Details

Paper ID
lrec2004-main-404
Pages
N/A
BibKey
deviren-etal-2004-language
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • MD

    Murat Deviren

  • KD

    Khalid Daoudi

  • KS

    Kamel Smaïli

Links