Abstract: MATH/CHEM/COMP 2002, Dubrovnik,
June 24-29, 2002
|
Modeling
distribution of constituent
structure tree for natural
language
Bozidar Tepes Department
of Information Science, Faculty of Philosophy, University of Zagreb, I. Lucica
3, HR-10000 Zagreb, Croatia This paper
describes distribution of constituent structure trees of natural language by
using Bayesian networks. First part of the paper describes probabilistic
context- free grammar (PCFG) together with head-driven phrase structure
grammar (HPSG). On these theoretical ideas for modeling formal and natural
languages, two Bayesian networks were modeled. First Bayesian network is
based on directed acyclic graph (DAG). Second network is based on
probabilities from hidden Markov model (HMM) of hidden maximal level
structure tree for sentences of natural language. Two networks were tested on
Database of grammatical sentences of Croatian language (http://infoz.ffzg.hr/tepes/). |