Constructing linguistically motivated structures from statistical grammars

Department of Nordic Studies and Linguistics (NorS)

Constructing linguistically motivated structures from statistical grammars

Research output: Contribution to journal › Conference article › Research › peer-review

Basirat, Ali
Heshaam Faili

This paper discusses two Hidden Markov Models (HMM) for linking linguistically motivated XTAG grammar and the automatically extracted LTAG used by MICA parser. The former grammar is a detailed LTAG enriched with feature structures. And the latter one is a huge size LTAG that due to its statistical nature is well suited to be used in statistical approaches. Lack of an efficient parser and sparseness in the supertags set are the main obstacles in using XTAG and MICA grammars respectively. The models were trained by the standard HMM training algorithm, Baum-Welch. To converge the training algorithm to a better local optimum, the initial state of the models also were estimated using two semi-supervised EM-based algorithms. The resulting accuracy of the model (about 91%) shows that the models can provide a satisfactory way for linking these grammars to share their capabilities together.

Original language	English
Journal	International Conference Recent Advances in Natural Language Processing, RANLP
Pages (from-to)	63-69
Number of pages	7
ISSN	1313-8502
Publication status	Published - 2011
Event	8th International Conference on Recent Advances in Natural Language Processing, RANLP 2011 - Hissar, Bulgaria Duration: 12 Sep 2011 → 14 Sep 2011

Conference

Conference	8th International Conference on Recent Advances in Natural Language Processing, RANLP 2011
Country	Bulgaria
City	Hissar
Period	12/09/2011 → 14/09/2011

ID: 366047680