Constructing linguistically motivated structures from statistical grammars
Publikation: Bidrag til tidsskrift › Konferenceartikel › Forskning › fagfællebedømt
This paper discusses two Hidden Markov Models (HMM) for linking linguistically motivated XTAG grammar and the automatically extracted LTAG used by MICA parser. The former grammar is a detailed LTAG enriched with feature structures. And the latter one is a huge size LTAG that due to its statistical nature is well suited to be used in statistical approaches. Lack of an efficient parser and sparseness in the supertags set are the main obstacles in using XTAG and MICA grammars respectively. The models were trained by the standard HMM training algorithm, Baum-Welch. To converge the training algorithm to a better local optimum, the initial state of the models also were estimated using two semi-supervised EM-based algorithms. The resulting accuracy of the model (about 91%) shows that the models can provide a satisfactory way for linking these grammars to share their capabilities together.
Originalsprog | Engelsk |
---|---|
Tidsskrift | International Conference Recent Advances in Natural Language Processing, RANLP |
Sider (fra-til) | 63-69 |
Antal sider | 7 |
ISSN | 1313-8502 |
Status | Udgivet - 2011 |
Begivenhed | 8th International Conference on Recent Advances in Natural Language Processing, RANLP 2011 - Hissar, Bulgarien Varighed: 12 sep. 2011 → 14 sep. 2011 |
Konference
Konference | 8th International Conference on Recent Advances in Natural Language Processing, RANLP 2011 |
---|---|
Land | Bulgarien |
By | Hissar |
Periode | 12/09/2011 → 14/09/2011 |
ID: 366047680