An unsupervised approach for linking automatically extracted and manually crafted LTAGs

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Though the lack of semantic representation of automatically extracted LTAGs is an obstacle in using these formalism, due to the advent of some powerful statistical parsers that were trained on them, these grammars have been taken into consideration more than before. Against of this grammatical class, there are some widely usage manually crafted LTAGs that are enriched with semantic representation but suffer from the lack of efficient parsers. The available representation of latter grammars beside the statistical capabilities of former encouraged us in constructing a link between them. Here, by focusing on the automatically extracted LTAG used by MICA [4] and the manually crafted English LTAG namely XTAG grammar [32], a statistical approach based on HMM is proposed that maps each sequence of former elementary trees onto a sequence of later elementary trees. To avoid of converging the HMM training algorithm in a local optimum state, an EM-based learning process for initializing the HMM parameters were proposed too. Experimental results show that the mapping method can provide a satisfactory way to cover the deficiencies arises in one grammar by the available capabilities of the other.

Original languageEnglish
Title of host publicationComputational Linguistics and Intelligent Text Processing - 12th International Conference, CICLing 2011, Proceedings
Number of pages14
Publication date2011
EditionPART 1
Pages68-81
ISBN (Print)9783642193996
DOIs
Publication statusPublished - 2011
Event12th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2011 - Tokyo, Japan
Duration: 20 Feb 201126 Feb 2011

Conference

Conference12th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2011
LandJapan
ByTokyo
Periode20/02/201126/02/2011
SeriesLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume6608 LNCS
ISSN0302-9743

    Research areas

  • Automatically Extracted Tree Adjoining Grammar, Grammar Mapping, HMM Initialization, MICA, Semantic Representation, Supertagging, XTAG Derivation Tree

ID: 366047966