Automatic recognition of the function of singular neuter pronouns in texts and spoken data
Research output: Contribution to journal › Conference article › Research › peer-review
We describe the results of unsupervised (clustering) and supervised (classification) learning experiments with the purpose of recognising the function of singular neuter pronouns in Danish corpora of written and spoken language. Danish singular neuter pronouns comprise personal and demonstrative pronouns. They are very frequent and have many functions such as non-referential, cataphoric, deictic and anaphoric. The antecedents of discourse anaphoric singular neuter pronouns can be nominal phrases of different gender and number, verbal phrases, adjectival phrases, clauses or discourse segments of different size and they can refer to individual and abstract entities. Danish neuter pronouns occur in more constructions and have different distributions than the corresponding English pronouns it, this and that. The results of the classification experiments show a significant improvement of the performance with respect to the baseline in all types of data. The best results were obtained on text data, while the worst results were achieved on free-conversational, multi-party dialogues.
Original language | English |
---|---|
Journal | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
Pages (from-to) | 15-28 |
Number of pages | 14 |
ISSN | 0302-9743 |
DOIs | |
Publication status | Published - 2009 |
Event | 7th Discourse Anaphora and Anaphor Resolution Colloquium, DAARC 2009 - Goa, India Duration: 5 Nov 2009 → 6 Nov 2009 |
Conference
Conference | 7th Discourse Anaphora and Anaphor Resolution Colloquium, DAARC 2009 |
---|---|
Country | India |
City | Goa |
Period | 05/11/2009 → 06/11/2009 |
- Annotation, Individual and Abstract anaphora, Machine learning, Pronominal functions, Singular neuter pronouns, Text and Spoken corpora
Research areas
ID: 273031295