A Thesaurus-based Sentiment Lexicon for Danish: The Danish Sentiment Lexicon

Research output: Chapter in Book/Report/Conference proceeding; Article in proceedings; Research; peer-review

Standard

A Thesaurus-based Sentiment Lexicon for Danish : The Danish Sentiment Lexicon. / Nimb, Sanni; Olsen, Sussi; Pedersen, Bolette Sandford; Troelsgaard, Thomas.

Proceedings of the Language Resources and Evaluation Conference: LREC2022. Marseille : European Language Resources Association, 2022. p. 2826--2832.

Harvard

Nimb, S, Olsen, S, Pedersen, BS & Troelsgaard, T 2022, A Thesaurus-based Sentiment Lexicon for Danish: The Danish Sentiment Lexicon. in Proceedings of the Language Resources and Evaluation Conference: LREC2022. European Language Resources Association, Marseille, pp. 2826--2832. <http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.302.pdf>

APA

Nimb, S., Olsen, S., Pedersen, B. S., & Troelsgaard, T. (2022). A Thesaurus-based Sentiment Lexicon for Danish: The Danish Sentiment Lexicon. In Proceedings of the Language Resources and Evaluation Conference: LREC2022 (pp. 2826--2832). European Language Resources Association. http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.302.pdf

Vancouver

Nimb S, Olsen S, Pedersen BS, Troelsgaard T. A Thesaurus-based Sentiment Lexicon for Danish: The Danish Sentiment Lexicon. In Proceedings of the Language Resources and Evaluation Conference: LREC2022. Marseille: European Language Resources Association. 2022. p. 2826--2832

Author

Nimb, Sanni ; Olsen, Sussi ; Pedersen, Bolette Sandford ; Troelsgaard, Thomas. / A Thesaurus-based Sentiment Lexicon for Danish : The Danish Sentiment Lexicon. Proceedings of the Language Resources and Evaluation Conference: LREC2022. Marseille : European Language Resources Association, 2022. pp. 2826--2832

Bibtex

@inproceedings{d6d32ca7c8664cddadf60d349623108f,
title = "A Thesaurus-based Sentiment Lexicon for Danish: The Danish Sentiment Lexicon",
abstract = "This paper describes how a newly published Danish sentiment lexicon with a high lexical coverage was compiled by use of lexicographic methods and based on the links between groups of words listed in semantic order in a thesaurus and the corresponding word sense descriptions in a comprehensive monolingual dictionary. The overall idea was to identify negative and positive sections in a thesaurus, extract the words from these sections and combine them with the dictionary information via the links. The annotation task of the dataset included several steps, and was based on the comparison of synonyms and near synonyms within a semantic field. In the cases where one of the words were included in the smaller Danish sentiment lexicon AFINN, its value there was used as inspiration and expanded to the synonyms when appropriate. In order to obtain a more practical lexicon with overall polarity values at lemma level, all the senses of the lemma were afterwards compared, taking into consideration dictionary information such as usage, style and frequency. The final lexicon contains 13,859 Danish polarity lemmas and includes morphological information. It is freely available at https://github.com/dsldk/danish-sentiment-lexicon (licence CC-BY-SA 4.0 International).",
author = "Sanni Nimb and Sussi Olsen and Pedersen, {Bolette Sandford} and Thomas Troelsgaard",
year = "2022",
month = jun,
day = "20",
language = "English",
pages = "2826--2832",
booktitle = "Proceedings of the Language Resources and Evaluation Conference",
publisher = "European Language Resources Association",

}

RIS

TY - GEN

T1 - A Thesaurus-based Sentiment Lexicon for Danish

T2 - The Danish Sentiment Lexicon

AU - Nimb, Sanni

AU - Olsen, Sussi

AU - Pedersen, Bolette Sandford

AU - Troelsgaard, Thomas

PY - 2022/6/20

Y1 - 2022/6/20

N2 - This paper describes how a newly published Danish sentiment lexicon with a high lexical coverage was compiled by use of lexicographic methods and based on the links between groups of words listed in semantic order in a thesaurus and the corresponding word sense descriptions in a comprehensive monolingual dictionary. The overall idea was to identify negative and positive sections in a thesaurus, extract the words from these sections and combine them with the dictionary information via the links. The annotation task of the dataset included several steps, and was based on the comparison of synonyms and near synonyms within a semantic field. In the cases where one of the words were included in the smaller Danish sentiment lexicon AFINN, its value there was used as inspiration and expanded to the synonyms when appropriate. In order to obtain a more practical lexicon with overall polarity values at lemma level, all the senses of the lemma were afterwards compared, taking into consideration dictionary information such as usage, style and frequency. The final lexicon contains 13,859 Danish polarity lemmas and includes morphological information. It is freely available at https://github.com/dsldk/danish-sentiment-lexicon (licence CC-BY-SA 4.0 International).

AB - This paper describes how a newly published Danish sentiment lexicon with a high lexical coverage was compiled by use of lexicographic methods and based on the links between groups of words listed in semantic order in a thesaurus and the corresponding word sense descriptions in a comprehensive monolingual dictionary. The overall idea was to identify negative and positive sections in a thesaurus, extract the words from these sections and combine them with the dictionary information via the links. The annotation task of the dataset included several steps, and was based on the comparison of synonyms and near synonyms within a semantic field. In the cases where one of the words were included in the smaller Danish sentiment lexicon AFINN, its value there was used as inspiration and expanded to the synonyms when appropriate. In order to obtain a more practical lexicon with overall polarity values at lemma level, all the senses of the lemma were afterwards compared, taking into consideration dictionary information such as usage, style and frequency. The final lexicon contains 13,859 Danish polarity lemmas and includes morphological information. It is freely available at https://github.com/dsldk/danish-sentiment-lexicon (licence CC-BY-SA 4.0 International).

M3 - Article in proceedings

SP - 2826

EP - 2832

BT - Proceedings of the Language Resources and Evaluation Conference

PB - European Language Resources Association

CY - Marseille

ER -
