Identifying Parties in Manifestos and Parliament Speeches
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Standard
Identifying Parties in Manifestos and Parliament Speeches. / Navarretta, Costanza; Hansen, Dorte Haltrup.
Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse ( ParlaCLARIN II): LREC2020 Workshop PARLACLARIN 2. ed. / Darja Fiser; Maria Eskevich; Franciska de Jong. European Language Resources Association, 2020. p. 51-57.Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - GEN
T1 - Identifying Parties in Manifestos and Parliament Speeches
AU - Navarretta, Costanza
AU - Hansen, Dorte Haltrup
PY - 2020
Y1 - 2020
N2 - This paper addresses differences in the word use of two left-winged and two right-winged Danish parties, and how these differences,which reflect some of the basic stances of the parties, can be used to automatically identify the party of politicians from their speeches.In the first study, the most frequent and characteristic lemmas in the manifestos of the political parties as well as their languagecomplexity are analysed. The analysis shows inter alia that the most frequently occurring lemmas in the manifestos reflect eitherthe ideology or the position of the parties towards specific subjects, confirming for Danish preceding studies of English and Germanmanifestos. Successively, we scaled our analysis applying NLP methods to the transcribed speeches by members of the same partiesin the Parliament (Hansards) and trained machine learning algorithms in order to determine to what extent it is possible to predict the party of the politicians from the speeches. The speeches are a subset of the Danish Parliament corpus 2009–2017. The best results of the classification experiments gave a weighted F1-score of 0.57. These results are significantly better than the results obtained by the majority classifier (weighted F1-score = 0.11) and by chance results. They show that the party of the politicians can be distinguished from their speeches in nearly 60% of the cases, even if they debate about the same subjects and thus often use the same terminology. In the future, we will include the subject of the speeches in the prediction experiments.
AB - This paper addresses differences in the word use of two left-winged and two right-winged Danish parties, and how these differences,which reflect some of the basic stances of the parties, can be used to automatically identify the party of politicians from their speeches.In the first study, the most frequent and characteristic lemmas in the manifestos of the political parties as well as their languagecomplexity are analysed. The analysis shows inter alia that the most frequently occurring lemmas in the manifestos reflect eitherthe ideology or the position of the parties towards specific subjects, confirming for Danish preceding studies of English and Germanmanifestos. Successively, we scaled our analysis applying NLP methods to the transcribed speeches by members of the same partiesin the Parliament (Hansards) and trained machine learning algorithms in order to determine to what extent it is possible to predict the party of the politicians from the speeches. The speeches are a subset of the Danish Parliament corpus 2009–2017. The best results of the classification experiments gave a weighted F1-score of 0.57. These results are significantly better than the results obtained by the majority classifier (weighted F1-score = 0.11) and by chance results. They show that the party of the politicians can be distinguished from their speeches in nearly 60% of the cases, even if they debate about the same subjects and thus often use the same terminology. In the future, we will include the subject of the speeches in the prediction experiments.
M3 - Article in proceedings
SN - 9791095546474
SP - 51
EP - 57
BT - Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse ( ParlaCLARIN II)
A2 - Fiser, Darja
A2 - Eskevich, Maria
A2 - de Jong, Franciska
PB - European Language Resources Association
ER -
ID: 241213825