Sussi Olsen
Academic Research Staff
Department of Nordic Studies and Linguistics
Emil Holms Kanal 2, 2300 København S, 22 Bygning 22 (Afsnit 1), Building: 22-3-38
Knowledge of languages
Danish, English and Spanish
Education: Master's degree in Spanish and Computational Linguistics at the University of Copenhagen.
Since 1997 employed at Center for Sprogteknologi, University of Copenhagen, as a research associate.
Research projects and tasks:
2021 -> Part of the project the Central Word Register for Danish, COR, about building a joint Danish language resource for AI purposes.
2021 -> Part of two smaller EU projects, European Language Grid and European Language Equality, of which the main aim is to strengthen language technology for all languages in Europe. One of the tasks is to update the white paper on the state of language technology in the member countries.
2020-> Part of the EU project Federated TermBank, a continuation of eTTB that will build termbanks as satellites in each country, all synchronized with the central termbank Eurotermbank. Thus shared terminology will be updated locally as well as centrally.
2018-> Part of the DanNet2 project, about upgrading and expanding DanNet, focussing on adjectives from the Danish Thesaurus. Furthermore, a larger part of DanNet will be linked to Princeton WordNet, and a sentiment lexicon will be developed, also based on the thesaurus.
2018 -> Part of the ELEXIS project, which fosters cooperation and knowledge exchange between lexicographic communities within the EU and aims to make lexicographic data available for language technology.
2017-> Part of the EU project eTranslation TermBank collecting terminology for the EU translation platform.
2017 -> Part of the EU project ELRC that collects language data relevant to public institutions for the EU translation platform that is open to public institutions.
2016-17 Part of the CLARIN+ project (EU) dealing with identifying new CLARIN partners.
2016-17 Part of a collaborative project with The Society for Danish Language and Literature with the aim of developing a lemmatiser and a PoS tagger for 19th century Danish.
2015-17 Part of the EU project Language Technology Observatory, mainly engaged in identifying language resources for machine translation.
2013-17 Part of the project Semantic Processing across Domains, especially involved in semantic annotation of corpora.
Participant in the DigHumLab project with a variety of tasks.
2012-2015 Part of the Nordic network LUNAS focusing on academic language use in the Nordic countries
From 2011 to 2013 participating in the EU project META-NORD, mainly working with data collection and specification of subject domains.
From 2010 participating in the EU project LetsMT!
From 2008 to 2010 responsible for work package 2.2 in the DK-CLARIN project, with the aim of compiling an 11 m. sub-corpus of language for special purposes. The corpus will be PoS-tagged and annotated with term status information.
Participated in the DAD project (2007-2009, funded by FKK) dealing with annotation of abstract anaphors and their references in written and spoken corpora.
From 2004 and ongoing: validator of various lexicon and corpus projects, e.g. the EuroWordnet for ELRA and the national Dutch corpus. Supervisor for validators for all kinds of languages.
Participated in the SIMPLE-DK project in 2000 working with ontologically structured vocabularies.
From 1997 to 2004 mainly working on the computational lexicon project STO, where the principal research and working areas have been the administration of the domain-specific text corpora and the selection of the vocabularies from these, and the treatment of multiword concepts along with the syntactic structuring of the vocabulary. Still doing updates and giving courses on STO.
Representative positions
Member of the board of representatives and the board of directors of the Danish Language Council
Former chair of the society of Lexicographers in Denmark
Education
MA
Most downloads
From Thesaurus to Framenet (168 downloads)
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published
Providing a Catalogue of Language Resources for Commercial Users (81 downloads)
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published
Designing the ELEXIS Parallel Sense-Annotated Dataset in 10 European Languages (73 downloads)
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Published