An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns. / Basirat, Ali; Allassonnière-Tang, Marc; Berdicevskis, Aleksandrs.

I: Linguistics Vanguard, Bind 7, Nr. 1, 01.01.2021, s. 2020-0048.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Basirat, A, Allassonnière-Tang, M & Berdicevskis, A 2021, 'An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns', Linguistics Vanguard, bind 7, nr. 1, s. 2020-0048. https://doi.org/10.1515/lingvan-2020-0048

APA

Basirat, A., Allassonnière-Tang, M., & Berdicevskis, A. (2021). An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns. Linguistics Vanguard, 7(1), 2020-0048. https://doi.org/10.1515/lingvan-2020-0048

Vancouver

Basirat A, Allassonnière-Tang M, Berdicevskis A. An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns. Linguistics Vanguard. 2021 jan. 1;7(1):2020-0048. https://doi.org/10.1515/lingvan-2020-0048

Author

Basirat, Ali ; Allassonnière-Tang, Marc ; Berdicevskis, Aleksandrs. / An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns. I: Linguistics Vanguard. 2021 ; Bind 7, Nr. 1. s. 2020-0048.

Bibtex

@article{72b87e2080824fa38d7e6f708e4cafe4,
title = "An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns",
abstract = "This study conducts an experimental evaluation of two hypotheses about the contributions of formal and semantic features to the grammatical gender assignment of nouns. One of the hypotheses (Corbett and Fraser 2000) claims that semantic features dominate formal ones. The other hypothesis, formulated within the optimal gender assignment theory (Rice 2006), states that form and semantics contribute equally. Both hypotheses claim that the combination of formal and semantic features yields the most accurate gender identification. In this paper, we operationalize and test these hypotheses by trying to predict grammatical gender using only character-based embeddings (that capture only formal features), only context-based embeddings (that capture only semantic features) and the combination of both. We performed the experiment using data from three languages with different gender systems (French, German and Russian). Formal features are a significantly better predictor of gender than semantic ones, and the difference in prediction accuracy is very large. Overall, formal features are also significantly better than the combination of form and semantics, but the difference is very small and the results for this comparison are not entirely consistent across languages.",
keywords = "Formal features, Gender, Neural networks, Semantics, Word embeddings",
author = "Ali Basirat and Marc Allassonni{\`e}re-Tang and Aleksandrs Berdicevskis",
note = "Funding Information: Research funding: The second author expresses his gratitude for the support of the IDEXLYON Fellowship Grant (16-IDEX-0005), University of Lyon Grant NSCO ED 476 (ANR-10-LABX-0081), and French National Research Agency (ANR-11-IDEX-0007). Publisher Copyright: {\textcopyright} 2021 Walter de Gruyter GmbH. All rights reserved.",
year = "2021",
month = jan,
day = "1",
doi = "10.1515/lingvan-2020-0048",
language = "English",
volume = "7",
pages = "2020--0048",
journal = "Linguistics Vanguard",
issn = "2199-174X",
publisher = "De Gruyter",
number = "1",

}

RIS

TY - JOUR

T1 - An empirical study on the contribution of formal and semantic features to the grammatical gender of nouns

AU - Basirat, Ali

AU - Allassonnière-Tang, Marc

AU - Berdicevskis, Aleksandrs

N1 - Funding Information: Research funding: The second author expresses his gratitude for the support of the IDEXLYON Fellowship Grant (16-IDEX-0005), University of Lyon Grant NSCO ED 476 (ANR-10-LABX-0081), and French National Research Agency (ANR-11-IDEX-0007). Publisher Copyright: © 2021 Walter de Gruyter GmbH. All rights reserved.

PY - 2021/1/1

Y1 - 2021/1/1

N2 - This study conducts an experimental evaluation of two hypotheses about the contributions of formal and semantic features to the grammatical gender assignment of nouns. One of the hypotheses (Corbett and Fraser 2000) claims that semantic features dominate formal ones. The other hypothesis, formulated within the optimal gender assignment theory (Rice 2006), states that form and semantics contribute equally. Both hypotheses claim that the combination of formal and semantic features yields the most accurate gender identification. In this paper, we operationalize and test these hypotheses by trying to predict grammatical gender using only character-based embeddings (that capture only formal features), only context-based embeddings (that capture only semantic features) and the combination of both. We performed the experiment using data from three languages with different gender systems (French, German and Russian). Formal features are a significantly better predictor of gender than semantic ones, and the difference in prediction accuracy is very large. Overall, formal features are also significantly better than the combination of form and semantics, but the difference is very small and the results for this comparison are not entirely consistent across languages.

AB - This study conducts an experimental evaluation of two hypotheses about the contributions of formal and semantic features to the grammatical gender assignment of nouns. One of the hypotheses (Corbett and Fraser 2000) claims that semantic features dominate formal ones. The other hypothesis, formulated within the optimal gender assignment theory (Rice 2006), states that form and semantics contribute equally. Both hypotheses claim that the combination of formal and semantic features yields the most accurate gender identification. In this paper, we operationalize and test these hypotheses by trying to predict grammatical gender using only character-based embeddings (that capture only formal features), only context-based embeddings (that capture only semantic features) and the combination of both. We performed the experiment using data from three languages with different gender systems (French, German and Russian). Formal features are a significantly better predictor of gender than semantic ones, and the difference in prediction accuracy is very large. Overall, formal features are also significantly better than the combination of form and semantics, but the difference is very small and the results for this comparison are not entirely consistent across languages.

KW - Formal features

KW - Gender

KW - Neural networks

KW - Semantics

KW - Word embeddings

UR - http://www.scopus.com/inward/record.url?scp=85102994991&partnerID=8YFLogxK

U2 - 10.1515/lingvan-2020-0048

DO - 10.1515/lingvan-2020-0048

M3 - Journal article

AN - SCOPUS:85102994991

VL - 7

SP - 2020

EP - 2048

JO - Linguistics Vanguard

JF - Linguistics Vanguard

SN - 2199-174X

IS - 1

ER -

ID: 366046076