Unsupervised Discovery of Gendered Language through Latent-Variable Modeling
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Documents
- OA-Unsupervised Discovery of Gendered Language
Final published version, 707 KB, PDF document
Studying the ways in which language is gendered has long been an area of interest in sociolinguistics. Studies have explored, for example, the speech of male and female characters in film and the language used to describe male and female politicians. In this paper, we aim not to merely study this phenomenon qualitatively, but instead to quantify the degree to which the language used to describe men and women is different and, moreover, different in a positive or negative way. To that end, we introduce a generative latent-variable model that jointly represents adjective (or verb) choice, with its sentiment, given the natural gender of a head (or dependent) noun. We find that there are significant differences between descriptions of male and female nouns and that these differences align with common gender stereotypes: Positive adjectives used to describe women are more often related to their bodies than adjectives used to describe men
Original language | English |
---|---|
Title of host publication | Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics |
Publisher | Association for Computational Linguistics |
Publication date | 2020 |
Pages | 1706-1716 |
DOIs | |
Publication status | Published - 2020 |
Event | 57th Annual Meeting of the Association for Computational Linguistics - Florence, Italy Duration: 1 Jul 2019 → 1 Jul 2019 |
Conference
Conference | 57th Annual Meeting of the Association for Computational Linguistics |
---|---|
Land | Italy |
By | Florence, |
Periode | 01/07/2019 → 01/07/2019 |
Number of downloads are based on statistics from Google Scholar and www.ku.dk
No data available
ID: 240629975