Joint Emotion Label Space Modelling for Affect Lexica

Research output: Contribution to journal › Journal article › Research

Standard

Joint Emotion Label Space Modelling for Affect Lexica. / De Bruyne, Luna ; Atanasova, Pepa Kostadinova; Augenstein, Isabelle.

In: arXiv.org, Vol. CoRR 2019, 2019.

Research output: Contribution to journal › Journal article › Research

Harvard

De Bruyne, L, Atanasova, PK & Augenstein, I 2019, 'Joint Emotion Label Space Modelling for Affect Lexica', arXiv.org, vol. CoRR 2019. <https://arxiv.org/abs/1911.08782>

APA

De Bruyne, L., Atanasova, P. K., & Augenstein, I. (2019). Joint Emotion Label Space Modelling for Affect Lexica. arXiv.org, CoRR 2019. https://arxiv.org/abs/1911.08782

Vancouver

De Bruyne L, Atanasova PK, Augenstein I. Joint Emotion Label Space Modelling for Affect Lexica. arXiv.org. 2019;CoRR 2019.

Author

De Bruyne, Luna ; Atanasova, Pepa Kostadinova ; Augenstein, Isabelle. / Joint Emotion Label Space Modelling for Affect Lexica. In: arXiv.org. 2019 ; Vol. CoRR 2019.

Bibtex

@article{161a7a59f4d54940be0bcc8f1973b7e9,

title = "Joint Emotion Label Space Modelling for Affect Lexica",

abstract = "Emotion lexica are commonly used resources to combat data poverty in automatic emotion detection. However, methodological issues emerge when employing them: lexica are often not very extensive, and the way they are constructed can vary widely -- from lab conditions to crowdsourced approaches and distant supervision. Furthermore, both categorical frameworks and dimensional frameworks coexist, in which theorists provide many different sets of categorical labels or dimensional axes. The heterogenous nature of the resulting emotion detection resources results in a need for a unified approach to utilising them. This paper contributes to the field of emotion analysis in NLP by a) presenting the first study to unify existing emotion detection resources automatically and thus learn more about the relationships between them; b) exploring the use of existing lexica for the above-mentioned task; c) presenting an approach to automatically combining emotion lexica, namely by a multi-view variational auto-encoder (VAE), which facilitates the mapping of datasets into a joint emotion label space. We test the utility of joint emotion lexica by using them as additional features in state-of-the art emotion detection models. Our overall findings are that emotion lexica can offer complementary information to even extremely large pre-trained models such as BERT. The performance of our models is comparable to state-of-the art models that are specifically engineered for certain datasets, and even outperform the state-of-the art on four datasets. ",

author = "{De Bruyne}, Luna and Atanasova, {Pepa Kostadinova} and Isabelle Augenstein",

year = "2019",

language = "English",

volume = "CoRR 2019",

journal = "arXiv.org",

}

RIS

TY - JOUR

T1 - Joint Emotion Label Space Modelling for Affect Lexica

AU - De Bruyne, Luna

AU - Atanasova, Pepa Kostadinova

AU - Augenstein, Isabelle

PY - 2019

Y1 - 2019

N2 - Emotion lexica are commonly used resources to combat data poverty in automatic emotion detection. However, methodological issues emerge when employing them: lexica are often not very extensive, and the way they are constructed can vary widely -- from lab conditions to crowdsourced approaches and distant supervision. Furthermore, both categorical frameworks and dimensional frameworks coexist, in which theorists provide many different sets of categorical labels or dimensional axes. The heterogenous nature of the resulting emotion detection resources results in a need for a unified approach to utilising them. This paper contributes to the field of emotion analysis in NLP by a) presenting the first study to unify existing emotion detection resources automatically and thus learn more about the relationships between them; b) exploring the use of existing lexica for the above-mentioned task; c) presenting an approach to automatically combining emotion lexica, namely by a multi-view variational auto-encoder (VAE), which facilitates the mapping of datasets into a joint emotion label space. We test the utility of joint emotion lexica by using them as additional features in state-of-the art emotion detection models. Our overall findings are that emotion lexica can offer complementary information to even extremely large pre-trained models such as BERT. The performance of our models is comparable to state-of-the art models that are specifically engineered for certain datasets, and even outperform the state-of-the art on four datasets.

AB - Emotion lexica are commonly used resources to combat data poverty in automatic emotion detection. However, methodological issues emerge when employing them: lexica are often not very extensive, and the way they are constructed can vary widely -- from lab conditions to crowdsourced approaches and distant supervision. Furthermore, both categorical frameworks and dimensional frameworks coexist, in which theorists provide many different sets of categorical labels or dimensional axes. The heterogenous nature of the resulting emotion detection resources results in a need for a unified approach to utilising them. This paper contributes to the field of emotion analysis in NLP by a) presenting the first study to unify existing emotion detection resources automatically and thus learn more about the relationships between them; b) exploring the use of existing lexica for the above-mentioned task; c) presenting an approach to automatically combining emotion lexica, namely by a multi-view variational auto-encoder (VAE), which facilitates the mapping of datasets into a joint emotion label space. We test the utility of joint emotion lexica by using them as additional features in state-of-the art emotion detection models. Our overall findings are that emotion lexica can offer complementary information to even extremely large pre-trained models such as BERT. The performance of our models is comparable to state-of-the art models that are specifically engineered for certain datasets, and even outperform the state-of-the art on four datasets.

M3 - Journal article

VL - CoRR 2019

JO - arXiv.org

JF - arXiv.org

ER -

ID: 255054414

Department of Computer Science