Ice and Fire: Dataset on Sentiment, Emotions, Toxicity, Sarcasm, Hate speech, Sympathy and More in Icelandic Blog Comments

Publication: Contribution to book/anthology/report; Conference contribution in proceedings; Research; Peer-reviewed

Standard

Ice and Fire : Dataset on Sentiment, Emotions, Toxicity, Sarcasm, Hate speech, Sympathy and More in Icelandic Blog Comments. / Friðriksdóttir, Steinunn Rut; Simonsen, Annika; Ásmundsson, Atli Snær; Friðjónsdóttir, Guðrún Lilja; Ingason, Anton Karl; Snæbjarnarson, Vésteinn; Einarsson, Hafsteinn.

Proceedings of the Fourth Workshop on Threat, Aggression & Cyberbullying @ LREC-COLING-2024: [TRAC-2024 Workshop]. ed. / Ritesh Kumar; Atul Kr. Ojha; Shervin Malmasi; Bharathi Raja Chakravarthi; Bornini Lahiri; Siddharth Singh; Shyam Ratan. European Language Resources Association (ELRA), 2024. pp. 73-84.

Harvard

Friðriksdóttir, SR, Simonsen, A, Ásmundsson, AS, Friðjónsdóttir, GL, Ingason, AK, Snæbjarnarson, V & Einarsson, H 2024, Ice and Fire: Dataset on Sentiment, Emotions, Toxicity, Sarcasm, Hate speech, Sympathy and More in Icelandic Blog Comments. in R Kumar, AK Ojha, S Malmasi, BR Chakravarthi, B Lahiri, S Singh & S Ratan (eds), Proceedings of the Fourth Workshop on Threat, Aggression & Cyberbullying @ LREC-COLING-2024: [TRAC-2024 Workshop]. European Language Resources Association (ELRA), pp. 73-84, 4th Workshop on Threat, Aggression and Cyberbullying, TRAC 2024, Torino, Italy, 20/05/2024. <https://aclanthology.org/2024.trac-1.9>

APA

Friðriksdóttir, S. R., Simonsen, A., Ásmundsson, A. S., Friðjónsdóttir, G. L., Ingason, A. K., Snæbjarnarson, V., & Einarsson, H. (2024). Ice and Fire: Dataset on Sentiment, Emotions, Toxicity, Sarcasm, Hate speech, Sympathy and More in Icelandic Blog Comments. In R. Kumar, A. K. Ojha, S. Malmasi, B. R. Chakravarthi, B. Lahiri, S. Singh, & S. Ratan (Eds.), Proceedings of the Fourth Workshop on Threat, Aggression & Cyberbullying @ LREC-COLING-2024: [TRAC-2024 Workshop] (pp. 73-84). European Language Resources Association (ELRA). https://aclanthology.org/2024.trac-1.9

Vancouver

Friðriksdóttir SR, Simonsen A, Ásmundsson AS, Friðjónsdóttir GL, Ingason AK, Snæbjarnarson V et al. Ice and Fire: Dataset on Sentiment, Emotions, Toxicity, Sarcasm, Hate speech, Sympathy and More in Icelandic Blog Comments. In Kumar R, Ojha AK, Malmasi S, Chakravarthi BR, Lahiri B, Singh S, Ratan S, editors, Proceedings of the Fourth Workshop on Threat, Aggression & Cyberbullying @ LREC-COLING-2024: [TRAC-2024 Workshop]. European Language Resources Association (ELRA). 2024. p. 73-84

Author

Friðriksdóttir, Steinunn Rut ; Simonsen, Annika ; Ásmundsson, Atli Snær ; Friðjónsdóttir, Guðrún Lilja ; Ingason, Anton Karl ; Snæbjarnarson, Vésteinn ; Einarsson, Hafsteinn. / Ice and Fire : Dataset on Sentiment, Emotions, Toxicity, Sarcasm, Hate speech, Sympathy and More in Icelandic Blog Comments. Proceedings of the Fourth Workshop on Threat, Aggression & Cyberbullying @ LREC-COLING-2024: [TRAC-2024 Workshop]. ed. / Ritesh Kumar ; Atul Kr. Ojha ; Shervin Malmasi ; Bharathi Raja Chakravarthi ; Bornini Lahiri ; Siddharth Singh ; Shyam Ratan. European Language Resources Association (ELRA), 2024. pp. 73-84

Bibtex

@inproceedings{0cba28a1adb04087b2d70cafd9824c8a,
title = "Ice and Fire: Dataset on Sentiment, Emotions, Toxicity, Sarcasm, Hate speech, Sympathy and More in Icelandic Blog Comments",
abstract = "This study introduces {"}Ice and Fire,{"} a Multi-Task Learning (MTL) dataset tailored for sentiment analysis in the Icelandic language. It encompasses a wide range of linguistic tasks, including sentiment and emotion detection, as well as the identification of toxicity, hate speech, encouragement, sympathy, sarcasm/irony, and trolling. With 261 fully annotated blog comments and 1,045 comments annotated in at least one task, this contribution marks a significant step forward in the field of Icelandic natural language processing. The dataset provides a comprehensive resource for understanding the nuances of online communication in Icelandic and an interface to expand the annotation effort. Despite the challenges inherent in subjective interpretation of text, our findings highlight the positive potential of this dataset to improve text analysis techniques and encourage more inclusive online discourse in Icelandic communities. With promising baseline performances, {"}Ice and Fire{"} sets the stage for future research to enhance automated text analysis and develop sophisticated language technologies, contributing to healthier online environments and advancing Icelandic language resources.",
keywords = "Icelandic Language Resources, Multi-Task Learning, Sentiment Analysis",
author = "Fri{\dh}riksd{\'o}ttir, {Steinunn Rut} and Annika Simonsen and {\'A}smundsson, {Atli Sn{\ae}r} and Fri{\dh}j{\'o}nsd{\'o}ttir, {Gu{\dh}r{\'u}n Lilja} and Ingason, {Anton Karl} and V{\'e}steinn Sn{\ae}bjarnarson and Hafsteinn Einarsson",
note = "Funding Information: Steinunn Rut Fri{\dh}riksd{\'o}ttir was supported by The Ludvig Storr Trust no. LSTORR2023-93030 and The Icelandic Language Technology Programme. Annika Simonsen was supported by The European Commission under grant agreement no. 101135671. V{\'e}steinn Sn{\ae}bjarnarson acknowledges support from the Pioneer Centre for AI, DNRF grant number P1. Publisher Copyright: {\textcopyright} 2024 ELRA Language Resource Association.; 4th Workshop on Threat, Aggression and Cyberbullying, TRAC 2024 ; Conference date: 20-05-2024",
year = "2024",
language = "English",
pages = "73--84",
editor = "Ritesh Kumar and Ojha, {Atul Kr.} and Shervin Malmasi and Chakravarthi, {Bharathi Raja} and Bornini Lahiri and Siddharth Singh and Shyam Ratan",
booktitle = "Proceedings of the Fourth Workshop on Threat, Aggression \& Cyberbullying @ LREC-COLING-2024",
publisher = "European Language Resources Association (ELRA)",

}

RIS

TY - GEN

T1 - Ice and Fire

T2 - 4th Workshop on Threat, Aggression and Cyberbullying, TRAC 2024

AU - Friðriksdóttir, Steinunn Rut

AU - Simonsen, Annika

AU - Ásmundsson, Atli Snær

AU - Friðjónsdóttir, Guðrún Lilja

AU - Ingason, Anton Karl

AU - Snæbjarnarson, Vésteinn

AU - Einarsson, Hafsteinn

N1 - Funding Information: Steinunn Rut Friðriksdóttir was supported by The Ludvig Storr Trust no. LSTORR2023-93030 and The Icelandic Language Technology Programme. Annika Simonsen was supported by The European Commission under grant agreement no. 101135671. Vésteinn Snæbjarnarson acknowledges support from the Pioneer Centre for AI, DNRF grant number P1. Publisher Copyright: © 2024 ELRA Language Resource Association.

PY - 2024

Y1 - 2024

N2 - This study introduces "Ice and Fire," a Multi-Task Learning (MTL) dataset tailored for sentiment analysis in the Icelandic language. It encompasses a wide range of linguistic tasks, including sentiment and emotion detection, as well as the identification of toxicity, hate speech, encouragement, sympathy, sarcasm/irony, and trolling. With 261 fully annotated blog comments and 1,045 comments annotated in at least one task, this contribution marks a significant step forward in the field of Icelandic natural language processing. The dataset provides a comprehensive resource for understanding the nuances of online communication in Icelandic and an interface to expand the annotation effort. Despite the challenges inherent in subjective interpretation of text, our findings highlight the positive potential of this dataset to improve text analysis techniques and encourage more inclusive online discourse in Icelandic communities. With promising baseline performances, "Ice and Fire" sets the stage for future research to enhance automated text analysis and develop sophisticated language technologies, contributing to healthier online environments and advancing Icelandic language resources.

KW - Icelandic Language Resources

KW - Multi-Task Learning

KW - Sentiment Analysis

M3 - Article in proceedings

AN - SCOPUS:85195197382

SP - 73

EP - 84

BT - Proceedings of the Fourth Workshop on Threat, Aggression & Cyberbullying @ LREC-COLING-2024

A2 - Kumar, Ritesh

A2 - Ojha, Atul Kr.

A2 - Malmasi, Shervin

A2 - Chakravarthi, Bharathi Raja

A2 - Lahiri, Bornini

A2 - Singh, Siddharth

A2 - Ratan, Shyam

PB - European Language Resources Association (ELRA)

Y2 - 20 May 2024

ER -

ID: 395094094