Semi-Supervised Exaggeration Detection of Health Science Press Releases

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Standard

Semi-Supervised Exaggeration Detection of Health Science Press Releases. / Wright, Dustin; Augenstein, Isabelle.

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2021. p. 10824-10836.

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Harvard

Wright, D & Augenstein, I 2021, Semi-Supervised Exaggeration Detection of Health Science Press Releases. in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, pp. 10824-10836, 2021 Conference on Empirical Methods in Natural Language Processing, Online and Punta Cana, Dominican Republic, 07/11/2021. https://doi.org/10.18653/v1/2021.emnlp-main.845

APA

Wright, D., & Augenstein, I. (2021). Semi-Supervised Exaggeration Detection of Health Science Press Releases. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (pp. 10824-10836). Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.emnlp-main.845

Vancouver

Wright D, Augenstein I. Semi-Supervised Exaggeration Detection of Health Science Press Releases. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics. 2021. p. 10824-10836 https://doi.org/10.18653/v1/2021.emnlp-main.845

Author

Wright, Dustin ; Augenstein, Isabelle. / Semi-Supervised Exaggeration Detection of Health Science Press Releases. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2021. pp. 10824-10836

Bibtex

@inproceedings{9a1ecffe5b4440a5ba06228711e52e33,
title = "Semi-Supervised Exaggeration Detection of Health Science Press Releases",
abstract = "Public trust in science depends on honest and factual communication of scientific papers. However, recent studies have demonstrated a tendency of news media to misrepresent scientific papers by exaggerating their findings. Given this, we present a formalization of and study into the problem of exaggeration detection in science communication. While there are an abundance of scientific papers and popular media articles written about them, very rarely do the articles include a direct link to the original paper, making data collection challenging, and necessitating the need for few-shot learning. We address this by curating a set of labeled press release/abstract pairs from existing expert annotated studies on exaggeration in press releases of scientific papers suitable for benchmarking the performance of machine learning models on the task. Using limited data from this and previous studies on exaggeration detection in science, we introduce MT-PET, a multi-task version of Pattern Exploiting Training (PET), which leverages knowledge from complementary cloze-style QA tasks to improve few-shot learning. We demonstrate that MT-PET outperforms PET and supervised learning both when data is limited, as well as when there is an abundance of data for the main task.",
author = "Dustin Wright and Isabelle Augenstein",
year = "2021",
doi = "10.18653/v1/2021.emnlp-main.845",
language = "English",
pages = "10824--10836",
booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
publisher = "Association for Computational Linguistics",
note = "2021 Conference on Empirical Methods in Natural Language Processing ; Conference date: 07-11-2021 Through 11-11-2021",

}

RIS

TY - GEN

T1 - Semi-Supervised Exaggeration Detection of Health Science Press Releases

AU - Wright, Dustin

AU - Augenstein, Isabelle

PY - 2021

Y1 - 2021

N2 - Public trust in science depends on honest and factual communication of scientific papers. However, recent studies have demonstrated a tendency of news media to misrepresent scientific papers by exaggerating their findings. Given this, we present a formalization of and study into the problem of exaggeration detection in science communication. While there are an abundance of scientific papers and popular media articles written about them, very rarely do the articles include a direct link to the original paper, making data collection challenging, and necessitating the need for few-shot learning. We address this by curating a set of labeled press release/abstract pairs from existing expert annotated studies on exaggeration in press releases of scientific papers suitable for benchmarking the performance of machine learning models on the task. Using limited data from this and previous studies on exaggeration detection in science, we introduce MT-PET, a multi-task version of Pattern Exploiting Training (PET), which leverages knowledge from complementary cloze-style QA tasks to improve few-shot learning. We demonstrate that MT-PET outperforms PET and supervised learning both when data is limited, as well as when there is an abundance of data for the main task.

AB - Public trust in science depends on honest and factual communication of scientific papers. However, recent studies have demonstrated a tendency of news media to misrepresent scientific papers by exaggerating their findings. Given this, we present a formalization of and study into the problem of exaggeration detection in science communication. While there are an abundance of scientific papers and popular media articles written about them, very rarely do the articles include a direct link to the original paper, making data collection challenging, and necessitating the need for few-shot learning. We address this by curating a set of labeled press release/abstract pairs from existing expert annotated studies on exaggeration in press releases of scientific papers suitable for benchmarking the performance of machine learning models on the task. Using limited data from this and previous studies on exaggeration detection in science, we introduce MT-PET, a multi-task version of Pattern Exploiting Training (PET), which leverages knowledge from complementary cloze-style QA tasks to improve few-shot learning. We demonstrate that MT-PET outperforms PET and supervised learning both when data is limited, as well as when there is an abundance of data for the main task.

U2 - 10.18653/v1/2021.emnlp-main.845

DO - 10.18653/v1/2021.emnlp-main.845

M3 - Article in proceedings

SP - 10824

EP - 10836

BT - Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

PB - Association for Computational Linguistics

T2 - 2021 Conference on Empirical Methods in Natural Language Processing

Y2 - 7 November 2021 through 11 November 2021

ER -

ID: 301732244