Machine Understanding of Scientific Language

Research output: Book/ReportPh.D. thesisResearch

Standard

Machine Understanding of Scientific Language. / Wright, Dustin.

Department of Computer Science, Faculty of Science, University of Copenhagen, 2023. 210 p.

Research output: Book/ReportPh.D. thesisResearch

Harvard

Wright, D 2023, Machine Understanding of Scientific Language. Department of Computer Science, Faculty of Science, University of Copenhagen.

APA

Wright, D. (2023). Machine Understanding of Scientific Language. Department of Computer Science, Faculty of Science, University of Copenhagen.

Vancouver

Wright D. Machine Understanding of Scientific Language. Department of Computer Science, Faculty of Science, University of Copenhagen, 2023. 210 p.

Author

Wright, Dustin. / Machine Understanding of Scientific Language. Department of Computer Science, Faculty of Science, University of Copenhagen, 2023. 210 p.

Bibtex

@phdthesis{08e7b0d4b0df4a23941cb6e505befb54,
title = "Machine Understanding of Scientific Language",
abstract = "Scientific information expresses human understanding of nature. This knowledge is largelydisseminated in different forms of text, including scientific papers, news articles, and discourseamong people on social media. While important for accelerating our pursuit of knowledge, notall scientific text is faithful to the underlying science. As the volume of this text has burgeonedonline in recent years, it has become a problem of societal importance to be able to identifythe faithfulness of a given piece of scientific text automatically. This thesis is concernedwith the cultivation of datasets, methods, and tools for machine understanding of scientificlanguage, in order to analyze and understand science communication at scale. To arrive atthis, I present several contributions in three areas of natural language processing and machinelearning: automatic fact checking, learning with limited data, and scientific text processing. Thesecontributions include new methods and resources for identifying check-worthy claims, adversarialclaim generation, multi-source domain adaptation, learning from crowd-sourced labels, cite-worthiness detection, zero-shot scientific fact checking, detecting exaggerated scientific claims,and modeling degrees of information change in science communication. Critically, I demonstratehow the research outputs of this thesis are useful for effectively learning from limited amounts ofscientific text in order to identify misinformative scientific statements and generate new insightsinto the science communication process.",
author = "Dustin Wright",
year = "2023",
language = "English",
publisher = "Department of Computer Science, Faculty of Science, University of Copenhagen",

}

RIS

TY - BOOK

T1 - Machine Understanding of Scientific Language

AU - Wright, Dustin

PY - 2023

Y1 - 2023

N2 - Scientific information expresses human understanding of nature. This knowledge is largelydisseminated in different forms of text, including scientific papers, news articles, and discourseamong people on social media. While important for accelerating our pursuit of knowledge, notall scientific text is faithful to the underlying science. As the volume of this text has burgeonedonline in recent years, it has become a problem of societal importance to be able to identifythe faithfulness of a given piece of scientific text automatically. This thesis is concernedwith the cultivation of datasets, methods, and tools for machine understanding of scientificlanguage, in order to analyze and understand science communication at scale. To arrive atthis, I present several contributions in three areas of natural language processing and machinelearning: automatic fact checking, learning with limited data, and scientific text processing. Thesecontributions include new methods and resources for identifying check-worthy claims, adversarialclaim generation, multi-source domain adaptation, learning from crowd-sourced labels, cite-worthiness detection, zero-shot scientific fact checking, detecting exaggerated scientific claims,and modeling degrees of information change in science communication. Critically, I demonstratehow the research outputs of this thesis are useful for effectively learning from limited amounts ofscientific text in order to identify misinformative scientific statements and generate new insightsinto the science communication process.

AB - Scientific information expresses human understanding of nature. This knowledge is largelydisseminated in different forms of text, including scientific papers, news articles, and discourseamong people on social media. While important for accelerating our pursuit of knowledge, notall scientific text is faithful to the underlying science. As the volume of this text has burgeonedonline in recent years, it has become a problem of societal importance to be able to identifythe faithfulness of a given piece of scientific text automatically. This thesis is concernedwith the cultivation of datasets, methods, and tools for machine understanding of scientificlanguage, in order to analyze and understand science communication at scale. To arrive atthis, I present several contributions in three areas of natural language processing and machinelearning: automatic fact checking, learning with limited data, and scientific text processing. Thesecontributions include new methods and resources for identifying check-worthy claims, adversarialclaim generation, multi-source domain adaptation, learning from crowd-sourced labels, cite-worthiness detection, zero-shot scientific fact checking, detecting exaggerated scientific claims,and modeling degrees of information change in science communication. Critically, I demonstratehow the research outputs of this thesis are useful for effectively learning from limited amounts ofscientific text in order to identify misinformative scientific statements and generate new insightsinto the science communication process.

M3 - Ph.D. thesis

BT - Machine Understanding of Scientific Language

PB - Department of Computer Science, Faculty of Science, University of Copenhagen

ER -

ID: 347874489