A Primer on Contrastive Pretraining in Language Processing

A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned, and Perspectives

Research output: Contribution to journal › Journal article › Research › peer-review

Documents

Fulltext
Accepted author manuscript, 460 KB, PDF document

Modern natural language processing (NLP) methods employ self-supervised pretraining objectives such as masked language modeling to boost the performance of various downstream tasks. These pretraining methods are frequently extended with recurrence, adversarial, or linguistic property masking. Recently, contrastive self-supervised training objectives have enabled successes in image representation pretraining by learning to contrast input-input pairs of augmented images as either similar or dissimilar. In NLP however, a single token augmentation can invert the meaning of a sentence during input-input contrastive learning, which led to input-output contrastive approaches that avoid the issue by instead contrasting over input-label pairs. In this primer, we summarize recent self-supervised and supervised contrastive NLP pretraining methods and describe where they are used to improve language modeling, zero to few-shot learning, pretraining data-efficiency, and specific NLP tasks. We overview key contrastive learning concepts with lessons learned from prior research and structure works by applications. Finally, we point to open challenges and future directions for contrastive NLP to encourage bringing contrastive NLP pretraining closer to recent successes in image representation pretraining.

Original language	English
Article number	203
Journal	ACM Computing Surveys
Volume	55
Issue number	10
Number of pages	17
ISSN	0360-0300
DOIs	https://doi.org/10.1145/3561970
Publication status	Published - 2023

Bibliographical note

Research areas

Contrastive learning

ID: 337589600

Department of Computer Science

A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned, and Perspectives

Documents

Bibliographical note

Research areas