Lexical Semantic Recognition

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Documents

In lexical semantics, full-sentence segmentation and segment labeling of various phenomena are generally treated separately, despite their interdependence. We hypothesize that a unified lexical semantic recognition task is an effective way to encapsulate previously disparate styles of annotation, including multiword expression identification / classification and supersense tagging. Using the STREUSLE corpus, we train a neural CRF sequence tagger and evaluate its performance along various axes of annotation. As the label set generalizes that of previous tasks (PARSEME, DiMSUM), we additionally evaluate how well the model generalizes to those test sets, finding that it approaches or surpasses existing models despite training only on STREUSLE. Our work also establishes baseline models and evaluation metrics for integrated and accurate modeling of lexical semantics, facilitating future work in this area.
Original languageEnglish
Title of host publicationProceedings of the 17th Workshop on Multiword Expressions (MWE 2021)
PublisherAssociation for Computational Linguistics
Publication date2021
Pages49-56
DOIs
Publication statusPublished - 2021
Event17th Workshop on Multiword Expressions (MWE 2021) - Online
Duration: 6 Aug 20216 Aug 2021

Conference

Conference17th Workshop on Multiword Expressions (MWE 2021)
ByOnline
Periode06/08/202106/08/2021

ID: 300916255