Linguistically debatable or just plain wrong?

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Standard

Linguistically debatable or just plain wrong? / Plank, Barbara; Hovy, Dirk; Søgaard, Anders.

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Vol. volume 2 Baltimore, Maryland : Association for Computational Linguistics, 2014. p. 507-511.

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Harvard

Plank, B, Hovy, D & Søgaard, A 2014, Linguistically debatable or just plain wrong? in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). vol. volume 2, Association for Computational Linguistics, Baltimore, Maryland, pp. 507-511.

APA

Plank, B., Hovy, D., & Søgaard, A. (2014). Linguistically debatable or just plain wrong? In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (Vol. volume 2, pp. 507-511). Association for Computational Linguistics.

Vancouver

Plank B, Hovy D, Søgaard A. Linguistically debatable or just plain wrong? In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Vol. volume 2. Baltimore, Maryland: Association for Computational Linguistics. 2014. p. 507-511

Author

Plank, Barbara ; Hovy, Dirk ; Søgaard, Anders. / Linguistically debatable or just plain wrong?. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Vol. volume 2 Baltimore, Maryland : Association for Computational Linguistics, 2014. pp. 507-511

Bibtex

@inproceedings{84d27a702dfd4d11adbd09a5d9cd5bcd,
title = "Linguistically debatable or just plain wrong?",
abstract = "In linguistic annotation projects, we typically develop annotation guidelines to maximize inter-annotator agreement and learnability. However, in this position paper we question whether we should actually limit the disagreements between annotators, rather than embrace them. We present an empirical analysis of part-of-speech annotated data sets that suggests that certain disagreements are systematic across domains and languages. This points to an underlying ambiguity rather than random errors. Moreover, a quantitative analysis of disagreements reveals that the majority of them are due to linguistically debatable cases, rather than to actual annotation errors. Specifically, we show that even in the absence of annotation guidelines, only 2% of annotator choices are linguistically unmotivated.",
author = "Barbara Plank and Dirk Hovy and Anders S{\o}gaard",
year = "2014",
language = "English",
volume = "volume 2",
pages = "507--511",
booktitle = "Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)",
publisher = "Association for Computational Linguistics",

}

RIS

TY - GEN

T1 - Linguistically debatable or just plain wrong?

AU - Plank, Barbara

AU - Hovy, Dirk

AU - Søgaard, Anders

PY - 2014

Y1 - 2014

N2 - In linguistic annotation projects, we typically develop annotation guidelines to maximize inter-annotator agreement and learnability. However, in this position paper we question whether we should actually limit the disagreements between annotators, rather than embrace them. We present an empirical analysis of part-of-speech annotated data sets that suggests that certain disagreements are systematic across domains and languages. This points to an underlying ambiguity rather than random errors. Moreover, a quantitative analysis of disagreements reveals that the majority of them are due to linguistically debatable cases, rather than to actual annotation errors. Specifically, we show that even in the absence of annotation guidelines, only 2% of annotator choices are linguistically unmotivated.

AB - In linguistic annotation projects, we typically develop annotation guidelines to maximize inter-annotator agreement and learnability. However, in this position paper we question whether we should actually limit the disagreements between annotators, rather than embrace them. We present an empirical analysis of part-of-speech annotated data sets that suggests that certain disagreements are systematic across domains and languages. This points to an underlying ambiguity rather than random errors. Moreover, a quantitative analysis of disagreements reveals that the majority of them are due to linguistically debatable cases, rather than to actual annotation errors. Specifically, we show that even in the absence of annotation guidelines, only 2% of annotator choices are linguistically unmotivated.

M3 - Article in proceedings

VL - volume 2

SP - 507

EP - 511

BT - Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

PB - Association for Computational Linguistics

CY - Baltimore, Maryland

ER -

ID: 107673308