What I think when I think about treebanks

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

In this opinion piece, I present four somewhat controversial suggestions for the design of futuretreebanks: a) Treebanks should be based on adversarial samples, rather than pseudorepresentativesamples. b) Treebanks should include multiple splits of the data, rather than justa single split, as in most treebanks today. c) They should include multiple annotations of eachsentence, whenever possible, instead of adjudicated annotations. d) There is no real motivationfor adhering to a notion of well-formedness, since we now have parsers based on deep learningthat generalize easily and perform well on any type of graphs, and treebanks therefore do not haveto limit themselves to trees or directed acyclic graphs.
OriginalsprogEngelsk
TitelProceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT16),
ForlagAssociation for Computational Linguistics
Publikationsdato2018
Sider161-166
StatusUdgivet - 2018
Begivenhed16th International Workshop on Treebanks and Linguistic Theories (TLT16) - Prague, Tjekkiet
Varighed: 23 jan. 201824 jan. 2018

Konference

Konference16th International Workshop on Treebanks and Linguistic Theories (TLT16)
LandTjekkiet
ByPrague
Periode23/01/201824/01/2018

Links

ID: 214752172