WikiBank: Using wikidata to improve multilingual frame-semantic parsing

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Frame-semantic annotations exist for a tiny fraction of the world's languages, Wikidata, however, links knowledge base triples to texts in many languages, providing a common, distant supervision signal for semantic parsers. We present WIKIBANK, a multilingual resource of partial semantic structures that can be used to extend pre-existing resources rather than creating new man-made resources from scratch. We also integrate this form of supervision into an off-the-shelf frame-semantic parser and allow cross-lingual transfer. Using Google's SLING architecture, we show significant improvements on the English and Spanish CoNLL 2009 datasets, whether training on the full available datasets or small subsamples thereof.

OriginalsprogEngelsk
TitelLREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
RedaktørerNicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
ForlagEuropean Language Resources Association (ELRA)
Publikationsdato2020
Sider4183-4189
ISBN (Elektronisk)9791095546344
StatusUdgivet - 2020
Begivenhed12th International Conference on Language Resources and Evaluation, LREC 2020 - Marseille, Frankrig
Varighed: 11 maj 202016 maj 2020

Konference

Konference12th International Conference on Language Resources and Evaluation, LREC 2020
LandFrankrig
ByMarseille
Periode11/05/202016/05/2020
SponsorAmazon AWS, Bertin, Lenovo, Ontotex, Vecsys, Vocapia

ID: 258332560