Lost in translation: Authorship attribution using frame semantics

Publication: Contribution to book/anthology/report › Conference contribution in proceedings › Research › peer-reviewed

Steffen Hedegaard
Jakob Grue Simonsen

We investigate authorship attribution using classifiers based on frame semantics. The purpose is to discover whether adding semantic information to lexical and syntactic methods for authorship attribution will improve them, specifically to address the difficult problem of authorship attribution of translated texts. Our results suggest (i) that frame-based classifiers are usable for author attribution of both translated and untranslated texts; (ii) that frame-based classifiers generally perform worse than the baseline classifiers on untranslated texts, but (iii) perform as well as, or better than, the baseline classifiers on translated texts; and (iv) that, contrary to current belief, naïve classifiers based on lexical markers may perform tolerably on translated texts if the combination of author and translator is present in the training set of a classifier.

Original language	English
Title	ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
Number of pages	6
Publication date	1 Dec 2011
Pages	65-70
ISBN (Print)	9781932432886
Status	Published - 1 Dec 2011
Event	49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011 - Portland, OR, USA. Duration: 19 Jun 2011 → 24 Jun 2011

Conference

Conference	49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, ACL-HLT 2011
Country	USA
City	Portland, OR
Period	19/06/2011 → 24/06/2011
Sponsor	Google, Baidu, Microsoft Research, Pacific Northwest National Laboratory, Yahoo

Name	ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
Volume	2

ID: 224020667

Department of Computer Science (Datalogisk Institut)
