Rhetorical relations for information retrieval

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Typically, every part in most coherent text has some plausible reason for its presence, some function that it performs to the overall semantics of the text. Rhetorical relations, e.g. contrast, cause, explanation, describe how the parts of a text are linked to each other. Knowledge about this so-called discourse structure has been applied successfully to several natural language processing tasks. This work studies the use of rhetorical relations for Information Retrieval (IR): Is there a correlation between certain rhetorical relations and retrieval performance? Can knowledge about a document’s rhetorical relations be useful to IR?

We present a language model modification that considers rhetorical relations when estimating the relevance of a document to a query. Empirical evaluation of different versions of our model on TREC settings shows that certain rhetorical relations can benefit retrieval effectiveness notably (> 10%
in mean average precision over a state-of-the-art baseline).
OriginalsprogEngelsk
TitelProceedings of the 35th international ACM SIGIR conference on Research and Development in Information Retrieval
Antal sider10
ForlagAssociation for Computing Machinery
Publikationsdato2012
Sider931-940
ISBN (Elektronisk)978-1-4503-1472-5
DOI
StatusUdgivet - 2012
Begivenhed35th International ACM SIGIR Conference on Research and Development in Information Retrieval - Oregon, USA
Varighed: 12 aug. 201216 aug. 2012
Konferencens nummer: 35

Konference

Konference35th International ACM SIGIR Conference on Research and Development in Information Retrieval
Nummer35
LandUSA
ByOregon
Periode12/08/201216/08/2012

ID: 38240033