Syntactic Query Models for Restatement Retrieval
Publication Date
2009
Journal or Book Title
STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS
Abstract
We consider the problem of retrieving sentence level restatements. Formally, we define restatements as sentences that contain all or some subset of information present in a query sentence. Identifying restatements is useful for several applications such as multi-document summarization, document provenance, text reuse and novelty detection. Spurious partial matches and term dependence become important issues for restatement retrieval in these settings. To address these issues, we focus on query models that capture relative term importance and sequential term dependence. In this paper, we build query models using syntactic information such as subject-verb-objects and phrases. Our experimental results on two different collections show that syntactic query models are consistently more effective than purely statistical alternatives.
DOI
https://doi.org/10.1007/978-3-642-03784-9_14
Pages
143-155
Volume
5721
Book Series Title
Lecture Notes in Computer Science
Recommended Citation
Balasubramanian, N and Allan, J, "Syntactic Query Models for Restatement Retrieval" (2009). STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS. 283.
https://doi.org/10.1007/978-3-642-03784-9_14