TY - JOUR
T1 - Utilizing inter-passage and inter-document similarities for reranking search results
AU - Krikon, Eyal
AU - Kurland, Oren
AU - Bendersky, Michael
N1 - Funding Information:
This work was supported by Key Project of National Natural Science Foundation of China (NSFC) under Grant no. 51636008, Excellent Young Scientist Fund of NSFC under Grant no. 51422606, the Newton Advanced Fellowship (NSFC: 51561130158; RS: NA140102), Key Research Program of Frontier Sciences, Chinese Academy of Science (CAS) under Grant no. QYZDB-SSW-JSC029, the Fok Ying-Tong Education Foundation under Grant no. 151056, Fundamental Research Funds for the Central Universities under Grant no. WK2320000035, and Natural Science Foundation of Guangdong Province under Grant no. 2014A030310190.
PY - 2010/12
Y1 - 2010/12
N2 - We present a novel language-model-based approach to reranking search results; that is, reordering the documents in an initially retrieved list so as to improve precision at top ranks. Our model integrates whole-document information with that induced from passages. Specifically, interpassage, inter-document, and query-based similarities, which constitute a rich source of information, are combined in our model. Empirical evaluation shows that the precision-at-top-ranks performance of our model is substantially better than that of the initial ranking upon which reranking is performed. Furthermore, the performance is substantially better than that of a commonly used passage-based document ranking method that does not exploit inter-item similarities. Our model also generalizes and outperforms a recently proposed reranking method that utilizes interdocument similarities, but which does not exploit passage-based information. Finally, the model's performance is superior to that of a state-of-the-art pseudo-feedback-based retrieval approach.
AB - We present a novel language-model-based approach to reranking search results; that is, reordering the documents in an initially retrieved list so as to improve precision at top ranks. Our model integrates whole-document information with that induced from passages. Specifically, interpassage, inter-document, and query-based similarities, which constitute a rich source of information, are combined in our model. Empirical evaluation shows that the precision-at-top-ranks performance of our model is substantially better than that of the initial ranking upon which reranking is performed. Furthermore, the performance is substantially better than that of a commonly used passage-based document ranking method that does not exploit inter-item similarities. Our model also generalizes and outperforms a recently proposed reranking method that utilizes interdocument similarities, but which does not exploit passage-based information. Finally, the model's performance is superior to that of a state-of-the-art pseudo-feedback-based retrieval approach.
KW - Ad hoc retrieval
KW - Document centrality
KW - Inter-document similarities
KW - Interpassage similarities
KW - Passage centrality
KW - Passage-based retrieval
KW - Reranking
UR - http://www.scopus.com/inward/record.url?scp=80051500892&partnerID=8YFLogxK
U2 - 10.1145/1877766.1877769
DO - 10.1145/1877766.1877769
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:80051500892
SN - 1046-8188
VL - 29
JO - ACM Transactions on Information Systems
JF - ACM Transactions on Information Systems
IS - 1
M1 - 3
ER -