The University of Dublin | Trinity College -- Ollscoil Átha Cliath | Coláiste na Tríonóide
Trinity's Access to Research Archive


Please use this identifier to cite or link to this item: http://hdl.handle.net/2262/41262

Title: DCU-TCD@LogCLEF 2010: Re-ranking Document Collections and Query Performance Estimation
Author: LEVELING, JOHANNES
GHORAB, MOHAMMED RAMI ELHUSSEIN
MAGDY, WALID
JONES, GARETH J. F.
WADE, VINCENT PATRICK
Sponsor: Science Foundation Ireland
Author's Homepage: http://people.tcd.ie/ghorabm
http://people.tcd.ie/vwade
Keywords: Result Re-ranking
Result Adaptation
Query Performance Estimation
Cross-Language Information Retrieval
Library Search
Multilingual Information Retrieval
Personalisation
Issue Date: 23-Sep-2010
Citation: Johannes Leveling, M. Rami Ghorab, Walid Magdy, Gareth J. F. Jones, Vincent Wade, DCU-TCD@LogCLEF 2010: Re-ranking Document Collections and Query Performance Estimation, 2010
Abstract: This paper describes the collaborative participation of Dublin City University and Trinity College Dublin in LogCLEF 2010. Two sets of experiments were conducted. First, different aspects of the TEL query logs were analysed after extracting user sessions of consecutive queries on a topic. The relation between the queries in a session, their length (number of terms), and their position (first query or further reformulations) was examined with respect to query performance estimators such as query scope, IDF-based measures, simplified query clarity score, and average inverse document collection frequency. Results of this analysis suggest that only some estimator values show a correlation with query length or position in the TEL logs (e.g. similarity score between collection and query). Second, the relation between three attributes was investigated: the user's country (detected from IP address), the query language, and the interface language. The investigation aimed to explore the influence of the three attributes on the user's collection selection. Moreover, the investigation involved assigning different weights to the three attributes in a scoring function that was used to re-rank the collections displayed to the user according to the language and country. The results of the collection re-ranking show a significant improvement in Mean Average Precision (MAP) over the original collection ranking of TEL. The results also indicate that the query language and interface language have more influence than the user's country on the collections selected by the users.
Description: PUBLISHED
URI: http://hdl.handle.net/2262/41262
Related links: http://www.clef2010.org/resources/proceedings/clef2010labs_submission_77.pdf
http://www.clef2010.org/index.php?page=pages/proceedings.php
Appears in Collections: Computer Science (Scholarly Publications)

Files in This Item:

File: DCU-TCD@LogCLEF 2010 Re-ranking Document Collections and Query Performance Estimation.pdf
Description: Published (author's copy) - Non-Peer Reviewed
Size: 480.1 kB
Format: Adobe PDF


This item is protected by original copyright



Items in TARA are protected by copyright, with all rights reserved, unless otherwise indicated.

 
