Liu, Hanze (2012) Improving web search personalization using Luhn-Inspired vector re-weighting. Masters thesis, Memorial University of Newfoundland.
PDF (Migrated (PDF/A Conversion) from original format: (application/pdf))
- Accepted Version
Available under License - The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission.
Web search personalization has been studied as a way to tailor Web search results to individual users based on their interests and preferences. Commonly, document and personalization profile features are stored in vector space models using measures such as term frequency (TF) and term frequency-inverse document frequency (TF*IDF). Inspired by Luhn's model of term importance, a novel approach is proposed in this thesis to identify and re-weight significant terms in the vector-based personalization models. Evaluations with a set of ambiguous queries illustrate that the order of the search results using this approach is superior to the TF approach and comparable to the TF*IDF approach. However, it is based only on the information stored in the personalization profiles, rather than requiring access to the distribution of each term across the document collection. As such, it can be applied more broadly when only limited information regarding the collection being searched is available.
|Item Type:||Thesis (Masters)|
|Additional Information:||Includes bibliographical references (leaves 121-130).|
|Department(s):||Science, Faculty of > Computer Science|
|Library of Congress Subject Heading:||Web search engines; World Wide Web--Subject access; Data mining; Web personalization;|
Actions (login required)