Technical Reports - Query Results

Your query term was 'number = 2014-02'
1 report found
OFAI-TR-2014-02 ( 344kB PDF file)

Improving Neighborhood-Based Collaborative Filtering by Reducing Hubness

Peter Knees, Dominik Schnitzer, Arthur Flexer

For recommending multimedia items, collaborative filtering (CF) denotes the technique of automatically predicting a user's rating or preference for an item by exploiting item preferences of a (large) group of other users. In traditional memory-based (or neighborhood-based) recommenders, this is accomplished by, first, selecting a number of similar users (or items) and, second, combining their ratings into a single user's predicted rating for an item. Strategies for both defi ning similarity (i.e., to identify nearest neighbors) and for combining ratings (i.e., to weight their impact) have been extensively studied and even resulted in inconsistent findings. In this paper, we investigate the eff ects of the high dimensionality of useritem matrices on the quality of memorybased movie rating prediction. By examining several publicly available real-world CF data sets, we show that the step of nearest neighbor selection is a ffected by the phenomena of similarity concentration and hub occurrence due to highdimensional data spaces and the class of similarity measures used. To mitigate this, we adapt a normalization technique called mutual proximity that has been shown to reduce these e ffects in classi cation tasks. Finally, we show that removing hubs and incorporating normalized similarity values into the neighbor weighting step leads to increased rating prediction accuracy, observable on all examined data sets in terms of lowered error measure (RMSE).

Keywords: Hubness, Collaborative Filtering, Concentration of distances,

Citation: Knees P., Schnitzer D., Flexer A.: Improving Neighborhood-Based Collaborative Filtering by Reducing Hubness, ACM International Conference on Multimedia Retrieval (ICMR), 2014.