%0 Conference Proceedings
%T User-Centric vs. System-Centric Evaluation of Recommender Systems
%+ Politecnico di Milano [Milan] (POLIMI)
%+ ContentWise [Milan]
%A Cremonesi, Paolo
%A Garzotto, Franca
%A Turrin, Roberto
%Z Part 1: Long and Short Papers (Continued)
%< peer-reviewed
%( Lecture Notes in Computer Science
%B 14th International Conference on Human-Computer Interaction (INTERACT)
%C Cape Town, South Africa
%Y Paula Kotzé
%Y Gary Marsden
%Y Gitte Lindgaard
%Y Janet Wesson
%Y Marco Winckler
%I Springer
%3 Human-Computer Interaction – INTERACT 2013
%V LNCS-8119
%N Part III
%P 334-351
%8 2013-09-02
%D 2013
%R 10.1007/978-3-642-40477-1_21
%K Recommender systems
%K E-tourism
%K Evaluation
%K Decision Making
%Z Computer Science [cs]Conference papers
%X Recommender Systems (RSs) aim at helping users search large amounts of content and identify more effectively the items (products or services) that are likely to be more useful or attractive. The quality of an RS can be defined from two perspectives: system-centric, in which quality measures (e.g., precision, recall) are evaluated using vast datasets of preferences and opinions on items previously collected from users that are not interacting with the RS under study; and user-centric, in which user measures are collected from users interacting with the RS under study. Prior research in e-commerce has provided some empirical evidence that system-centric and user-centric quality methods may lead to inconsistent results, e.g., RSs that were “best” according to system-centric measures were not the top ones according to user-centric measures. The paper investigates whether a similar mismatch also exists in the domain of e-tourism. We discuss two studies that have adopted a system-centric approach using data from 210,000 users, and a user-centric approach involving 240 users interacting with an online hotel booking service. 
In both studies, we considered four RSs that employ an implicit user preference elicitation technique and different baseline and state-of-the-art recommendation algorithms. In these four experimental conditions, we compared system-centric quality measures against user-centric evaluation results. System-centric quality measures were consistent with user-centric measures, in contrast with past studies in e-commerce. This suggests that the relationship between the two kinds of metrics may depend on the business sector, is more complex than we may expect, and is a challenging issue that deserves further research.
%G English
%Z TC 13
%2 https://inria.hal.science/hal-01504894/document
%2 https://inria.hal.science/hal-01504894/file/978-3-642-40477-1_21_Chapter.pdf
%L hal-01504894
%U https://inria.hal.science/hal-01504894
%~ IFIP-LNCS
%~ IFIP
%~ IFIP-AICT
%~ IFIP-TC
%~ IFIP-TC13
%~ IFIP-INTERACT
%~ IFIP-LNCS-8119