%0 Conference Proceedings %T Similarity Aware Shuffling for the Distributed Execution of SQL Window Functions %+ Universidade do Minho = University of Minho [Braga] %+ Instituto Superior Técnico, Universidade Técnica de Lisboa (IST) %+ Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa (INESC-ID) %A Coelho, Fábio %A Matos, Miguel %A Pereira, José %A Oliveira, Rui %Z Part 1: Running System Efficiently (Distributed System) %< avec comité de lecture %( Lecture Notes in Computer Science %B 17th IFIP International Conference on Distributed Applications and Interoperable Systems (DAIS) %C Neuchâtel, Switzerland %Y Lydia Y. Chen %Y Hans Reiser %I Springer International Publishing %3 Distributed Applications and Interoperable Systems %V LNCS-10320 %P 3-18 %8 2017-06-19 %D 2017 %R 10.1007/978-3-319-59665-5_1 %Z Computer Science [cs] %Z Computer Science [cs]/Networking and Internet Architecture [cs.NI]Conference papers %X Window functions are extremely useful and have become increasingly popular, allowing ranking, cumulative sums and other analytic aggregations to be computed over a highly flexible and configurable sliding window. This powerful expressiveness comes naturally at the expense of heavy computational requirements which, so far, have been addressed through optimizations around centralized approaches by works both from the industry and academia. Distribution and parallelization has the potential to improve performance, but introduces several challenges associated with data distribution that may harm data locality. In this paper, we show how data similarity can be employed across partitions during the distributed execution of these operators to improve data co-locality between instances of a Distributed Query Engine and the associated data storage nodes. Our contribution can attain network gains in the average of 3 times and it is expected to scale as the number of instances increase. In the scenario with 8 nodes, we were to able attain bandwidth and time savings of 7.3 times and 2.61 times respectively. %G English %Z TC 6 %Z WG 6.1 %2 https://inria.hal.science/hal-01800128/document %2 https://inria.hal.science/hal-01800128/file/450046_1_En_1_Chapter.pdf %L hal-01800128 %U https://inria.hal.science/hal-01800128 %~ IFIP-LNCS %~ IFIP %~ IFIP-TC %~ IFIP-WG %~ IFIP-TC6 %~ IFIP-WG6-1 %~ IFIP-DAIS %~ IFIP-DISCOTEC %~ IFIP-LNCS-10320