Similarity Aware Shuffling for the Distributed Execution of SQL Window Functions

Fábio Coelho; Miguel Matos; José Pereira; Rui Oliveira

doi:10.1007/978-3-319-59665-5_1

Conference Papers Year : 2017

Fábio Coelho

Function : Author
PersonId : 1032381

Universidade do Minho = University of Minho [Braga]

Miguel Matos

Function : Author
PersonId : 1032391

IST / Técnico Lisboa - Instituto Superior Técnico (Universidade de Lisboa) [Lisboa]

INESC-ID - Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento em Lisboa

José Pereira

Function : Author
PersonId : 968822

Universidade do Minho = University of Minho [Braga]

Rui Oliveira

Function : Author
PersonId : 998185

Universidade do Minho = University of Minho [Braga]

Abstract

Window functions are extremely useful and have become increasingly popular, allowing ranking, cumulative sums and other analytic aggregations to be computed over a highly flexible and configurable sliding window. This expressiveness naturally comes at the expense of heavy computational requirements which, so far, have been addressed through optimizations of centralized approaches in both industry and academia. Distribution and parallelization have the potential to improve performance, but introduce several challenges associated with data distribution that may harm data locality. In this paper, we show how data similarity can be employed across partitions during the distributed execution of these operators to improve data co-locality between instances of a Distributed Query Engine and the associated data storage nodes. Our contribution attains network gains of 3 times on average and is expected to scale as the number of instances increases. In the scenario with 8 nodes, we were able to attain bandwidth and time savings of 7.3 times and 2.61 times, respectively.
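To make the abstract's two ideas concrete, here is a minimal Python sketch. It first emulates a cumulative-sum window function per partition, then counts how many rows would have to cross the network under two ways of assigning partitions to processing nodes. The rows, the function names, and the "send each partition to the node that already stores most of its rows" heuristic are all assumptions chosen for illustration; this is not the similarity measure or the algorithm from the paper, and it ignores load balancing.

```python
from collections import Counter, defaultdict
from itertools import accumulate

# Hypothetical rows: (storage_node, partition_key, value). All data is made up.
ROWS = [
    (1, "a", 5), (1, "a", 3), (0, "a", 7),
    (2, "b", 2), (2, "b", 9), (1, "b", 4),
    (0, "c", 1), (0, "c", 6), (0, "c", 8),
]

def cumulative_sum_per_partition(rows):
    """Emulate SUM(value) OVER (PARTITION BY key ORDER BY value).

    Values are distinct within each partition here, so SQL peer-row
    semantics do not change the result.
    """
    groups = defaultdict(list)
    for _node, key, value in rows:
        groups[key].append(value)
    return {key: list(accumulate(sorted(vals))) for key, vals in groups.items()}

def rows_moved(rows, assignment):
    """Count rows stored on a node other than the one processing their partition."""
    return sum(1 for node, key, _value in rows if assignment[key] != node)

def round_robin_assignment(rows, n_nodes):
    """Locality-oblivious baseline: spread sorted partition keys over the nodes."""
    keys = sorted({key for _node, key, _value in rows})
    return {key: i % n_nodes for i, key in enumerate(keys)}

def locality_aware_assignment(rows):
    """Assumed heuristic: process each partition where most of its rows already live."""
    counts = defaultdict(Counter)
    for node, key, _value in rows:
        counts[key][node] += 1
    return {key: per_node.most_common(1)[0][0] for key, per_node in counts.items()}

if __name__ == "__main__":
    print(cumulative_sum_per_partition(ROWS))
    print("round robin:", rows_moved(ROWS, round_robin_assignment(ROWS, 3)))
    print("locality aware:", rows_moved(ROWS, locality_aware_assignment(ROWS)))
```

On this toy input the baseline moves 7 of the 9 rows, while the locality-aware assignment moves 2. The gap depends entirely on how the made-up rows are laid out, so it says nothing about the gains reported in the abstract.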

Main file

450046_1_En_1_Chapter.pdf (808.1 KB)

Origin : Files produced by the author(s)
Licence : CC BY 4.0 - Attribution

https://inria.hal.science/hal-01800128

Submitted on : Friday, May 25, 2018, 3:17:47 PM

Last modification on : Thursday, December 18, 2025, 8:52:03 AM

Long-term archiving on : Sunday, August 26, 2018, 1:53:53 PM

Dates and versions

hal-01800128 , version 1 (25-05-2018)

Licence

CC BY 4.0 - Attribution

Identifiers

HAL Id : hal-01800128 , version 1
DOI : 10.1007/978-3-319-59665-5_1

Cite

Fábio Coelho, Miguel Matos, José Pereira, Rui Oliveira. Similarity Aware Shuffling for the Distributed Execution of SQL Window Functions. 17th IFIP International Conference on Distributed Applications and Interoperable Systems (DAIS), Jun 2017, Neuchâtel, Switzerland. pp.3-18, ⟨10.1007/978-3-319-59665-5_1⟩. ⟨hal-01800128⟩
