%0 Conference Proceedings
%T DIsCO: DynamIc Data COmpression in Distributed Stream Processing Systems
%+ Athens University of Economics and Business (AUEB)
%A Zacheilas, Nikos
%A Kalogeraki, Vana
%Z Part 1: Running System Efficiently (Distributed System)
%< peer-reviewed
%( Lecture Notes in Computer Science
%B 17th IFIP International Conference on Distributed Applications and Interoperable Systems (DAIS)
%C Neuchâtel, Switzerland
%Y Lydia Y. Chen
%Y Hans Reiser
%I Springer International Publishing
%3 Distributed Applications and Interoperable Systems
%V LNCS-10320
%P 19-33
%8 2017-06-19
%D 2017
%R 10.1007/978-3-319-59665-5_2
%Z Computer Science [cs]
%Z Computer Science [cs]/Networking and Internet Architecture [cs.NI]
%Z Conference papers
%X Supporting high throughput in Distributed Stream Processing Systems (DSPSs) has been an important goal in recent years. Existing work either automatically increases system resources whenever the current setup is inadequate or applies load-shedding techniques that discard some of the incoming data. However, both approaches have significant shortcomings: the former requires on-the-fly application reconfiguration, in which the application must be stopped and re-uploaded to the cluster with the new configuration, while the latter can lead to significant information loss. One approach that has not yet been considered for improving the throughput of DSPSs is exploiting compression algorithms to minimize the communication overhead between components, especially when the data are large, such as live CCTV camera reports. This work is the first to provide a framework, built on top of Apache Storm, that enables dynamic compression of incoming streaming data. Our approach uses a profiling algorithm to automatically determine which compression algorithm should be applied and supports both lossless and lossy compression techniques.
Furthermore, we propose a novel algorithm for determining when profiling should be applied. Finally, our detailed experimental evaluation with commonly used stream processing applications indicates a clear improvement in the applications’ throughput when our proposed techniques are applied.
%G English
%Z TC 6
%Z WG 6.1
%2 https://inria.hal.science/hal-01800129/document
%2 https://inria.hal.science/hal-01800129/file/450046_1_En_2_Chapter.pdf
%L hal-01800129
%U https://inria.hal.science/hal-01800129
%~ IFIP-LNCS
%~ IFIP
%~ IFIP-TC
%~ IFIP-WG
%~ IFIP-TC6
%~ IFIP-WG6-1
%~ IFIP-DAIS
%~ IFIP-DISCOTEC
%~ IFIP-LNCS-10320