Ansatte – Københavns Universitet

Integrative dynamic reconfiguration in a parallel stream processing engine

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Dokumenter

  • pdf

    Indsendt manuskript, 347 KB, PDF-dokument

Kasper Grud Skat Madsen, Yongluan Zhou, Jianneng Cao

Load balancing, operator instance collocations and horizontal scaling are critical issues in Parallel Stream Processing Engines to achieve low data processing latency, optimized cluster utilization and minimized communication cost respectively. In previous work, these issues are typically tackled separately and independently. We argue that these problems are tightly coupled in the sense that they all need to determine the allocations of workloads and migrate computational states at runtime. Optimizing them independently would result in suboptimal solutions. Therefore, in this paper, we investigate how these three issues can be modeled as one integrated optimization problem. In particular, we first consider jobs where workload allocations have little effect on the communication cost, and model the problem of load balance as a Mixed-Integer Linear Program. Afterwards, we present an extended solution called ALBIC, which support general jobs. We implement the proposed techniques on top of Apache Storm, an open-source Parallel Stream Processing Engine. The extensive experimental results over both synthetic and real datasets show that our techniques clearly outperform existing approaches.
OriginalsprogEngelsk
TitelProceedings of the 33rd IEEE International Conference on Data Engineering (ICDE)
Antal sider4
ForlagIEEE Press
Publikationsdato2017
Sider227-230
ISBN (Trykt)978-1-5090-6544-8
ISBN (Elektronisk)978-1-5090-6543-1
DOI
StatusUdgivet - 2017
Eksternt udgivetJa
Begivenhed33rd IEEE International Conference on Data Engineering - San Diego, USA
Varighed: 19 apr. 201722 apr. 2017
Konferencens nummer: 33

Konference

Konference33rd IEEE International Conference on Data Engineering
Nummer33
LandUSA
BySan Diego
Periode19/04/201722/04/2017

Links

Antal downloads er baseret på statistik fra Google Scholar og www.ku.dk


Ingen data tilgængelig

ID: 179278061