HetExchange: Encapsulating heterogeneous CPU-GPU parallelism in JIT compiled engines

Chrysogelos, Periklis; Karpathiotakis, Manos; Appuswamy, Raja; Ailamaki, Anastasia
VLDB 2019, 45th International Conference on Very Large Data Bases, August 26-30, 2019, Los Angeles, California, USA

Modern server hardware is increasingly heterogeneous as hardware accelerators, such as GPUs, are used together with multicore CPUs to meet the computational demands of modern data analytics workloads.Unfortunately, query parallelization techniques used by analytical database engines are designed for homogeneous multicore servers, where query plans are parallelized across CPUs to process data stored in cache coherent shared memory. Thus, these techniques are unable to fully exploit available heterogeneous hardware, where one needs to exploit task-parallelism of CPUs and data-parallelism of GPUs for processing data stored in a deep, noncache-coherent memory hierarchy with widely varying access latencies and bandwidth. In this paper, we introduce HetExchange-a parallel query execution framework that encapsulates the heterogeneous parallelism of modern multi-CPU-multi-GPU servers and enables the parallelization of (pre-)existing sequential relational operators. In contrast to the interpreted nature of traditional Exchange, HetExchange is designed to be used in conjunction with JIT compiled engines in order to allow a tight integration with the proposed operators and generation of efficient code for heterogeneous hardware. We validate the applicability and efficiency of our design by building a prototype that can operate over both CPUs and GPUs, and enables its operators to be parallelism- and data-location-agnostic. In doing so, we show that efficiently exploiting CPU-GPU parallelism can provide 2.8x and 6.4x improvement in performance than state-ofthe-art CPU-based and GPU-based DBMS.

Los Angeles
Data Science
Eurecom Ref:
© ACM, 2019. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in VLDB 2019, 45th International Conference on Very Large Data Bases, August 26-30, 2019, Los Angeles, California, USA http://dx.doi.org/10.14778/3303753.3303760
See also:

PERMALINK : https://www.eurecom.fr/publication/5781