A flexible heuristic to schedule distributed analytic applications in compute clusters

Pace, Francesco; Venzano, Daniele; Carra, Damiano; Michiardi, Pietro
IEEE Transactions on Cloud Computing, 10 July 2019

This work addresses the problem of scheduling user-defined analytic applications, which we define as high-level compositions of frameworks, their components, and the logic necessary to carry out work. The key idea in our application definition is to distinguish classes of components, including core and elastic types: the former are required for an application to make progress, while the latter contribute to reduced execution times. We show that the problem of scheduling such applications poses new challenges, which existing approaches address inefficiently.

Data Science
