Digital Library

of the European Council for Modelling and Simulation



Data-Aware Scheduling In Massive Heterogeneous Systems


Magdalena Szmajduch, Joanna Kolodziej

Published in:



(2015).ECMS 2015 Proceedings edited by: Valeri M. Mladenov, Grisha Spasov, Petia Georgieva, Galidiya Petrova, European Council for Modeling and Simulation. doi:10.7148/2015



ISBN: 978-0-9932440-0-1


29th European Conference on Modelling and Simulation,

Albena (Varna), Bulgaria, May 26th – 29th, 2015


Citation format:

Magdalena Szmajduch, Joanna Kolodziej (2015). Data-Aware Scheduling In Massive Heterogeneous Systems, ECMS 2015 Proceedings edited by: Valeri M. Mladenov, Petia Georgieva, Grisha Spasov, Galidiya Petrova  European Council for Modeling and Simulation. doi:10.7148/2015-0601




Data-aware scheduling in large-scale heterogeneous computing systems remains a challenging research issue, especially in the era of Big Data. Design of all data-related components of the popular distributed environments, such as Data Clouds (DCs), Data Grids (DGs) and Data Centers supports the processing, analysis and monitoring of the big data generated by various sources at computing centers by the end-users, devices and services. The above facts leave no doubts that data scheduling must be integrated in a single joint process together with the scheduling of computer tasks and applications. Therefore, many of the current optimization issues need to be changed and new requirements have to be considered in the scheduling process. This includes data transmission times, data processing times, availability of the data servers, safety and authentication in the data access processes. This paper presents a new version of the Expected Time to Compute Matrix model (ETC Matrix) for the case of data-aware independent batch scheduling in physical network in DGs and DCs environments. Simple geneticbased schedulers have been developed for experimental justification of the significance of the presented problem.


Full text: