Exploiting Interference-aware GPU Container Concurrency Learning from Resource Usage of Application Execution

Sejin Kim; Yoonhee Kim

Summary

Asia-Pacific Network Operations and Management Symposium

2020

Session Number:TS8

Session:

Number:TS8-3

Exploiting Interference-aware GPU Container Concurrency Learning from Resource Usage of Application Execution

Sejin Kim, Yoonhee Kim,

pp.173-178

Publication Date:2020/9/22

Online ISSN:2188-5079

DOI:10.34385/proc.62.TS8-3

PDF download (495.1KB)

Summary:

The advent of GPGPU (General-Purpose Graphic Processing Unit) containers enlarges opportunities of acceleration and easy-to-use in clouds. However, there is still lack of research on utilizing efficiently GPU resource and managing multiple applications at the same time. Co-execution of applications without understanding applications' execution characteristics may result in low performance caused by their interference problems. To solve the problem, this paper defines resource metrics that causes performance degradation when sharing resource. We calculate the degree of interference during concurrent execution of multi applications using a ML (Machine Learning) method with the metrics. The experiments show that the execution of interference aware groups improves 7??/o in execution time compared to non-interference aware group in overall. For a workload consisting of several applications, the overall performance was improved by 18% and 25%, respectively, when compared to SJF and random.