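Personal Information
Associate Professor
Master's Supervisor
Gender: Male
Alma Mater: Dalian University of Technology
Degree: Ph.D.
Affiliation: School of Computer Science and Technology
Office: Innovation Park Building, A0827
Contact: 18640987378
Email: wyx@dlut.edu.cn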
A Credit-Based Load-Balance-Aware CTA Scheduling Optimization Scheme in GPGPU
Paper Type: Journal Article
Publication Date: 2016-02-01
Journal: INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING
Indexed By: SCIE, EI
Volume: 44
Issue: 1, SI
Pages: 109-129
ISSN: 0885-7458
Keywords: GPGPU; CTA scheduler; Credit-based load-balance-aware scheduling scheme; Load balance
Abstract: GPGPUs improve computing performance through massive parallelism. The cooperative-thread-array (CTA) schedulers employed by current GPGPUs greedily issue CTAs to GPU cores as soon as resources become available, in pursuit of higher thread-level parallelism. Because of locality considerations in the memory controller, CTA execution time varies across cores, which leads to an imbalance in CTA issuance among the cores. This load imbalance leaves computing resources under-utilized and leaves an opportunity for further performance improvement. However, existing warp and CTA scheduling policies do not take load balance into account. We propose a credit-based load-balance-aware CTA scheduling optimization scheme (CLASO) piggybacked on a standard GPGPU scheduling system. CLASO uses credits to limit the number of CTAs issued to each core, avoiding both greedy issuance to faster-executing cores and starvation of the remaining cores. In addition, CLASO employs global credits and two tuning parameters, active levels and loose levels, to enhance load balance and robustness. Rather than being a standalone scheduling policy, CLASO is compatible with existing CTA and warp schedulers. Experiments on several representative benchmarks show that CLASO effectively improves load balance, reducing idle cycles by 52.4% on average and achieving up to 26.6% speedup over the GPGPU baseline scheduling policy.
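
The credit idea described in the abstract can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the CLASO algorithm from the paper: the function name `issue_ctas`, the `ready_events` list, and the fixed per-core budget `credits_per_core` are all hypothetical, and global credits, active levels, and loose levels are not modeled. It only shows how a per-core credit gate changes which cores receive CTAs compared with greedy issuance, and makes no claim about performance.

```python
def issue_ctas(num_ctas, ready_events, num_cores, credits_per_core=None):
    """Toy CTA issuance (hypothetical model, not the paper's implementation).

    ready_events: core ids in the order in which cores report free resources.
    credits_per_core: None models greedy issuance; an integer gives every
    core that many credits, and each issued CTA consumes one credit.
    Returns (CTAs issued per core, CTAs still waiting to be issued).
    """
    credits = None if credits_per_core is None else [credits_per_core] * num_cores
    issued = [0] * num_cores
    remaining = num_ctas
    for core in ready_events:
        if remaining == 0:
            break
        if credits is not None:
            if credits[core] == 0:
                continue  # out of credit: this core is skipped for now
            credits[core] -= 1
        issued[core] += 1
        remaining -= 1
    return issued, remaining


# Core 0 frees its resources more often than cores 1 and 2 (assumed trace).
events = [0, 0, 1, 0, 0, 2, 0, 1, 2, 1, 2]

greedy = issue_ctas(6, events, num_cores=3)
gated = issue_ctas(6, events, num_cores=3, credits_per_core=2)
print("greedy:", greedy)
print("credit-gated:", gated)
```

In this trace, greedy issuance hands most CTAs to the core that becomes ready most often, while the credit gate caps each core at its budget so the remaining CTAs go to the other cores.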