He Guo

Professor   Supervisor of Doctorate Candidates   Supervisor of Master's Candidates

Gender:Male

Alma Mater:大连理工大学

Degree:Master's Degree

School/Department:软件学院、国际信息与软件学院

Contact Information:guohe@dlut.edu.cn

E-Mail:guohe@dlut.edu.cn


Paper Publications

APR: A novel parallel repacking algorithm for efficient GPGPU parallel code transformation

Hits:

Indexed by:会议论文

Date of Publication:2014-03-01

Included Journals:EI、Scopus

Page Number:81-89

Abstract:General-purpose graphics processing units (GPGPU) brings an opportunity to improve the performance for many applications. However, exploiting parallelism is low productive in current programming frameworks such as CUDA and OpenCL. Programmers have to consider and deal with many GPGPU architecture details; therefore it is a challenge to trade off the programmability and the efficiency of performance tuning. Parallel Repacking (PR) is a popular performance tuning approach for GPGPU applications, which improves the performance by changing the parallel granularity. Existing code transformation algorithms using PR increase the productivity, but they do not cover adequate code patterns and do not give an effective code error detection. In this paper, we propose a novel parallel repacking algorithm (APR) to cover a wide range of code patterns and improve efficiency. We develop an efficient code model that expresses a GPGPU program as a recursive statement sequence, and introduces a concept of singular statement. APR building upon this model uses appropriate transformation rules for singular and non-singular statements to generate the repacked codes. A recursive transformation is performed when it encounters a branching/loop singular statement. Additionally, singular statements unify the transformation for barriers and data sharing, and enable APR to detect the barrier errors. The experiment results based on a prototype show that out proposed APR covers more code patterns than existing solutions such as the automatic thread coarsening in Crest, and the repacked codes using the APR achieve effective performance gain up to 3:28X speedup, in some cases even higher than manually tuned repacked codes. Copyright 2014 ACM.

Pre One:A new geometric descriptor for symbols with affine deformations

Next One:以度量分段约束为特征的形状匹配算法

Profile

教育背景:

  • 学士学位:吉林大学计算机系,1982

  • 硕士学位:大连理工大学计算机系,1989

科研与工作经历:

  • 198610月—198710月,新西兰Progeni Company,访问学者

  • 199010月—199212月,德国PDI Karlsruhe University计算机系,访问学者

  • 199212月—200712月,大连理工大学计算机系,副教授

  • 19953月—19966月,大连市金卡工程系统,总工程师

  •  

  • 20081月—今,大连理工大学软件学院,教授

  • 20204 退休

教学工作:

  • 1992年—2007年,计算机导论,计算机组织与结构,计算机系统结构

  • 2009年—2019年,存储技术,计算机系统结构,并行计算

科研:

  • 研究兴趣:并行与分布式计算。