刘晓东

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:男

毕业院校:东北大学

学位:博士

所在单位:控制科学与工程学院

学科:应用数学. 应用数学. 控制理论与控制工程

办公地点:创新园大厦A0620

联系方式:电话: (+86-411) 84726020 (home) (+86-411) 84709380 (Office) 传真: (+86-411) 84707579 手机: (+86-411) 13130042458

电子邮箱:xdliuros@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

A parallel tree node splitting criterion for fuzzy decision trees

点击次数:

论文类型:期刊论文

发表时间:2019-09-10

发表刊物:CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE

收录刊物:SCIE、EI

卷号:31

期号:17

ISSN号:1532-0626

关键字:fuzzy decision trees; fuzzy information gain; MapReduce; parallel computing

摘要:Fuzzy decision trees are one of the most important extensions of decision trees for symbolic knowledge acquisition by fuzzy representation. Many fuzzy decision trees employ fuzzy information gain as a measure to construct the tree node splitting criteria. These criteria play a critical role in the construction of decision trees. However, many of the criteria can only work well on small-scale or medium-scale data sets, and cannot directly deal with large-scale data sets on the account of some limiting factors such as memory capacity, execution time, and data complexity. Parallel computing is one way to overcome these problems; in particular, MapReduce is one mainstream solution of parallel computing. In this paper, we design a parallel tree node splitting criterion (MR-NSC) based on fuzzy information gain via MapReduce, which is completed equivalent to the traditional unparallel splitting rule. The experimental studies verify the equivalency between the proposed MR-NSC algorithm and the traditional unparallel way through 22 UCI benchmark data sets. Furthermore, the feasibility and parallelism are also studied on two large-scale data sets.