Feng Lin (冯林)

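Personal Information

Professor

Doctoral Supervisor

Master's Supervisor

Gender: Male

Alma mater: Dalian University of Technology

Degree: Doctorate

Affiliation: School of Innovation and Entrepreneurship

Office: Room 402, School of Innovation and Entrepreneurship

Contact: 041184707111

Email: fenglin@dlut.edu.cn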

Publications

An efficient algorithm of frequent itemsets mining based on MapReduce

Paper type: Journal article

Publication date: 2014-05-20

Journal: Journal of Information and Computational Science

Indexed in: EI, Scopus

Volume: 11

Issue: 8

Pages: 2809-2816

ISSN: 1548-7741

Abstract: Mainstream parallel algorithms for mining frequent itemsets (patterns) implement the FP-Growth or Apriori algorithm on the MapReduce (MR) framework. Existing MR FP-Growth algorithms cannot distribute data evenly among nodes, while MR Apriori algorithms require multiple map/reduce passes and generate too many key-value pairs with a value of 1; these drawbacks limit their performance. This paper proposes an algorithm, FIMMR, which first mines the local frequent itemsets of each data chunk as candidates, applies pruning strategies to the candidates, and then identifies the global frequent itemsets among them. Experimental results show that FIMMR is significantly more time-efficient than PFP and SPC; under small minimum support thresholds, FIMMR achieves an order-of-magnitude improvement over the other two algorithms, and its speedup is also satisfactory. Copyright © 2014 Binary Information Press.
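
The two-phase idea in the abstract (local frequent itemsets as candidates, then a global check) can be illustrated with a minimal single-process sketch. This is not the paper's implementation: the function names `local_frequent_itemsets` and `mine_global` and the parameter `min_support_ratio` are hypothetical, the local miner is a brute-force stand-in, the pruning strategies are omitted because the abstract does not describe them, and plain loops stand in for the map and reduce steps.

```python
from collections import Counter
from itertools import combinations
from math import ceil


def local_frequent_itemsets(chunk, min_support_ratio):
    """Return the itemsets that are frequent within a single chunk.

    Brute-force enumeration of all subsets, used here only as a stand-in
    for a real local miner; it is exponential in transaction length.
    """
    threshold = ceil(min_support_ratio * len(chunk))
    counts = Counter()
    for transaction in chunk:
        items = sorted(set(transaction))
        for size in range(1, len(items) + 1):
            for itemset in combinations(items, size):
                counts[itemset] += 1
    return {itemset for itemset, n in counts.items() if n >= threshold}


def mine_global(chunks, min_support_ratio):
    """Two-phase mining over a list of chunks (hypothetical sketch)."""
    # Phase 1 (one "map" task per chunk): local frequent itemsets
    # become the global candidates. Any globally frequent itemset must
    # be locally frequent in at least one chunk, so none are missed.
    candidates = set()
    for chunk in chunks:
        candidates |= local_frequent_itemsets(chunk, min_support_ratio)

    # Phase 2 ("map" counts candidates per chunk, "reduce" sums them):
    # keep only candidates that meet the global support threshold.
    total = sum(len(chunk) for chunk in chunks)
    threshold = ceil(min_support_ratio * total)
    counts = Counter()
    for chunk in chunks:
        for transaction in chunk:
            items = set(transaction)
            for candidate in candidates:
                if items.issuperset(candidate):
                    counts[candidate] += 1
    return {c: n for c, n in counts.items() if n >= threshold}


chunks = [
    [["a", "b"], ["a", "c"]],
    [["a", "b"], ["b", "c"]],
]
result = mine_global(chunks, 0.5)
```

With the four sample transactions and a minimum support of 0.5, `("a", "c")` and `("b", "c")` are locally frequent candidates but fail the global check, which is the filtering role the second phase plays.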