location: Current position: Home >> Scientific Research >> Paper Publications

AT-mine: An efficient algorithm of frequent itemset mining on uncertain dataset

Hits:

Indexed by:期刊论文

Date of Publication:2013-01-01

Journal:Journal of Computers (Finland)

Included Journals:EI、Scopus

Volume:8

Issue:6

Page Number:1417-1426

ISSN No.:1796203X

Abstract:Frequent itemset/pattern mining (FIM) over uncertain transaction dataset is a fundamental task in data mining. In this paper, we study the problem of FIM over uncertain datasets. There are two main approaches for FIM: the level-wise approach and the pattern-growth approach. The level-wise approach requires multiple scans of dataset and generates candidate itemsets. The pattern-growth approach requires a large amount of memory and computation time to process tree nodes because the current algorithms for uncertain datasets cannot create a tree as compact as the original FP-Tree. In this paper, we propose an array based tail node tree structure (namely AT-Tree) to maintain transaction itemsets, and a pattern-growth based algorithm named AT-Mine for FIM over uncertain dataset. AT-Tree is created by two scans of dataset and it is as compact as the original FP-Tree. AT-Mine mines frequent itemsets from AT-Tree without additional scan of dataset. We evaluate our algorithm using sparse and dense datasets; the experimental results show that our algorithm has achieved better performance than the state-of-the-art FIM algorithms on uncertain transaction datasets, especially for small minimum expected support number. ? 2013 ACADEMY PUBLISHER.

Pre One:我国高校书院制与美国高校住宿学院制学生管理模式的比较研究

Next One:Sliding Window-based Frequent Itemsets Mining over Data Streams using Tail Pointer Table