Phương pháp song song khai phá tập lợi ích cao dựa trên chỉ số hình chiếu
Abstract
High utility itemsets (HUIs) mining is one of popular problems in data mining. Several parallel and sequential algorithms have been proposed in the literature to solve this problem. All the parallel algorithms to try reduce synchronization cost and caculation global profit of itemsets. In this paper, we present a parallel method for mining HUIs from projection-based indexing to speed up performance and reduce memory requirements. The experimental results show that the performance and number candidate of our algorithm is better than some non parallel algorithms.References
A. C.F. AND T. S.K, Efficient Tree Structures for Highutility Pattern Mining in Incremental Databases, 2009.
G. CHENG LAN, T. PEI HONG, AND V. S. TSENG, An efficient projection-based indexing approach for mining high utility itemsets, 2013.
Y. LIU, W. LIAO, AND A. CHOUDHARY, A Fast High Utility Itemsets Mining Algorithm, Proceedings of the 1st International Workshop on Utility-based Data Mining, New York, NY, USA, 2005, pp. 90–99.
M. LIU AND J. QU, Mining High Utility Itemsets Without Candidate Generation, Proceedings of the 21st ACM International Conference on Information and Knowledge Management, New York, NY, USA, 2012, pp. 55–64.
K. SUBRAMANIAN, P. KANDHASAMY, AND S. SUBRAMANIAN, A Novel Approach to Extract High Utility Itemsets from Distributed Databases, Comput. Inform., vol. 31, no. 6+, pp. 1597–1615, 2013.
V. S. TSENG, B.-E. SHIE, C.-W. WU, AND P. S. YU, Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases, IEEE Trans Knowl Data Eng, vol. 25, no. 8, pp. 1772–1786, Aug. 2013.
R. AGRAWAL AND R. SRIKANT, Fast Algorithms for Mining Association Rules in Large Databases, Proceedings of the 20th International Conference on Very Large Data Bases, 1994, pp. 487–499.
Y. LIU, W. LIAO, AND A. CHOUDHARY, A Two-phase Algorithm for Fast Discovery of High Utility Itemsets, Proceedings of the 9th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, Berlin, Heidelberg, 2005, pp. 689–695.
W. SONG, Y. LIU, AND J. LI, Vertical mining for high utility itemsets, 2012 IEEE International Conference on Granular Computing, 2012, pp. 429–434.
C. F. AHMED, S. K. TANBEER, B.-S. JEONG, AND Y.-K. LEE, HUC-Prune: an efficient candidate pruning technique to mine high utility patterns, Appl. Intell., vol. 34, no. 2, pp. 181–198, Apr. 2011.
V. S. TSENG, C.-W. WU, B.-E. SHIE, AND P. S. YU, UP-Growth: An Efficient Algorithm for High Utility Itemset Mining, Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 2010, pp. 253–262.
A. ERWIN, R. P. GOPALAN, AND N. R. ACHUTHAN, CTU-Mine: An Efficient High Utility Itemset Mining Algorithm Using the Pattern Growth Approach, 7th IEEE International Conference on Computer and Information Technology (CIT 2007), 2007, pp. 71–76.
D. PHONG AND N. HUNG, Một mô hình hiệu quả khai phá tập mục lợi ích cao, Các Công Trình Nghiên Cứu Phát Triển Và Ứng Dụng CNTT-TT, pp. 26–36, Jun. 2015.
A. ERWIN, R. P. GOPALAN, AND N. R. ACHUTHAN, Efficient Mining of High Utility Itemsets from Large Datasets, Advances in Knowledge Discovery and Data Mining, T. Washio, E. Suzuki, K. M. Ting, and A. Inokuchi, Eds. Springer Berlin Heidelberg, 2008, pp. 554–561.
B. VO, H. NGUYEN, T. B. HO, AND B. LE, Parallel Method for Mining High Utility Itemsets from Vertically Partitioned Distributed Databases, Proceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part I, Berlin, Heidelberg, 2009, pp. 251–260.