Method for Mining High-utility Patterns in Transaction Stream Data based on Linked List Structure

Minh-Thai Tran; Anh-Duy Tran; Duc-Thanh Pham; Minh-Nguyen Le

doi:10.32913/mic-ict-research.v2024.n2.1249

Minh-Thai Tran ĐẠI HỌC NGOẠI NGỮ - TIN HỌC TP. HỒ CHÍ MINH
Anh-Duy Tran
Duc-Thanh Pham
Minh-Nguyen Le

DOI: https://doi.org/10.32913/mic-ict-research.v2024.n2.1249

Keywords: transaction stream data, data mining, high-utility item set

Abstract

Mining valuable patterns in data streams presents
a significant challenge in the field of data mining. This
task is crucial as it allows for the identification of highly
profitable item sets within transaction databases. However, as
new transactions are continually added, new valuable patterns
emerge, thus changing the usefulness of previously analyzed
data. It is essential to promptly update information regarding
these changes to enable effective business decision-making.
Consequently, existing mining methods applied to transaction
flow datasets require considerable time to identify new
patterns and update information related to new transactions.
This article focuses on the research and proposal of a new
transaction stream data mining method called High-Utility
Stream Linked-List Mining. The method utilizes a linked
list structure known as the High-Utility Stream Linked List
(HUSLL) to store information about patterns in the database.
Mining and updating transaction information are directly
performed on the HUSLL structure. Experimental results
demonstrate that this novel mining method exhibits more
efficient execution times compared to previous solutions.

Author Biographies

Minh-Thai Tran, ĐẠI HỌC NGOẠI NGỮ - TIN HỌC TP. HỒ CHÍ MINH

Dr. Tran Minh Thai received a Bachelor’s degree in 2001 and a Master’s degree in Computer Science in 2006, both from the University of Natural Sciences, Vietnam National University, HCMC. In 2017, he was awarded a Ph.D. in Computer Science from Vietnam National University, HCMC. From 2002 to 2015, he served as a lecturer
and manager of the Information Technology Department at the Information Technology College HCMC. Since 2015, he has been a lecturer and Head of the Information Systems Department of the Faculty of Information Technology at the HCMC University of Foreign Languages- Information Technology. Dr. Thai’s primary
research interests lie in data mining, data privacy, big data processing, and pattern recognition.

Anh-Duy Tran

Anh-Duy Tran received a Master’s degree in Computer Science from the University of Natural Sciences, Vietnam National University, HCMC in 2017. He serves as a Lecturer at the Department of Information
Technology at the University of Foreign Languages and Information Technology in Ho Chi Minh City. His contributions to the field of data mining aim to empower businesses and organizations to make data-driven decisions, fostering innovation and advance ments in various domains.

Duc-Thanh Pham

Duc-Thanh Pham received his Master’s degree in 2006 from Ho Chi Minh City National University; Currently working as a Lecturer at the Faculty of Information Technology, Ho Chi Minh City University of Foreign Languages and Information Technology; The research field of interestis: data mining

Minh-Nguyen Le

Minh-Nguyen Le received his Master’s degree in 2007 from Ho Chi Minh City National University; Currently working as a Lecturer at the Faculty of Information Technology, Ho Chi Minh City University of Foreign Languages and Information Technology; The research field of interest is: data mining.

References

R. Agrawal, R. Srikant et al., “Fast algorithms for mining association rules,” in Proc. 20th int. conf. very large data bases, VLDB, vol. 1215. Santiago, Chile, 1994, pp. 487–499.

P. Fournier-Viger, J. Chun-Wei Lin, T. Truong-Chi, and R. Nkambou, “A survey of high-utility itemset mining,”

High-utility pattern mining: Theory, algorithms and applications, pp. 1–45, 2019.

S. Krishnamoorthy, “Hminer: Efficiently mining high-utility itemsets,” Expert Systems with Applications, vol. 90, pp. 168–183, 2017.

S. Zida, P. Fournier-Viger, J. C.-W. Lin, C.-W. Wu, and V. S. Tseng, “Efim: a fast and memory efficient algorithm

for high-utility itemset mining,” Knowledge and Information Systems, vol. 51, no. 2, pp. 595–625, 2017.

M. K. Sohrabi, “An efficient projection-based method for high utility itemset mining using a novel pruning approach on the utility matrix,” Knowledge and Information Systems, vol. 62, pp. 4141–4167, 2020.

P. Wu, X. Niu, P. Fournier-Viger, C. Huang, and B. Wang, “Ubp-miner: An efficient bit based high utility itemset

mining algorithm,” Knowledge-Based Systems, vol. 248, p. 108865, 2022.

J. C.-W. Lin, W. Gan, T.-P. Hong, B. Zhang et al., “An incremental high-utility mining algorithm with transaction

insertion,” The Scientific World Journal, vol. 2015, 2015.

P. Fournier-Viger, J. C.-W. Lin, T. Gueniche, and P. Barhate, “Efficient incremental high-utility itemset mining,” in Proceedings of the ASE BigData & SocialInformatics 2015, 2015, pp. 1–6.

U. Yun, H. Ryang, G. Lee, and H. Fujita, “An efficient algorithm for mining high-utility patterns from incremental databases with one database scan,” Knowledge-Based Systems, vol. 124, pp. 188–206, 2017.

H.-F. Li, H.-Y. Huang, Y.-C. Chen, Y.-J. Liu, and S.-Y. Lee, “Fast and memory efficient mining of high-utility itemsets in data streams,” in 2008 eighth IEEE international conference on data mining. IEEE, 2008, pp. 881–886.

C. F. Ahmed, S. K. Tanbeer, and B.-S. Jeong, “Efficient mining of high-utility patterns over data streams with a

sliding window method,” Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing 2010, pp. 99–113, 2010.

J. Han, J. Pei, and Y. Yin, “Mining frequent patterns without candidate generation,” ACM sigmod record, vol. 29, no. 2, pp. 1–12, 2000.

G. Pyun, U. Yun, and K. H. Ryu, “Efficient frequent pattern mining based on linear prefix tree,” Knowledge-Based Systems, vol. 55, pp. 125–139, 2014.

Y. Mao, B. Wu, Q. Deng, S. Mahmoodi, Z. Chen, and Y.-C. Chen, “A novel parallel frequent itemset mining algorithm for automatic enterprise,” Enterprise Information Systems, p. 2204317, 2023.

J. Lu, W. Xu, K. Zhou, and Z. Guo, “Frequent itemset mining algorithm based on linear table,” Journal of Database Management (JDM), vol. 34, no. 1, pp. 1–21, 2023.

P. Fournier-Viger, C.-W.Wu, S. Zida, and V. S. Tseng, “Fhm: Faster high-utility itemset mining using estimated utility cooccurrence pruning,” in Foundations of Intelligent Systems: 21st International Symposium, ISMIS 2014, Roskilde, Denmark, June 25-27, 2014. Proceedings 21. Springer, 2014, pp. 83–92.

J. Liu, K. Wang, and B. CM Fung, “Direct discovery of high-utility itemsets without candidate generation,” in 2012 IEEE 12th international conference on data mining, 2012, pp. 984–989.

Y. Liu, W.-k. Liao, and A. Choudhary, “A two-phase algorithm for fast discovery of high-utility itemsets,” in Advances in Knowledge Discovery and Data Mining: 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005. Proceedings 9. Springer, 2005, pp. 689–695.

J.-S. Yeh, C.-Y. Chang, and Y.-T. Wang, “Efficient algorithms for incremental utility mining,” in Proceedings of

the 2nd international conference on Ubiquitous information management and communication, 2008, pp. 212–217.

C. F. Ahmed, S. K. Tanbeer, B.-S. Jeong, and Y.-K. Lee, “Efficient tree structures for high-utility pattern mining in incremental databases,” IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 12, pp. 1708–1721, 2009.

H.-T. Zheng and Z. Li, “ichum: an efficient algorithm for high-utility mining in incremental databases,” in Knowledge Science, Engineering and Management: 8th International Conference, KSEM 2015, Chongqing, China, October 28-30, 2015, Proceedings 8. Springer, 2015, pp. 212–223.

U. Yun and H. Ryang, “Incremental high-utility pattern mining with static and dynamic databases,” Applied intelligence, vol. 42, pp. 323–352, 2015.

M. Zihayat, Y. Chen, and A. An, “Memory-adaptive high utility sequential pattern mining over data streams,” Machine Learning, vol. 106, pp. 799–836, 2017.

D. Kim and U. Yun, “Mining high utility itemsets based on the time decaying model,” Intelligent Data Analysis, vol. 20, no. 5, pp. 1157–1180, 2016.

U. Yun, D. Kim, E. Yoon, and H. Fujita, “Damped window based high average utility pattern mining over data streams,” Knowledge-Based Systems, vol. 144, pp. 188–205, 2018.

H. Ryang and U. Yun, “High utility pattern mining over data streams with sliding window technique,” Expert Systems with Applications, vol. 57, pp. 214–231, 2016.

P. A. Reddy and M. K. Prasad, “Sliding window-based high utility item-sets mining over data stream using extended global utility item-sets tree,” International Journal of Software Innovation (IJSI), vol. 10, no. 1, pp. 1–16, 2022.

H. Yao and H. J. Hamilton, “Mining itemset utilities from transaction databases,” Data & Knowledge Engineering, vol. 59, no. 3, pp. 603–626, 2006.

M. Li, M. Han, Z. Chen, H. Wu, and X. Zhang, “Fchmstream: fast closed high utility itemsets mining over data

streams,” Knowledge and Information Systems, vol. 65, no. 6, pp. 2509–2539, 2023.

Method for Mining High-utility Patterns in Transaction Stream Data based on Linked List Structure

Abstract

Author Biographies

References

Aim, Scope, Indexing

Editorial Board