Method for Mining High-utility Patterns in Transaction Stream Data based on Linked List Structure
Abstract
Mining valuable patterns in data streams presents
a significant challenge in the field of data mining. This
task is crucial as it allows for the identification of highly
profitable item sets within transaction databases. However, as
new transactions are continually added, new valuable patterns
emerge, thus changing the usefulness of previously analyzed
data. It is essential to promptly update information regarding
these changes to enable effective business decision-making.
Consequently, existing mining methods applied to transaction
flow datasets require considerable time to identify new
patterns and update information related to new transactions.
This article focuses on the research and proposal of a new
transaction stream data mining method called High-Utility
Stream Linked-List Mining. The method utilizes a linked
list structure known as the High-Utility Stream Linked List
(HUSLL) to store information about patterns in the database.
Mining and updating transaction information are directly
performed on the HUSLL structure. Experimental results
demonstrate that this novel mining method exhibits more
efficient execution times compared to previous solutions.
References
R. Agrawal, R. Srikant et al., “Fast algorithms for mining association rules,” in Proc. 20th int. conf. very large data bases, VLDB, vol. 1215. Santiago, Chile, 1994, pp. 487–499.
P. Fournier-Viger, J. Chun-Wei Lin, T. Truong-Chi, and R. Nkambou, “A survey of high-utility itemset mining,”
High-utility pattern mining: Theory, algorithms and applications, pp. 1–45, 2019.
S. Krishnamoorthy, “Hminer: Efficiently mining high-utility itemsets,” Expert Systems with Applications, vol. 90, pp. 168–183, 2017.
S. Zida, P. Fournier-Viger, J. C.-W. Lin, C.-W. Wu, and V. S. Tseng, “Efim: a fast and memory efficient algorithm
for high-utility itemset mining,” Knowledge and Information Systems, vol. 51, no. 2, pp. 595–625, 2017.
M. K. Sohrabi, “An efficient projection-based method for high utility itemset mining using a novel pruning approach on the utility matrix,” Knowledge and Information Systems, vol. 62, pp. 4141–4167, 2020.
P. Wu, X. Niu, P. Fournier-Viger, C. Huang, and B. Wang, “Ubp-miner: An efficient bit based high utility itemset
mining algorithm,” Knowledge-Based Systems, vol. 248, p. 108865, 2022.
J. C.-W. Lin, W. Gan, T.-P. Hong, B. Zhang et al., “An incremental high-utility mining algorithm with transaction
insertion,” The Scientific World Journal, vol. 2015, 2015.
P. Fournier-Viger, J. C.-W. Lin, T. Gueniche, and P. Barhate, “Efficient incremental high-utility itemset mining,” in Proceedings of the ASE BigData & SocialInformatics 2015, 2015, pp. 1–6.
U. Yun, H. Ryang, G. Lee, and H. Fujita, “An efficient algorithm for mining high-utility patterns from incremental databases with one database scan,” Knowledge-Based Systems, vol. 124, pp. 188–206, 2017.
H.-F. Li, H.-Y. Huang, Y.-C. Chen, Y.-J. Liu, and S.-Y. Lee, “Fast and memory efficient mining of high-utility itemsets in data streams,” in 2008 eighth IEEE international conference on data mining. IEEE, 2008, pp. 881–886.
C. F. Ahmed, S. K. Tanbeer, and B.-S. Jeong, “Efficient mining of high-utility patterns over data streams with a
sliding window method,” Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing 2010, pp. 99–113, 2010.
J. Han, J. Pei, and Y. Yin, “Mining frequent patterns without candidate generation,” ACM sigmod record, vol. 29, no. 2, pp. 1–12, 2000.
G. Pyun, U. Yun, and K. H. Ryu, “Efficient frequent pattern mining based on linear prefix tree,” Knowledge-Based Systems, vol. 55, pp. 125–139, 2014.
Y. Mao, B. Wu, Q. Deng, S. Mahmoodi, Z. Chen, and Y.-C. Chen, “A novel parallel frequent itemset mining algorithm for automatic enterprise,” Enterprise Information Systems, p. 2204317, 2023.
J. Lu, W. Xu, K. Zhou, and Z. Guo, “Frequent itemset mining algorithm based on linear table,” Journal of Database Management (JDM), vol. 34, no. 1, pp. 1–21, 2023.
P. Fournier-Viger, C.-W.Wu, S. Zida, and V. S. Tseng, “Fhm: Faster high-utility itemset mining using estimated utility cooccurrence pruning,” in Foundations of Intelligent Systems: 21st International Symposium, ISMIS 2014, Roskilde, Denmark, June 25-27, 2014. Proceedings 21. Springer, 2014, pp. 83–92.
J. Liu, K. Wang, and B. CM Fung, “Direct discovery of high-utility itemsets without candidate generation,” in 2012 IEEE 12th international conference on data mining, 2012, pp. 984–989.
Y. Liu, W.-k. Liao, and A. Choudhary, “A two-phase algorithm for fast discovery of high-utility itemsets,” in Advances in Knowledge Discovery and Data Mining: 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005. Proceedings 9. Springer, 2005, pp. 689–695.
J.-S. Yeh, C.-Y. Chang, and Y.-T. Wang, “Efficient algorithms for incremental utility mining,” in Proceedings of
the 2nd international conference on Ubiquitous information management and communication, 2008, pp. 212–217.
C. F. Ahmed, S. K. Tanbeer, B.-S. Jeong, and Y.-K. Lee, “Efficient tree structures for high-utility pattern mining in incremental databases,” IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 12, pp. 1708–1721, 2009.
H.-T. Zheng and Z. Li, “ichum: an efficient algorithm for high-utility mining in incremental databases,” in Knowledge Science, Engineering and Management: 8th International Conference, KSEM 2015, Chongqing, China, October 28-30, 2015, Proceedings 8. Springer, 2015, pp. 212–223.
U. Yun and H. Ryang, “Incremental high-utility pattern mining with static and dynamic databases,” Applied intelligence, vol. 42, pp. 323–352, 2015.
M. Zihayat, Y. Chen, and A. An, “Memory-adaptive high utility sequential pattern mining over data streams,” Machine Learning, vol. 106, pp. 799–836, 2017.
D. Kim and U. Yun, “Mining high utility itemsets based on the time decaying model,” Intelligent Data Analysis, vol. 20, no. 5, pp. 1157–1180, 2016.
U. Yun, D. Kim, E. Yoon, and H. Fujita, “Damped window based high average utility pattern mining over data streams,” Knowledge-Based Systems, vol. 144, pp. 188–205, 2018.
H. Ryang and U. Yun, “High utility pattern mining over data streams with sliding window technique,” Expert Systems with Applications, vol. 57, pp. 214–231, 2016.
P. A. Reddy and M. K. Prasad, “Sliding window-based high utility item-sets mining over data stream using extended global utility item-sets tree,” International Journal of Software Innovation (IJSI), vol. 10, no. 1, pp. 1–16, 2022.
H. Yao and H. J. Hamilton, “Mining itemset utilities from transaction databases,” Data & Knowledge Engineering, vol. 59, no. 3, pp. 603–626, 2006.
M. Li, M. Han, Z. Chen, H. Wu, and X. Zhang, “Fchmstream: fast closed high utility itemsets mining over data
streams,” Knowledge and Information Systems, vol. 65, no. 6, pp. 2509–2539, 2023.