Mining Association Rules on Time-sensitive Data

  • Dang-Ngo Huu
  • Nghia Le
  • Khac-Chien Nguyen CNTT
Keywords: Association rules, rare association rules, rare patterns, common patterns, damped window model

Abstract

Real-world data is continuously produced and
collected quickly; hence, mining association rules are used to
find relationships between item sets. One of these combined
rule mining techniques is Rare Association Rule Mining
(RARM), a method that extracts rare, combined rules with low
support but high confidence from the database. Additionally,
a large amount of time-sensitive data is created in many fields,
and combining valid and outdated data leads to low efficiency
in extracting combined rules. This paper explores a mining
method that increases rare, combined rules on time-sensitive
data to extract common and rare patterns in the database.
The primary approach of the research proposes a new tree
structure called ILC-tree, inspired by the LC-tree structure
published in 2022. Specifically, our approach includes a new
tree structure and a fast reconstruction method. The proposed
approach reduces execution time and memory consumption
by approximately 2 times compared to the old tree structure.

Author Biographies

Dang-Ngo Huu

Dang-Ngo Huu received the B.E. degree in Data Science from Faculty of Information Technology, University
of Science in 2023. Research direction: Data Science, Privacy-preserving data mining.

Nghia Le

Nghia Le received the B.E. degree from Faculty of Electronics and Telecommunications, Ho Chi Minh City University of Technology in 1995. In 1998, he received Master degree in Computer Science, Ho Chi Minh City University of Science. His research interests include: Artificial Intelligence in Embedded Systems and Data Security.

Khac-Chien Nguyen, CNTT

Khac-Chien Nguyen received his Bachelor’s degree in Information Technology from the People’s Security Academy in 2003 and received the Master degree in Computer Science from the University of Natural Sciences- Vietnam National University, Ho Chi Minh City in 2009, and his PhD in Communication Engineering from the Post and Telecommunication Institute of Technology Hanoi in 2019. His research interests include: Cloud
computing and Data mining.

References

Hipp, Jochen, Ulrich G¨untzer, and Gholamreza Nakhaeizadeh, ”Algorithms for association rule mining - a general survey and comparison,” ACM SIGKDD Explorations Newsletter, vol. 2, no. 1, 2000, pp. 58–64.

Abbasi, Ameer Ahmed, and Mohamed Younis. ”A survey on clustering algorithms for wireless sensor networks,” Computer Communications, vol. 30, no. 14-15, 2007, pp. 2826–2841.

Hu, Kerui, Lemiao Qiu, Shuyou Zhang, Zili Wang, and Naiyu Fang, ”An incremental rare association rule mining approach with a life cycle tree structure considering timesensitive data,” Applied Intelligence, vol. 53, no. 9, 2023, pp. 10800–10824.

Srikant, Ramakrishnan, and Rakesh Agrawal, ”Mining generalized association rules,” Future Generation Computer Systems, vol. 13, no. 2-3, 1997, pp. 161–180.

Haglin, David J., and Anna M. Manning, ”On minimal infrequent itemset mining,” In Proceedings of DMIN, 2007, pp. 141–147.

Yun, Hyunyoon, Danshim Ha, Buhyun Hwang, and Keun Ho Ryu, ”Mining association rules on significant rare data using relative support,” Journal of Systems and Software, vol. 67, no. 3, 2003, pp. 181–191.

Koh, Yun Sing, and Nathan Rountree, ”Finding sporadic rules using apriori-inverse,” In Proceedings of the 9th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Springer, 2005, pp. 97–106.

Szathmary, Laszlo, Amedeo Napoli, and Petko Valtchev, ”Towards rare itemset mining,” In Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI), vol. 1, 2007, pp. 305–312.

Han, Jiawei, Jian Pei, and Yiwen Yin, ”Mining frequent patterns without candidate generation,” ACM SIGMOD Record, vol. 29, no. 2, 2000, pp. 1–12.

Koh, Jia-Ling, and Shui-Feng Shieh, ”An efficient approach for maintaining association rules based on adjusting FP-tree structures,” In Proceedings of the International Conference on Database Systems for Advanced Applications, Springer, 2004, pp. 417–424.

Hong, Tzung-Pei, Chun-Wei Lin, and Yu-Lung Wu, ”Incrementally fast updated frequent pattern trees,” Expert Systems with Applications, vol. 34, no. 4, 2008, pp. 2424–2435.

Adnan, Muhaimenul, Reda Alhajj, and Ken Barker, ”Alternative method for incrementally constructing the FP-tree,” In Proceedings of the 3rd International IEEE Conference on Intelligent Systems, 2006, pp. 494–499.

Borah, Anindita, and Bhabesh Nath, ”Incremental rare pattern based approach for identifying outliers in medical data,” Applied Soft Computing, vol. 85, 2019, pp. 105824.

Ahmed, Shafiul Alom, and Bhabesh Nath, ”ISSP-tree: An improved fast algorithm for constructing a complete prefix tree using single database scan,” Expert Systems with Applications, vol. 185, 2021, pp. 115603.

Mahdi, Mahmoud A., Khalid M. Hosny, and Ibrahim Elhenawy, ”FR-Tree: A novel rare association rule for big data problem,” Expert Systems with Applications, vol. 187, 2022, pp. 115898.

Yun, Unil, Donggyu Kim, Eunchul Yoon, and Hamido Fujita, ”Damped window based high average utility pattern mining over data streams,” Knowledge-Based Systems, vol. 144, 2018, pp. 188–205.

Published
2024-09-15