Efficiency and Effectiveness in Utility-Based and Frequent Itemset Mining: A Comprehensive Review

Authors

  • Nishigandha Mhatre  Department of Computer Engineering, Pillai HOC College of Engineering and Technology, Rasayani, Maharashtra, India
  • Srijita Bhattacharjee  Department of Computer Engineering, Pillai HOC College of Engineering and Technology, Rasayani, Maharashtra, India

Keywords:

Frequent Mining, Utility Mining, Performance Parameters, Itemset Mining.

Abstract

The rapid growth of data in various domains has led to the need for efficient pattern mining algorithms that can handle large-scale datasets. In this study, the comprehensive review in the domains of utility-based and frequent itemset mining (FIM), focusing on the dual facets of efficiency and effectiveness. In the ever-expanding landscape of data mining and knowledge discovery, the extraction of valuable patterns and insights from large datasets is paramount. Utility-based mining addresses the challenge of incorporating user-defined measures of importance, reflecting real-world applications where not all items are equal. Simultaneously, FIM seeks to identify recurring patterns within datasets, providing valuable associations and dependencies. This review synthesizes recent advancements and methodologies in both utility-based and FIM, analyzing their respective strengths and limitations. Efficiency considerations encompass algorithmic optimizations, parallel computing, and scalability, ensuring that the mining process is computationally tractable for large-scale datasets. Effectiveness evaluations delve into the quality of discovered patterns, emphasizing their relevance and utility in diverse applications. The synthesis of these two mining paradigms underscores the importance of striking a balance between computational efficiency and the ability to extract meaningful patterns. By examining state-of-the-art techniques and methodologies, this review aims to provide researchers, practitioners, and decision-makers with a comprehensive understanding of the current landscape in utility-based and FIM, offering insights into future directions for advancing the efficiency and effectiveness of pattern discovery in diverse domains.

References

  1. Yang Yang, Jiaman Ding, Honghai Wang, Huifen Xing, En Li, "A High Utility Itemset Mining Algorithm Based on Particle Filter", Mathematical Problems in Engineering, vol. 2023, Article ID 7941673, 15 pages, 2023. https://doi.org/10.1155/2023/7941673
  2. Liu, Y.; Wang, L.; Feng, L.; Jin, B. Mining High Utility Itemsets Based on Pattern Growth without Candidate Generation. Mathematics 2021, 9, 35. https://doi.org/10.3390/math9010035
  3. Peng Wu, Xinzheng Niu, Philippe Fournier-Viger, Cheng Huang, and Bing Wang. 2022. UBP-Miner: An efficient bit based high utility itemset mining algorithm. Know. -Based Syst. 248, C (Jul 2022). https://doi.org/10.1016/j.knosys.2022.108865
  4. Mai T, Nguyen LTT, Vo B, Yun U, Hong TP. Efficient Algorithm for Mining Non-Redundant High-Utility Association Rules. Sensors (Basel). 2020 Feb 17;20(4):1078. doi: 10.3390/s20041078. PMID: 32079200; PMCID: PMC7070778.
  5. Rashad Saeed, Azhar Rauf, Fahmi H. Quradaa, Syed Muhammad Asim, "Efficient Utility Tree-Based Algorithm to Mine High Utility Patterns Having Strong Correlation", Complexity, vol. 2021, Article ID 7310137, 18 pages, 2021. https://doi.org/10.1155/2021/7310137
  6. Dam, TL., Li, K., Fournier-Viger, P. et al. CLS-Miner: efficient and effective closed high-utility itemset mining. Front. Comput. Sci. 13, 357–381 (2019). https://doi.org/10.1007/s11704-016-6245-4
  7. Jimmy Ming-Tai Wu, Jerry Chun-Wei Lin, and Ashish Tamrakar. 2019. High-Utility Itemset Mining with Effective Pruning Strategies. ACM Trans. Knowl. Discov. Data 13, 6, Article 58 (December 2019), 22 pages. https://doi.org/10.1145/3363571
  8. Wu, J. M.-T., Srivastava, G., Wei, M., Yun, U., & Lin, J. C.-W. (2021). Fuzzy high-utility pattern mining in parallel and distributed Hadoop framework. Information Sciences, 553, 31-48. 10.1016/j.ins.2020.12.004
  9. Le Wang, Shui Wang, Haiyan Li, Chunliang Zhou, "Improved Strategy for High-Utility Pattern Mining Algorithm", Mathematical Problems in Engineering, vol. 2020, Article ID 1971805, 11 pages, 2020. https://doi.org/10.1155/2020/1971805
  10. D. Kim et al., "Medical Big Data Analysis System to Discover Associations between Genetic Variants and Diseases," ICC 2021 - IEEE International Conference on Communications, Montreal, QC, Canada, 2021, pp. 1-6, doi: 10.1109/ICC42927.2021.9500497.
  11. Sethi, Krishan & Ramesh, Dharavath & Edla, Damodar. (2018). P-FHM+: Parallel high utility itemset mining algorithm for big data processing. Procedia Computer Science. 132. 918-927. 10.1016/j.procs.2018.05.107.
  12. Patel, S. B., Shah, S. M., & Patel, M. N. (2022). An Efficient High Utility Itemset Mining Approach using Predicted Utility Co-exist Pruning. International Journal of Intelligent Systems and Applications in Engineering, 10(4), 224–230
  13. Qu, Jun-Feng & Liu, Mengchi & Fournier Viger, Philippe. (2019). Efficient Algorithms for High Utility Itemset Mining Without Candidate Generation. 10.1007/978-3-030-04921-8_5.
  14. Mai, Thang & Nguyen, Loan. (2017). An efficient approach for mining closed high utility itemsets and generators. Journal of Information and Telecommunication. 1. 193-207. 10.1080/24751839.2017.1347392.
  15. Dawar, Siddharth et al. “A hybrid framework for mining high-utility itemsets in a sparse transaction database.” Applied Intelligence 47 (2017): 809-827.
  16. Mathe John Kenny Kumar and Dipti Rana. 2023. RSPHUIM: Recent Short Period High Utility Itemset Mining. SN Comput. Sci. 4, 5 (Sep 2023). https://doi.org/10.1007/s42979-023-01967-y
  17. Chen, Jiahui & Shicheng, Wan & Gan, Wensheng & Chen, Guoting & Fujita, Hamido. (2021). TOPIC: Top-k High-Utility Itemset Discovering.
  18. Han, Xixian & Liu, Xianmin & Li, Jianzhong & Gao, Hong. (2020). Efficient Top-k High Utility Itemset Mining on Massive Data. Information Sciences. 557. 10.1016/j.ins.2020.08.028.
  19. Djenouri, Y.; Comuzzi, M. Combining Apriori heuristic and bio-inspired algorithms for solving the frequent itemsets mining problem. Inf. Sci. 2017, 420, 1–15
  20. Nguyen, L.T.; Nguyen, P.; Nguyen, T.D.; Vo, B.; Fournier-Viger, P.; Tseng, V.S. Mining high-utility itemsets in dynamic profit databases. Knowl.-Based Syst. 2019, 175, 130–144
  21. Aggarwal CC (2014) An introduction to Frequent Pattern Mining. In: Aggarwal CC, Han J (eds) Frequent Pattern Mining. Springer, Basel, pp 1–14
  22. Jamsheela O, Raju G (2015) Frequent itemset mining algorithms: a literature survey. In: Paper presented at the 2015 IEEE international advance computing conference (IACC), Banglore.

Downloads

Published

2024-02-07

Issue

Section

Research Articles

How to Cite

[1]
Nishigandha Mhatre, Srijita Bhattacharjee "Efficiency and Effectiveness in Utility-Based and Frequent Itemset Mining: A Comprehensive Review" International Journal of Scientific Research in Science, Engineering and Technology (IJSRSET), Print ISSN : 2395-1990, Online ISSN : 2394-4099, Volume 11, Issue 1, pp.179-192, January-February-2024.