A Survey on User Behaviour Prediction Using Web Server Log
Keywords:
Prepossessing, Web Server Log, Association Rule Mining, Markov ModelAbstract
Web prediction is a classification problem in which we attempt to predict the next set of request that a user may make based on the knowledge of the previously visited pages. User behaviour prediction deals with collecting the web server logs, prepossess the data and analyze the pattern. The main goal of this process is to extract the useful pattern from raw data collection. Prepossessing contains cleaning the data, user identification and session identification which saves 80% of processing data. Web server log contains much useful information like server ip, time, date, error code, browser etc. there are various prediction models like n-gram session identification, association rule mining, markov model and support vector machine are available. Here comparison is done on the basis of accuracy, processing time and scalability.
References
- Bing, L., Web Data Mining, 2nd ed., Berlin: Springer-Verlag, 2011.
- Hao, C., Yu-bo, J., Cheng-wei, H., Zhi-qiang, H., “Improved method for session identification in web log mining,” Journal of Computer Engineering and Design, vol. 30, 2009, pp. 1321-1323.
- Zi-jun, C., Xin-yu, W., Wei, L., “Method of web log session reconstruction,” Journal of Computer Engineering, vol.33, 2007, pp.95-97.
- He Xinhua,Wang Qiong,” Dynamic Timeout-Based a Session Identification Algorithm” Electric Information and Control. Engineering (ICEICE), 2011 International Conference on ISBN 978-1-4244-8036-4,15-17 April 2011
- Mamoun A. Awad and Issa Khalil, “Prediction of User’s Web- Browsing Behavior: Application of Markov Model”, IEEE Transactions on Systems, Man, And Cybernetics—Part B: Cybernetics, vol. 42, no. 4, pp. 1131-1142, August 2012.
- Zhong Su*, Qiang Yang* and Hongjiang Zhang,” WhatNext: A Prediction System forWeb Requests using N-gram Sequence Models”, IEEE
Downloads
Published
Issue
Section
License
Copyright (c) IJSRSET

This work is licensed under a Creative Commons Attribution 4.0 International License.