WEB DATA USING MINING TECHNIQUES WITH THE HELP OF OPEN SOURCE SOFTWARE PACKAGES
Keywords:
Web Usage Mining, Rough Set Model, Conditional AttributesAbstract
Predicting the next page to be accessed by the Web users has attracted a large amount of research. In this paper, a new web usage mining approach is proposed to predict next page access. It is proposed to identify similar access patterns from web log using K-mean clustering and then Markov model is used for prediction for next page accesses. The tightness of clusters is improved by setting similarity threshold while forming clusters. In traditional recommendation models, clustering by non-sequential data decreases recommendation accuracy. In this paper involve incorporating clustering with low order markov model which can improve the prediction accuracy. The main area of research in this paper is preprocessing and identification of useful patterns from web data using mining techniques with the help of open source software packages.
References
Pasi Franti,Olli Virmajoki, and Ville Hautamaki “Fast Agglomerative Clustering Using a k Nearest Neighbor graph”,IEEE transaction on pattern analysis and machine intelligence.Vol 28,No 11. November 2006, pp 1875-1880
Pasi Franti, Timo Kaukoranta,Day-Fann Shen and Kuo-Shu Chang “Fast and Memory Efficient Implementation of exact PNN”,IEEE Transaction on image processing, Vol 9,No 5,May 2000.pp 773-777
Mathias G´ ery, Hatem Haddad,” Evaluation of Web Usage Mining approaches for user’s next request prediction” WIDM ’03 Boston, USA, ACM
Siripom chimphlee, Naomie Salim, Mohd Salihin Bin Ngadiman, Witcha, Surat ,”Rough Sets Clustering and Markov Model for Web Access Prediction” ,Proceedings of post graduate annual seminar 2006, pp. 470-474
Devanshu Dhyani, Sourav S Bhowmick, Wee-Keong Ng,” Modelling and predicting web page accesses using Markov Processes”, IEEE,Computer Society, 2003,1529-4188
Faten Khalil, Jiuyong Li,Hua Wang, “Integerating Recommendation Models for Improved Web Page Prediction Accuracy”, Australian Computer Society, 2008,Conferences in Research and Practice in Information Technology, Vol 74.
Bamshad Mobasher, Honghua Dai, Tao Luo, Miki Nakagawa, ”Effective Personalizaion based on association rule discovery from Web Usage Data”, ACM workshop on Web Information and Data management, Nov 2001.
Mukund Deshpande and George Karypis, “Selective markov model for predicting web-page accesses”, Army High performance Computing Research Center, pp.1-15
Faten Khalil, Jiuyong Li,Hua Wang, ” Integrating Markov model with clustering for predicting web page accesses”, Australian Conference, Mar 2007 ,pp 1-26
Jose Miguel Gago, Carlos Juiz “Web Mining Service (WMS), a public and free Servie for Web Data Mining”, IEEE Fourth international Conference on Internet and Web Applications and Services, pp. 351-356, 2009.
Li Lan, Rong Qiao-mei “Research of Web Mining Technology Based on XML”, IEEE International Conference on Network Security, Wireless Communications and Trusted Computing, Vol.2, pp. 653-656, 2009.
Amazon. 2006. Amazon Web Services Website. http://www.amazon.com/webservice
Additional Files
Published
How to Cite
Issue
Section
License
Copyright (c) 2018 International Educational Journal of Science and Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.