Weblog and retail industries analysis using a robust modified Apriori algorithm
No Thumbnail Available
Date
2019
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
With the rapid development of the usage of the Internet. There is a lot of information on the Internet. The Weblogs and retail industries accumulate a large amount of transactions data. It is important that this data has to be properly used to find the hidden patterns and mine the association rules among these data. First part is the data pre-processing phase which is most important but makes the data of good quality and the next is the pattern discovery phase for that we use the Apriori algorithm. The Apriori algorithm is used to find association rules and commodities and could help in promoting sales and user interaction. In this paper, we propose a modified algorithm by reducing the dimensions of the items in the transaction set database Dk and Candidate set Ck. This also provides proof of the proposed optimization techniques. This paper introduces the hashing technique for finding the frequencies of items in the one step with O (1) time complexity. The buffer is introduced to keep track of all the unwanted sets which helps in increasing the memory efficiently and computation. This paper gives the application with an example and implementation with proposed optimization techniques. � BEIESP.
Description
Keywords
Apriori Algorithm, Association Rules, Candidate sets, Data Cleaning, Frequent Itemsets, Hashing, Retailer Stores, Web Log Data