Show simple item record

dc.contributor.advisorFegaras, Leonidas
dc.creatorNeupane, Gokarna
dc.date.accessioned2023-09-11T14:51:18Z
dc.date.available2023-09-11T14:51:18Z
dc.date.created2017-05
dc.date.submittedMay 2017
dc.identifier.urihttp://hdl.handle.net/10106/31669
dc.description.abstractWith explosive growth of data in past few years, discovering previously unknown, frequent patterns within the huge transactional data sets has been one of the most challenging and ventured fields in data mining. Apriori algorithm is widely used and one of the most researched field for frequent pattern mining. The exponential increase in the size of the input data has adverse effect on the efficiency of the traditional or centralized implementation of this algorithm. Thus, various distributed Frequent Itemset Mining(FIM) algorithms have been developed. MapReduce is a programming framework that allows the processing of large datasets with a distributed algorithm over a distributed cluster. During this research, We have implemented a parallel Apriori algorithm in Hadoop MapReduce framework with large volumes of input data and generate frequent patterns based on user defined parameters. We have implemented hash tree data structure to represent the candidate itemsets which aids in faster search for those candidates within a transaction. These experiments were conducted in real-life datasets and varying parameters. Based on various evaluations, the proposed algorithm turns out to be scalable and efficient method to generate frequent item-sets from a large dataset over a distributed network.
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.subjectmapreduce
dc.subjectapriori
dc.subjectparallel apriori
dc.titleA PARALLEL IMPLEMENTATION OF APRIORI ALGORITHM FOR MINING FREQUENT ITEMSETS IN HADOOP MAPREDUCE FRAMEWORK
dc.typeThesis
dc.date.updated2023-09-11T14:51:19Z
thesis.degree.departmentComputer Science and Engineering
thesis.degree.grantorThe University of Texas at Arlington
thesis.degree.levelMasters
thesis.degree.nameMaster of Science in Computer Science
dc.type.materialtext
dc.creator.orcid0000-0003-1256-0454


Files in this item

Thumbnail


This item appears in the following Collection(s)

Show simple item record