dc.contributor.advisor | Fegaras, Leonidas | |
dc.creator | Ulde, Ahmed Abdul Hameed A | |
dc.date.accessioned | 2016-09-28T17:33:30Z | |
dc.date.available | 2016-09-28T17:33:30Z | |
dc.date.created | 2016-05 | |
dc.date.issued | 2016-05-12 | |
dc.date.submitted | May 2016 | |
dc.identifier.uri | http://hdl.handle.net/10106/25879 | |
dc.description.abstract | Non-negative matrix factorization is a well-known, complex machine learning algorithm that is also used in collaborative filtering. Collaborative filtering techniques are used in recommendation systems and aim to predict the missing values in a user-item association matrix. The user-item association matrix has one row per user and one column per movie, and its values are the ratings given by users to the respective movies. These matrices have large dimensions and missing values, and they need parallel processing. Map-Reduce Query Language (MRQL) is a query processing and optimization system for large-scale, distributed data analysis, built on top of Apache Hadoop, Spark, Hama, and Flink. Large-scale matrix operations require proper scaling and optimization in distributed systems. Therefore, in this work we analyze the performance of MRQL on complex matrix operations using different sparse matrix datasets in Spark mode. This work aims at a performance analysis of Map-Reduce Query Language on complex matrix operations and the ease of scaling these operations. We performed simple matrix operations such as multiplication, division, addition, and subtraction, as well as complex operations such as factorization. Gaussian non-negative matrix factorization and stochastic gradient descent based matrix factorization are the two algorithms tested in the Spark and Flink modes of MRQL with a dataset of movie ratings. The performance analysis in the experiments will help readers understand and analyze the performance of MRQL and learn more about MRQL. | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en_US | |
dc.subject | Matrix factorization | |
dc.subject | Map reduce query language | |
dc.subject | MRQL | |
dc.title | PERFORMANCE EVALUATION OF MATRIX OPERATIONS ON MAP-REDUCE QUERY LANGUAGE | |
dc.type | Thesis | |
dc.degree.department | Computer Science and Engineering | |
dc.degree.name | Master of Science in Computer Science | |
dc.date.updated | 2016-09-28T17:35:37Z | |
thesis.degree.department | Computer Science and Engineering | |
thesis.degree.grantor | The University of Texas at Arlington | |
thesis.degree.level | Masters | |
thesis.degree.name | Master of Science in Computer Science | |
dc.type.material | text | |
Files in this item
- Name: ULDE-THESIS-2016.pdf
- Size: 311.6Kb
- Format: PDF