ATTENTION: The works hosted here are being migrated to a new repository that will consolidate resources, improve discoverability, and better show UTA's research impact on the global community. We will update authors as the migration progresses. Please see MavMatrix for more information.
Show simple item record
dc.contributor.author | Kong, Deguang | en_US |
dc.date.accessioned | 2014-03-12T23:49:04Z | |
dc.date.available | 2014-03-12T23:49:04Z | |
dc.date.issued | 2014-03-12 | |
dc.date.submitted | January 2013 | en_US |
dc.identifier.other | DISS-12440 | en_US |
dc.identifier.uri | http://hdl.handle.net/10106/24078 | |
dc.description.abstract | Nowadays, in order to sense environment and understand human behaviors, data analysis plays a more and more important role to handle heterogeneous data ranging from different domains, e.g., image categorization/annotation, customer segmentation, traffic prediction, ad optimization, recommendation systems, privacy analysis, etc. The large amount of multivariate data raises the fundamental problem of data mining: how to discover meaningful compact patterns hidden in the high-dimensional noisy observations? One approach is to do dimension reduction, which finds the low-dimensional subspace and thus encodes data in a low-dimensional structure. The other approach is to do feature selection or feature engineering, which manipulates the features to capture the most discriminant patterns for classification/clustering tasks.In this thesis, to further improve the low-dimensional embedding results, an iteratively locally linear embedding algorithm is proposed, which captures the global structure of non-linear manifold through iteratively updating the embedding. To handle noisy data (e.g., data with missing values, corrupted values) classification problem, a robust data recovery model via Schatten-p norm is proposed to preprocessing the noisy data, where the rank of the data is implicitly decreased. To utilize the feature structure with constraints, an efficient feature learning algorithm via group lasso is proposed to handle features on arbitrary structure, whose convergence can be rigorously proved. To handle the problem of limited labeled data in image categorization/annotation tasks, efficient maximum consistency label propagation methods are proposed to improve the performance of graph-based semi-supervised learning methods, which utilizes both the labeled data information and graph manifold information. Extensive experiments indicate the good performance of proposed algorithms. | en_US |
dc.description.sponsorship | Ding, Chris | en_US |
dc.language.iso | en | en_US |
dc.publisher | Computer Science & Engineering | en_US |
dc.title | Image Annotation And Feature Engineering Via Structural Sparsity And Low Rank Approximation | en_US |
dc.type | Ph.D. | en_US |
dc.contributor.committeeChair | Ding, Chris | en_US |
dc.degree.department | Computer Science & Engineering | en_US |
dc.degree.discipline | Computer Science & Engineering | en_US |
dc.degree.grantor | University of Texas at Arlington | en_US |
dc.degree.level | doctoral | en_US |
dc.degree.name | Ph.D. | en_US |
Files in this item
- Name:
- Kong_uta_2502D_12440.pdf
- Size:
- 1.118Mb
- Format:
- PDF
This item appears in the following Collection(s)
Show simple item record