MACHINE LEARNING WITH GRAPHS
Abstract
In recent years, graph-based machine learning methods have attracted great attention because of their effectiveness and efficiency. Inspired by this trend, this thesis summarizes my research topics on machine learning techniques for the purpose of handling various kinds of problems on large graph data.
Generally, this thesis contains two parts. The first part is devoted to graph embedding, which aims to encode graph structure into dense vectors (or embeddings). In particular, we will consider a low rank-matrix factorization based approach to learn embeddings of attributed graphs. By jointly preserving graph structure and attribute-level similarity, our approach can generate embeddings, whose quality is higher than that of embeddings generated by state-of-the-art methods.
The second part of the thesis is devoted to graph-based semi-supervised learning, which attempts to predict labels for unlabeled nodes given a small set of labeled nodes and a large set of unlabeled nodes. In this part, we consider two different approaches: graph-regularization based semi-supervised learning and graph convolutional network, which deal with non-attributed and attributed graphs respectively. For graph-regularization based semi-supervised learning, we develop a simple approach for imbalanced classification, which can not only learn a smooth label function on the graph but also take into account the class imbalance of datasets. For graph convolutional network, we first introduce an attention mechanism induced by sub-maximal entropy random walks. Given this, we propose an attention-based graph convolutional network, which can jointly learn node attributes and graph structures at multiply scales. Both approaches can achieve promising performance on several benchmark datasets.
Related items
Showing items related by title, author, creator and subject.
-
PROCESSING QUERIES OVER PARTITIONED GRAPH DATABASES: AN APPROACH AND IT'S EVALUATION
Bodra, Jay Dilipbhai D.; 0000-0003-0723-0709 (2016-05-12)Representation of structured data using graphs is meaningful for applications such as road and social networks. With the increase in the size of graph databases, querying them to retrieve desired information poses challenges ... -
Semi-supervised Learning using Triple-Siamese Network
Banerjee, Debapriya; 0000-0001-6666-5863 (2020-06-04)Missing data problem is inevitable in mostly all research areas including Artificial Intelligence, Machine Learning and Computer Vision where we have modicum knowledge about the complete dataset. One of the key reasons of ... -
Graph Representation Learning for Heterogeneous Multimodal Biomedical Data
Tran, Nhat Chau; 0000-0002-2575-9633 (2022-12-20)The emergence of high-throughput sequencing technology has generated a wealth of “multi-omics” data, capturing information about different types of biomolecules at multiple levels. Since large-scale genomics, transcriptomics, ...