ATTENTION: The works hosted here are being migrated to a new repository that will consolidate resources, improve discoverability, and better show UTA's research impact on the global community. We will update authors as the migration progresses. Please see MavMatrix for more information.
Show simple item record
dc.contributor.advisor | Makedon, Fillia | |
dc.creator | Theofanidis, Michail | |
dc.date.accessioned | 2020-01-10T23:00:23Z | |
dc.date.available | 2020-01-10T23:00:23Z | |
dc.date.created | 2019-12 | |
dc.date.issued | 2019-12-06 | |
dc.date.submitted | December 2019 | |
dc.identifier.uri | http://hdl.handle.net/10106/28871 | |
dc.description.abstract | The coexistence of humans and robots has been the aspiration of many scientific endeavors in the past century. Most anthropomorphic or industrial robots are highly articulated and complex machines, which are designed to carry out tasks that often involve the manipulation of physical objects. Traditionally, robots learn how to perform such tasks with the aid of a human programmer or operator. In this regard, the human acts as a teacher who provides a demonstration of a task. From the data of the demonstration, the robot must learn a state-action mapping that accomplishes the task. This state-action mapping is often addressed in the literature as a policy[1]. A common strategy for the acquisition of robot motor policies for a task is often achieved with Learning from Demonstration (LfD) algorithms [1]. Initial attempts to create LfD methods relied purely on supervised learning algorithms [2], while most modern paradigms rely on Reinforcement Learning (RL). This phenomenon indicates a shift from supervised learning to goal-oriented algorithms [3]. The development of the Dynamic Movement Primitive (DMP) framework [4] was an essential contribution to this trend, as it provides an abstraction layer between the dimensions of state, action, and environment by computing a policy with distinct meta-parameters that affect the behavior of the robot [5]. One of the advantages of the DMP framework is its ability to learn motor policies by transforming motion trajectories (high-dimensional space) to specific motion features (low-dimensional latent space) via regression. The DMP framework learns policies that lie in the trajectory level. However, humans and other animals are capable of learning new behaviors simply by observation. Robots need to achieve the same performance even if there is a substantial domain shift in the environment, embodiment, and perspective between the robot and the teacher [6]. Most modern large deep neural network models can enable complex motor skill representation across different embodiments. As such, we propose a method to learn end-to-end visuomotor policies for robotic arms from demonstrations. The method computes state-action mappings in a supervised learning manner from raw images and motor commands. At the core of the system, a Convolutional Neural Network (CNN) extracts image features and produces motion features. The motion features encode and reproduce motor commands according to the Dynamic Movement Primitives (DMP) framework. | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en_US | |
dc.subject | Robotic manipulators | |
dc.subject | Computer vision | |
dc.subject | Machine learning | |
dc.subject | Reinforcement learning | |
dc.subject | Dynamic movement primitives | |
dc.title | LEARNING ROBOT MANIPULATION TASKS VIA OBSERVATION | |
dc.type | Thesis | |
dc.degree.department | Computer Science and Engineering | |
dc.degree.name | Doctor of Philosophy in Computer Science | |
dc.date.updated | 2020-01-10T23:02:33Z | |
thesis.degree.department | Computer Science and Engineering | |
thesis.degree.grantor | The University of Texas at Arlington | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Doctor of Philosophy in Computer Science | |
dc.type.material | text | |
Files in this item
- Name:
- THEOFANIDIS-DISSERTATION-2019.pdf
- Size:
- 30.27Mb
- Format:
- PDF
This item appears in the following Collection(s)
Show simple item record