Show simple item record

dc.contributor.authorRajendran, Srividhyaen_US
dc.date.accessioned2009-09-16T18:20:47Z
dc.date.available2009-09-16T18:20:47Z
dc.date.issued2009-09-16T18:20:47Z
dc.date.submittedJanuary 2009en_US
dc.identifier.otherDISS-10179en_US
dc.identifier.urihttp://hdl.handle.net/10106/1905
dc.description.abstractA life long learning agent performing in a complex and dynamic environment needs the ability to learn increasingly complex tasks over time. These agents over their lifetime have to learn new tasks, adapt the policies of already learned tasks and extract and reuse the knowledge gained to learn new, more complex tasks. To do this, they need methods that allow them to autonomously extract knowledge from the already learned policy instances and reuse the knowledge gained to learn related tasks in novel environments.This dissertation presents a novel approach that enables an agent to autonomously abstract reusable skills and concepts using policy instances of a similar task type and use the resulting abstractions to learn related tasks in novel situations. To achieve this, this work formalizes a novel idea of policy homomorphism that allows autonomous extraction of general policies for <italic> task types </italic>. Each extracted general policy is here an abstract policy that is homomorphic to the set of specific policy instances of the corresponding task type that it is derived from and is made up of abstract states that identify situations in which the given policy is applicable and abstract actions that identify actions that need to be performed in those situations. Once extracted, the generalized policies are reused in new contexts to address related tasks by adding them as higher level actions that the agent can choose to perform. To facilitate the autonomous abstraction of a policy of a given <italic> task type </italic> from a set of policies, the agent has to identify and categorize policies for various tasks into different <italic> task types </italic>. To achieve this the policy generalization approach presented here employs a utility-based criterion that enables the agent to autonomously categorize and generalize a set of situation-specific policies of different <italic> task types </italic> into a set of general policies containing one general policy for each identified <italic> task type </italic> using the policy homomorphism framework. To demonstrate the working of this policy generalization method we show the abstraction of a general policy for a specific <italic> task type </italic> using two sets of policies of different <italic> task types </italic> in a grid world domain and further show how the abstracted general policies can be used to learn related tasks in novel grid world environments. Further, to demonstrate the working of the utility based criterion to identify <italic> task types </italic> and autonomously abstract general policies for the identified <italic> task types </italic> we show the abstraction of general polices using the utility criterion from a set of situation-specific policies of different <italic> task types </italic> in a grid world domain.en_US
dc.description.sponsorshipHuber, Manfreden_US
dc.language.isoENen_US
dc.publisherComputer Science & Engineeringen_US
dc.titleAutonomous Abstraction Of Policies Based On Policy Homomorphismen_US
dc.typePh.D.en_US
dc.contributor.committeeChairHuber, Manfreden_US
dc.degree.departmentComputer Science & Engineeringen_US
dc.degree.disciplineComputer Science & Engineeringen_US
dc.degree.grantorUniversity of Texas at Arlingtonen_US
dc.degree.leveldoctoralen_US
dc.degree.namePh.D.en_US
dc.identifier.externalLinkhttp://www.uta.edu/ra/real/editprofile.php?onlyview=1&pid=190
dc.identifier.externalLinkDescriptionLink to Research Profiles


Files in this item

Thumbnail


This item appears in the following Collection(s)

Show simple item record