
dc.contributor.author: Al-Tamimi, Asma Azmi [en_US]
dc.date.accessioned: 2007-10-08T23:55:04Z
dc.date.available: 2007-10-08T23:55:04Z
dc.date.issued: 2007-10-08T23:55:04Z
dc.date.submitted: May 2007 [en_US]
dc.identifier.other: DISS-1694 [en_US]
dc.identifier.uri: http://hdl.handle.net/10106/641
dc.description.abstract: In this work, approximate dynamic programming (ADP) designs based on adaptive critic structures are developed to solve discrete-time optimal control problems in which the state and action spaces are continuous. This work considers linear discrete-time systems as well as nonlinear discrete-time systems that are affine in the input. The research resulted in forward-in-time reinforcement learning algorithms that converge to the solution of the Generalized Algebraic Riccati Equation (GARE) for linear systems. For the nonlinear case, a forward-in-time reinforcement learning algorithm is presented that converges to the solution of the associated Hamilton-Jacobi-Bellman (HJB) equation. The results in the linear case can be viewed as a way to solve the GARE of the well-known discrete-time optimal control problem forward in time. Four design algorithms are developed: Heuristic Dynamic Programming (HDP), Dual Heuristic Dynamic Programming (DHP), Action-Dependent Heuristic Dynamic Programming (ADHDP), and Action-Dependent Dual Heuristic Dynamic Programming (ADDHP). The significance of these algorithms is that, for some of them, notably ADHDP, no a priori knowledge of the plant model is required to solve the dynamic programming problem. Another major outcome of this work is a convergent policy iteration scheme based on the HDP algorithm that allows neural networks to arbitrarily closely approximate the value function of the discrete-time HJB equation. This online algorithm may be implemented in a way that requires only partial knowledge of the model of the nonlinear dynamical system. The dissertation includes detailed proofs of convergence for the proposed algorithms: HDP, DHP, ADHDP, ADDHP, and the nonlinear HDP. Practical numerical examples are provided to show the effectiveness of the developed optimization algorithms. For nonlinear systems, a comparison with methods based on the State-Dependent Riccati Equation (SDRE) is also presented. In all the provided examples, parametric structures such as neural networks are used to find compact representations of the value function and optimal policies for the corresponding optimal control problems. [en_US]
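The forward-in-time convergence claim for the linear case is easiest to see in the HDP (value-iteration) recursion. The sketch below is an illustration only, not the dissertation's code: the system matrices A, B and weights Q, R are invented example values. Starting from the zero value function P_0 = 0, iterating P_{i+1} = A'P_i A + Q - A'P_i B (R + B'P_i B)^{-1} B'P_i A converges to the stabilizing solution of the discrete-time algebraic Riccati equation (the GARE referred to in the abstract), together with the optimal feedback gain.

```python
# Illustrative HDP (value-iteration) sketch for a discrete-time LQR problem.
# The matrices below are arbitrary example values, not from the dissertation.
import numpy as np

A = np.array([[0.9, 0.2],
              [0.0, 1.1]])   # one unstable open-loop mode
B = np.array([[0.0],
              [1.0]])
Q = np.eye(2)                # state weighting
R = np.array([[1.0]])        # control weighting

P = np.zeros((2, 2))         # HDP starts from the zero value function
for i in range(500):
    # Greedy policy for the current value function: K_i = (R + B'P_i B)^{-1} B'P_i A
    K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    # Value update: P_{i+1} = A'P_i A + Q - A'P_i B K_i
    P_next = A.T @ P @ A + Q - A.T @ P @ B @ K
    if np.linalg.norm(P_next - P) < 1e-10:   # converged to the Riccati solution
        P = P_next
        break
    P = P_next

print("Riccati (GARE) solution P:\n", P)
print("Optimal gain K:\n", np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A))
```

Note that the recursion runs forward in index i with no backward sweep over a horizon, which is the sense in which the dissertation's linear-case algorithms solve the GARE forward in time; the nonlinear HDP scheme replaces the quadratic form x'Px with a neural-network value approximator.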
dc.description.sponsorship: Lewis, Frank [en_US]
dc.language.iso: EN [en_US]
dc.publisher: Electrical Engineering [en_US]
dc.title: Discrete-time Control Algorithms And Adaptive Intelligent Systems Designs [en_US]
dc.type: Ph.D. [en_US]
dc.contributor.committeeChair: Lewis, Frank [en_US]
dc.degree.department: Electrical Engineering [en_US]
dc.degree.discipline: Electrical Engineering [en_US]
dc.degree.grantor: University of Texas at Arlington [en_US]
dc.degree.level: doctoral [en_US]
dc.degree.name: Ph.D. [en_US]
dc.identifier.externalLink: https://www.uta.edu/ra/real/editprofile.php?onlyview=1&pid=27
dc.identifier.externalLinkDescription: Link to Research Profiles

