MULTI-PLAYER H1 DIFFERENTIAL GAME USING ON-POLICY AND OFF-POLICY REINFORCEMENT LEARNING

An, Peiliang

ATTENTION: The works hosted here are being migrated to a new repository that will consolidate resources, improve discoverability, and better show UTA's research impact on the global community. We will update authors as the migration progresses. Please see MavMatrix for more information.

View/Open

AN-THESIS-2021.pdf (209.0Kb)

Date

2021-08-03

Author

An, Peiliang

Metadata

Show full item record

Abstract

This work studies a multi-player H-infinity differential game for systems of general linear dynamics. In this game, multiple players design their control inputs to minimize their cost functions in the presence of worst-case disturbances. We first derive the optimal control and disturbance policies using the solutions to Hamilton-Jacobi-Isaacs (HJI) equations. We then prove that the derived optimal policies stabilize the system and constitute a Nash equilibrium solution. Two integral reinforcement learning (IRL) -based algorithms, including the policy iteration IRL and o -policy IRL, are developed to solve the differential game online. We show that the off-policy IRL can solve the multi-player H-infinity differential game online without using any system dynamics information. Simulation studies are conducted to validate the theoretical analysis and demonstrate the effectiveness of the developed learning algorithms.

URI

http://hdl.handle.net/10106/29981