Nonuniqueness and equivalence in online inverse reinforcement learning with applications to pilot performance modeling

Town, Jared Curtis

dc.contributor.advisor	Kamalapurkar, Rushikesh
dc.contributor.author	Town, Jared Curtis
dc.date.accessioned	2023-08-25T20:06:39Z
dc.date.available	2023-08-25T20:06:39Z
dc.date.issued	2023-05
dc.identifier.uri	https://hdl.handle.net/11244/338973
dc.description.abstract	The focus of this thesis is behavior modeling for pilots of unmanned aerial vehicles. The pilot is assumed to make decisions that optimize an unknown cost functional, which is estimated from observed trajectories using a novel inverse reinforcement learning (IRL) framework. The resulting IRL problem often admits multiple solutions. Nonuniqueness necessitates the study of equivalent solutions, i.e., solutions that result in a different cost function but the same feedback matrix, and of convergence to such solutions. While offline algorithms that converge to equivalent solutions have been developed in the literature, online, real-time techniques that address nonuniqueness are not available. In this thesis, a regularized history stack observer that converges to approximately equivalent solutions of the IRL problem is developed. Novel data-richness conditions are developed to facilitate the analysis, and simulation results are provided to demonstrate the effectiveness of the developed technique.
dc.description.abstract	The novel IRL observer is then adapted to the pilot modeling problem. The observer is shown to converge to one of the equivalent solutions of the IRL problem. The developed technique is implemented on a quadcopter where the pilot is modeled as a linear quadratic regulator. Experimental results demonstrate the robustness of the method and its ability to learn an equivalent cost functional.
dc.format	application/pdf
dc.language	en_US
dc.rights	Copyright is held by the author who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction or distribution of this material.
dc.title	Nonuniqueness and equivalence in online inverse reinforcement learning with applications to pilot performance modeling
dc.contributor.committeeMember	Bai, He
dc.contributor.committeeMember	Faruque, Imraan
osu.filename	town_okstate_0664m_18158.pdf
osu.accesstype	Open Access
dc.type.genre	Thesis
dc.type.material	Text
thesis.degree.discipline	Mechanical and Aerospace Engineering
thesis.degree.grantor	Oklahoma State University

Files in this item

Name: Town_okstate_0664M_18158.pdf
Size: 1.460Mb
Format: PDF

This item appears in the following Collection(s)

OSU Theses [15752]
