Reinforcement Learning for Cognitive Phased Array Radar Surveillance

Flandermeyer, Shane

dc.contributor.advisor	Metcalf, Justin
dc.contributor.author	Flandermeyer, Shane
dc.date.accessioned	2023-04-17T15:06:29Z
dc.date.available	2023-04-17T15:06:29Z
dc.date.issued	2023-05-12
dc.identifier.uri	https://hdl.handle.net/11244/337410
dc.description.abstract	The proliferation of phased array radar (PAR) has significantly increased the flexibility of radar systems, making it possible to use a single radar to perform a variety of operational modes such as surveillance and tracking that each traditionally required a dedicated system. To fully take advantage of these capabilities, algorithms must be developed to efficiently distribute the radar's finite time, energy, and processing budget between competing tasks. Although many resource management methods exist for tracking applications, it is common to use a fixed strategy (e.g., a raster scan) for the surveillance task. The resulting allocation of resources is often sub-optimal since fixed approaches do not leverage prior knowledge and thus spend a disproportionate amount of time searching regions that are unlikely to contain new targets. This thesis presents a novel approach to more effectively utilize the radar timeline in surveillance and track initiation tasks. A variant of particle swarm optimization (PSO) is derived to estimate the density of untracked targets in the search volume, which is then used to inform the parameter selection process for each radar dwell. The resulting method, known as Surveillance PSO (SPSO), is computationally efficient and suitable for real-time implementation on a general-purpose CPU or GPU. SPSO is also highly general, making few assumptions about the properties of the target or the underlying radar system. Finally, the output of the algorithm is a constant-length tensor that can be incorporated into systems that utilize deep learning and reinforcement learning. Two cognitive agents are developed to demonstrate the utility of the SPSO algorithm. The first is a deterministic agent that directly uses the output of the SPSO algorithm to make decisions on where to steer the radar beam at each dwell. 
The second is a reinforcement learning (RL) agent that uses a slight modification of SPSO to simultaneously steer and spoil the transmitted beam based on the current environment. The performance of each agent is evaluated in several simulated surveillance environments, where both are shown to outperform the standard raster scan approach.	en_US
dc.language	en_US	en_US
dc.rights	Attribution 4.0 International	*
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	*
dc.subject	Phased Array Radar	en_US
dc.subject	Reinforcement Learning	en_US
dc.subject	Resource Management	en_US
dc.subject	Machine Learning	en_US
dc.title	Reinforcement Learning for Cognitive Phased Array Radar Surveillance	en_US
dc.contributor.committeeMember	Goodman, Nathan
dc.contributor.committeeMember	Hougen, Dean
dc.date.manuscript	2023-04
dc.thesis.degree	Master of Science	en_US
ou.group	Gallogly College of Engineering::School of Electrical and Computer Engineering	en_US