Show simple item record

dc.contributor.advisor    Cai, Jie
dc.contributor.author    Sanchez, Jerson
dc.date.accessioned    2023-12-15T15:00:39Z
dc.date.available    2023-12-15T15:00:39Z
dc.date.issued    2023-12-15
dc.identifier.uri    https://hdl.handle.net/11244/340052
dc.description.abstract    Recent advancements in model-free control strategies, such as reinforcement learning (RL), have led to more practical and scalable solutions for building energy system controls. These strategies do not require complex models of building dynamics and rely exclusively on data to learn the control policy. Applications of these techniques in heating, ventilation, and air-conditioning (HVAC) systems are being studied under different operational scenarios, including demand response programs. Conventional (unconstrained) reinforcement learning controllers often address indoor comfort constraints by incorporating a comfort-violation penalty in the reward function. While this approach can perform well in terms of energy cost, it often leads to significant constraint violations when a small penalty factor is used. On the other hand, constraints can be enforced effectively with a larger penalty factor, but at the cost of degraded economic performance. Hence, a clear trade-off between economic performance and constraint satisfaction poses a challenge to overcome. Motivated by this challenge, this thesis presents a constrained RL-based control strategy for building demand response. The proposed strategy handles the constraints explicitly, avoiding arbitrarily set penalty factors that can significantly impact control performance. To demonstrate its efficacy, simulation tests of the proposed strategy, as well as baseline model predictive controllers (MPC) and conventional (unconstrained) policy optimization methods, were conducted. The simulation tests showed that the constrained RL strategy achieved utility cost savings of up to 16.1%, comparable to the MPC baselines, without requiring any model of the building and with minimal constraint violation. In contrast, the unconstrained RL controllers led to either high utility costs or constraint violations, depending on the penalty factor setting.    en_US
dc.language    en_US    en_US
dc.subject    Demand response    en_US
dc.subject    Building controls    en_US
dc.subject    Constrained reinforcement learning    en_US
dc.subject    HVAC systems    en_US
dc.title    Development and assessment of constrained reinforcement learning-based controller for building demand response    en_US
dc.contributor.committeeMember    Song, Li
dc.contributor.committeeMember    Zhang, Dong
dc.date.manuscript    2023
dc.thesis.degree    Master of Science    en_US
ou.group    Gallogly College of Engineering::School of Aerospace and Mechanical Engineering    en_US
shareok.orcid    0009-0007-7387-0655    en_US
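
The abstract above contrasts unconstrained RL, where comfort enters the reward through a hand-tuned penalty factor, with constrained RL, where comfort is treated as an explicit constraint. As a purely illustrative aid, the minimal Python sketch below shows one common way that contrast is expressed in code: a fixed comfort-violation penalty folded into the reward versus a Lagrangian-style multiplier adapted during training. The names (penalized_reward, LagrangianComfortConstraint, energy_cost, comfort_violation, violation_budget) and the dual-ascent update are assumptions for illustration only, not the controller developed in the thesis.

# Illustrative sketch only -- NOT the controller developed in the thesis.
# (a) Unconstrained RL: comfort handled via a fixed penalty in the reward.
# (b) Constrained RL (Lagrangian-style): the multiplier is learned, not hand-picked.
# All names and values here are hypothetical placeholders.

def penalized_reward(energy_cost: float, comfort_violation: float,
                     penalty_factor: float) -> float:
    """(a) Conventional approach: comfort enters the reward through a fixed penalty."""
    return -energy_cost - penalty_factor * comfort_violation


class LagrangianComfortConstraint:
    """(b) Constrained approach: the penalty weight (Lagrange multiplier) adapts.

    The reward stays the pure economic objective; the multiplier grows whenever
    the average comfort violation exceeds the allowed budget, so constraint
    pressure adjusts automatically during training instead of being set by hand.
    """

    def __init__(self, violation_budget: float, step_size: float = 0.01):
        self.violation_budget = violation_budget
        self.step_size = step_size
        self.lmbda = 0.0

    def shaped_reward(self, energy_cost: float, comfort_violation: float) -> float:
        # Policy is trained on the economic reward minus the lambda-weighted constraint cost.
        return -energy_cost - self.lmbda * comfort_violation

    def update(self, avg_violation: float) -> None:
        # Dual ascent on the constraint: increase lambda while the budget is exceeded.
        self.lmbda = max(0.0, self.lmbda + self.step_size *
                         (avg_violation - self.violation_budget))


# Example usage: after each training episode, update the multiplier from the
# measured average comfort violation (numbers are arbitrary).
constraint = LagrangianComfortConstraint(violation_budget=0.5)
r = constraint.shaped_reward(energy_cost=1.2, comfort_violation=0.8)
constraint.update(avg_violation=0.8)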

