ArXiv

Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs

Authors
Jose E. Aguilar Escamilla, Lingdong Zhou, Xiangqi Zhu...
Categories
cs.AI, cs.CY, cs.GT, cs.LG
arXiv
https://arxiv.org/abs/2605.12462v1
PDF
https://arxiv.org/pdf/2605.12462v1

Brief

DR-Gym is an open-source Gymnasium-compatible environment that trains and evaluates demand-response from the electric utility perspective, addressing the missing feedback loop in offline smart-meter datasets. The work pairs physics-based building demand profiles with a regime-switching wholesale price model calibrated to real-world extreme events and a configurable multi-objective reward. Unlike device-level simulators, DR-Gym targets market-level utility decision-making and demonstrates realistic, learnable scenarios using baseline strategies.

Why it matters

DR-Gym: an open-source, Gymnasium-compatible simulator introduced on arXiv 2026-05-12 by Jose E. Aguilar Escamilla, Lingdong Zhou, Xiangqi Zhu, and Huazheng Wang for training electric-utility-level demand-response policies.

Key details

  • Simulator includes a regime-switching wholesale price model calibrated to real-world extreme events, physics-based building demand profiles, and a configurable multi-objective reward; authors demonstrate realism and learnability via baseline strategies and data snapshots.
Source evidence

Abstract

Extreme weather and volatile wholesale electricity markets expose residential consumers to catastrophic financial risks, yet demand response at the distribution level remains an underutilized tool for grid flexibility and energy affordability. While a demand-response program can shield consumers by issuing financial credits during high-price periods, optimizing this sequential decision-making process presents a unique challenge for reinforcement learning despite the plentiful offline historical smart meter and wholesale pricing data available publicly. Offline historical data fails to capture the dynamic, interactive feedback loop between an electric utility's pricing signals and customer acceptance and adaptation to a demand-response program. To address this, we introduce DR-Gym, an open-source, online Gymnasium-compatible environment designed to train and evaluate demand-response from the electric utility's perspective. Unlike existing device-level energy simulators, our environment focuses on the market-level electric utility setting and provides a rich observational space relevant to the electric utility. The simulator additionally features a regime-switching wholesale price model calibrated to real-world extreme events, alongside physics-based building demand profiles. For our learning signal, we use a configurable, multi-objective reward function for specifying diverse learning objectives. We demonstrate through baseline strategies and data snapshots the capability of our simulator to create realistic and learnable environments.