Publication Detail

Constrained Reinforcement Learning for Fair and Environmentally Efficient Traffic Signal Controllers

Reference Number: UCD-ITS-RP-24-50

Series: Journal Article

Available online at https://doi.org/10.1145/3676169

Suggested Citation:
Haydari, Ammar, Vaneet Aggarwal, Michael Zhang, Chen-Nee Chuah (2024) Constrained Reinforcement Learning for Fair and Environmentally Efficient Traffic Signal Controllers. Journal on Autonomous Transportation Systems

Abstract:

Traffic signal controllers (TSCs) play a crucial role in managing traffic flow in urban areas. Recently, reinforcement learning (RL) models have received great attention for TSC, with promising results. However, these RL-TSC models still need to be improved for real-world deployment because performance metrics such as fair traffic scheduling and air quality impact have seen limited exploration. In this work, we introduce a constrained multi-objective RL model that minimizes multiple constrained objectives while achieving a higher expected reward. Furthermore, our proposed RL strategy integrates the peak and average constraint models into the RL problem formulation with maximum entropy off-policy models. We applied this strategy to a single TSC and a network of TSCs. As part of this constrained RL-TSC formulation, we discuss fairness and air quality parameters as constraints for the closed-loop control system optimization model at TSCs, which we call FAirLight. Our experimental analysis shows that the proposed FAirLight achieves good traffic flow performance in terms of average waiting time while being fair and environmentally friendly. Our method outperforms the baseline models and allows a more comprehensive view of RL-TSC regarding its applicability to the real world.

Key words:

reinforcement learning, traffic signal controller, constrained optimization, multi-objective learning, multi-agent systems