Publication Detail
Constrained Reinforcement Learning for Fair and Environmentally Efficient Traffic Signal Controllers
UCD-ITS-RP-24-50 Journal Article
Available online at
https://doi.org/10.1145/3676169
|
Suggested Citation:
Haydari, Ammar, Vaneet Aggarwal, Michael Zhang, Chen-Nee Chuah (2024)
Constrained Reinforcement Learning for Fair and Environmentally Efficient Traffic Signal Controllers
. Journal on Autonomous Transportation SystemsTraffic signal controller (TSC) has a crucial role in managing traffic flow in urban areas. Recently, reinforcement learning models have received a great attention for TSC with promising results. However, these RL-TSC models still need to be improved for real-world deployment due to limited exploration of different performance metrics such as fair traffic scheduling or air quality impact. In this work, we introduce a constrained multi-objective RL model that minimizes multiple constrained objectives while achieving a higher expected reward. Furthermore, our proposed RL strategy integrates the peak and average constraint models to the RL problem formulation with maximum entropy off-policy models. We applied this strategy to a single TSC and a network of TSCs. As part of this constrained RL-TSC formulation, we discuss fairness and air quality parameters as constraints for the close-loop control system optimization model at TSCs called FAirLight. Our experimental analysis shows that the proposed FAirLight achieves a good traffic flow performance in terms of average waiting time while being fair and environmentally friendly. Our method outperforms the baseline models and allows a more comprehensive view of RL-TSC regarding its applicability to the real world.
Key words:
reinforcement learning, traffic signal controller, constrained optimization, multi-objective learning, multi-agent systems