Safe Reinforcement Learning for Self-Adaptive Systems

Project Seminar on Safe Reinforcement Learning for Self-Adaptive Systems

ABSTRACT

One significant challenge in traffic control systems is managing unprotected right turns at intersections, where the absence of dedicated turning signals can create collision risks. In this report we explore how to utilize reinforcement learning algorithms to dynamically adjust traffic light phases for mitigating collision risks while optimizing traffic flow. We use Simulation of Urban Mobility (SUMO) to simulate intersection traffic, integrating RL algorithms such as Deep Q-Learning (DQN), Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), and Trust Region Policy Optimization (TRPO) through SUMO-RL. In the report we investigate the impact of RL algorithms and speed limits on safety and efficiency, evaluating performance based on collision occurrences and time loss metrics. Results showTRPO’s superior safety performance, while A2C exhibits the smallest time loss. Further analysis shows the influence of speed limits on collision types. Additionally, we introduce a shield algorithm to enhance safety in A2C agents, observing a trade-off between safety improvement and performance degradation. The report concludes with avenues for future research, including optimizing shield distances and incorporating safety into RL rewards.

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
causal_discovery		causal_discovery
causal_inference		causal_inference
experiments		experiments
nets		nets
r		r
safe-rl		safe-rl
slides		slides
visualization		visualization
.RData		.RData
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Safe-RL-4-SAS.Rproj		Safe-RL-4-SAS.Rproj
collision.xml		collision.xml
environment.yml		environment.yml
info.log		info.log
run_evaluation.sh		run_evaluation.sh
run_evaluation_debug.sh		run_evaluation_debug.sh
run_experiments.sh		run_experiments.sh
run_experiments_safe_rl_report.sh		run_experiments_safe_rl_report.sh
safe-rl.ipynb		safe-rl.ipynb
safe-rl_report.ipynb		safe-rl_report.ipynb
statistics.xml		statistics.xml
train_models.sh		train_models.sh
tripinfo.xml		tripinfo.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Safe Reinforcement Learning for Self-Adaptive Systems

About

Releases

Packages

Contributors 3

Languages

License

hpi-sam/Safe-RL-4-SAS

Folders and files

Latest commit

History

Repository files navigation

Safe Reinforcement Learning for Self-Adaptive Systems

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages