Hybrid Shielded-CMDP Framework for Safe Smart Grid Control

Author: Hasan Muhaidat (supervised by Dr. Zekun Guo)
Affiliation: University of Hull
Core Implementation: Conservative Bounds Shielding for Distribution Network Voltage Control

Executive Summary

This repository implements a Hybrid Shielded-CMDP Framework. The architecture combines First-Order Constrained Optimization in Policy Space (FOCOPS) with a deterministic safety shield. By projecting agent actions into conservative bounds (10% power margin), the framework achieves 100% constraint satisfaction in active distribution networks (ADNs).

Scientific Contributions

Deterministic Safety Guarantee: Introduces a structural safety mechanism that ensures constraint satisfaction via hard-clipping rather than standard Lagrangian penalty optimization.
Performance Gains: Demonstrated a 7.60% increase in cumulative reward and a 5.1% reduction in episodic cost compared to unshielded baselines.
Positive Scaling: Empirical evidence shows the shield's effectiveness increases with system complexity (1.92× improvement when scaling from IEEE 33-bus to 69-bus systems).
Real-time Feasibility: Shielding logic adds negligible computational overhead (0.18 ms/action latency).

Technical Architecture

[Input: State] -> [FOCOPS Policy] -> [Unsafe Action] 
                                          |
                                          v
[Conservative Bounds Shield] <--- [Deterministic Projection]
                                          |
                                          v
[Result: 100% Safe Action] -> [Pandapower / IEEE Environment]

Experimental Results

Results validated on IEEE 33-bus and 69-bus systems across 1M interactions (n=3 independent seeds).

Metric	Baseline (FOCOPS)	Hybrid Shielded	Improvement	p-value
Cumulative Reward	-2381.67 ± 305.63	-2200.74 ± 295.32	+7.6%	0.0489*
Episodic Cost	-4.05M ± 53k	-3.84M ± 28k	-5.1%	0.0045†
Shield Latency	N/A	0.18 ms	N/A	N/A

(*) p < 0.05, (†) p < 0.01

Repository Structure

/rl_constrained_smartgrid_control: Core environment package (Gymnasium-compatible).
shield_model.py: Implementation of the conservative bounds projection logic.
launch_focops_hybrid.py: Main training script for the shielded agent.
Notebooks/: Documentation of shield mechanics and statistical validation.
final_results/: Trained model checkpoints and TensorBoard logs.

Quick Start

# Clone and Install
git clone https://github.com/HZM99/Shielded-CMDP-Optimization.git
cd Shielded-CMDP-Optimization
conda env create -f omnisafe310_env.yml
pip install -e .

# Run Shielded Training (69-bus)
python launch_focops_hybrid.py --env-id IEEE69-Hybrid-v0 --epochs 100 --seed 0

Citation

If you use this framework in your research, please cite:

@mastersthesis{muhaidat2025shield,
  author = {Muhaidat, Hasan},
  title = {Safe Reinforcement Learning for Smart Grid Control: Conservative Bounds Shielding},
  school = {University of Hull},
  year = {2025},
  note = {MSc Dissertation}
}

Contact

Hasan Muhaidat
h.muhaidat-2024@hull.ac.uk
[ORCID: 0009-0009-0036-9581]

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
final_results		final_results
final_results_33bus		final_results_33bus
ppo_shielded_tensorboard		ppo_shielded_tensorboard
rl_constrained_smartgrid_control		rl_constrained_smartgrid_control
tests		tests
train/FOCOPS_smoke/FOCOPS_IEEE33Wrapper		train/FOCOPS_smoke/FOCOPS_IEEE33Wrapper
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
33bus_statistical_results.txt		33bus_statistical_results.txt
3_Statistical_Analysis.ipynb		3_Statistical_Analysis.ipynb
4_Visualization_Results.ipynb		4_Visualization_Results.ipynb
COMPREHENSIVE_33BUS_ANALYSIS.md		COMPREHENSIVE_33BUS_ANALYSIS.md
COMPUTATIONAL_OVERHEAD_ANALYSIS.md		COMPUTATIONAL_OVERHEAD_ANALYSIS.md
CROSS_SYSTEM_COMPARISON.md		CROSS_SYSTEM_COMPARISON.md
FAILURE_CASE_ANALYSIS.md		FAILURE_CASE_ANALYSIS.md
LICENSE		LICENSE
LIST_OF_FIGURES.md		LIST_OF_FIGURES.md
LIST_OF_SYMBOLS.md		LIST_OF_SYMBOLS.md
LIST_OF_TABLES.md		LIST_OF_TABLES.md
MSC_APPENDIX_A_PARAMETERS.md		MSC_APPENDIX_A_PARAMETERS.md
MSC_APPENDIX_C_CODE.md		MSC_APPENDIX_C_CODE.md
MSC_APPENDIX_D_TOPOLOGY.md		MSC_APPENDIX_D_TOPOLOGY.md
MULTI_SEED_ANALYSIS.md		MULTI_SEED_ANALYSIS.md
Notebook1_Shield_Implementation.ipynb		Notebook1_Shield_Implementation.ipynb
Notebook2_Training_Demo.ipynb		Notebook2_Training_Demo.ipynb
README.md		README.md
RESULTS_SUMMARY.csv		RESULTS_SUMMARY.csv
actor_mlp_wrapper.py		actor_mlp_wrapper.py
custom_focops.py		custom_focops.py
custom_logger.py		custom_logger.py
debug_violations.py		debug_violations.py
focops_cfg.yaml		focops_cfg.yaml
ieee33_wrapper.py		ieee33_wrapper.py
ieee69_wrapper.py		ieee69_wrapper.py
launch_focops.py		launch_focops.py
launch_focops_hybrid.py		launch_focops_hybrid.py
launch_ppo_lag.py		launch_ppo_lag.py
lightweight_smoke_test_script.py		lightweight_smoke_test_script.py
omnisafe310_env.yml		omnisafe310_env.yml
omnisafe_ieee33_wrapper.py		omnisafe_ieee33_wrapper.py
poetry.lock		poetry.lock
ppo_lagrangian_cfg.yaml		ppo_lagrangian_cfg.yaml
pyproject.toml		pyproject.toml
register_envs.py		register_envs.py
results_summary_table.csv		results_summary_table.csv
results_summary_table.tex		results_summary_table.tex
run_33bus_experiments.ps1		run_33bus_experiments.ps1
run_a2c.py		run_a2c.py
run_a2c_69bus.py		run_a2c_69bus.py
run_all_33bus.ps1		run_all_33bus.ps1
run_ppo.py		run_ppo.py
run_ppo_69bus.py		run_ppo_69bus.py
shield_model.py		shield_model.py
shield_model_conservative.py		shield_model_conservative.py
shield_model_lookahead.py		shield_model_lookahead.py
shield_wrapper.py		shield_wrapper.py
simulation_shield.py		simulation_shield.py
train_shielded_ppo.py		train_shielded_ppo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hybrid Shielded-CMDP Framework for Safe Smart Grid Control

Executive Summary

Scientific Contributions

Technical Architecture

Experimental Results

Repository Structure

Quick Start

Citation

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hybrid Shielded-CMDP Framework for Safe Smart Grid Control

Executive Summary

Scientific Contributions

Technical Architecture

Experimental Results

Repository Structure

Quick Start

Citation

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages