Trusted Release Boundary - Benchmark

A reproducible benchmark showing that CI architecture, not just SHA pinning, is what materially limits supply-chain attacks like CVE-2025-30066 (tj-actions/changed-files).

The Finding

Tier	Architecture	Score	Annual Cost
1	No security	10/100	$0
2	SHA-pinned (typical AI advice)	20/100	$0
3	Trusted Release Boundary	75/100	$0
4	Enterprise (egress + attestation)	83/100	enterprise-style overhead

Tier 3 closes the largest security gap at zero tooling cost.

Why You Should Care

Most CI hardening advice stops at:

pin actions by SHA
reduce GITHUB_TOKEN permissions
add selective hardening later

Those are useful, but they do not solve the core problem:

If untrusted third-party code runs in the same job as secrets and release authority, a compromised action can still steal secrets, poison artifacts, and exfiltrate data.

This benchmark isolates that exact question by running the same malicious action against four different workflow architectures.

What Was Tested

A simulated malicious GitHub Action, modeled on the behavior class exposed by CVE-2025-30066, ran the same six attack behaviors in every tier:

Environment variable dumping
GITHUB_TOKEN permission probing
Process memory access checks
Network exfiltration attempts
Artifact poisoning
Source enumeration

The only changing variable was the workflow design.

Full benchmark results ->

Full technical paper ->

Reproduce It Yourself

Fork this repository
Add four dummy secrets
Create a production environment with required reviewers
Run the workflows in order
Compare the artifact hashes and logs

Step-by-step reproduction guide ->

The Framework: 6 Rules

Rule	Name	Purpose
0	PIN	Use immutable SHA references for all external actions
1	QUARANTINE	Untrusted lane gets no secrets and no write authority
2	ISOLATE	Trusted lane is separate and first-party only
3	REBUILD	Trusted lane rebuilds from source on a fresh runner
4	ARTIFACT QUARANTINE	Only metadata crosses the boundary, never untrusted binaries
5	VALIDATE	Outputs crossing the boundary are explicitly sanitized

Source Hash (Ground Truth)

GitHub-hosted runners built the artifacts from Linux checkouts with LF line endings. That normalized source hash is the ground truth for clean artifact comparison:

$ sha256sum src/app.js
c4657bc50ab6be26c54354f5304097ead527c46dbf2d72e0efbc35b1727b5988  src/app.js

Evidence

Caveats

This repo is public, so source exposure impact is capped relative to a private-repo benchmark
The malicious action is a controlled simulation, not a live attacker
Windows CRLF vs Linux LF normalization affected raw local hashes; artifact verification used the normalized Linux hash above
Tier 4 initially failed attestation until Sigstore endpoints were allowlisted through the hardened release egress policy

Repo Layout

.github/actions/malicious-tool/   simulated compromised action
.github/workflows/                baseline and tier workflows
evidence/                         extracted logs, artifacts, score sheets, comparison table
scripts/                          local verification helpers
src/app.js                        deterministic source input
REPRODUCE.md                      end-to-end rerun instructions
RESULTS.md                        narrative benchmark results
PRESENTATION.md                   short presentation / video notes

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github		.github
evidence		evidence
scripts		scripts
src		src
.gitignore		.gitignore
ANOMALIES.md		ANOMALIES.md
BENCHMARK_ROADMAP.md		BENCHMARK_ROADMAP.md
LICENSE		LICENSE
PAPER.md		PAPER.md
PRESENTATION.md		PRESENTATION.md
README.md		README.md
RELEASE_CHECKLIST.md		RELEASE_CHECKLIST.md
REPRODUCE.md		REPRODUCE.md
RESULTS.md		RESULTS.md
TrustedReleaseBoundary.txt		TrustedReleaseBoundary.txt
deploy.sh		deploy.sh
expected-results.md		expected-results.md
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Trusted Release Boundary - Benchmark

The Finding

Why You Should Care

What Was Tested

Reproduce It Yourself

The Framework: 6 Rules

Source Hash (Ground Truth)

Evidence

Caveats

Repo Layout

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Trusted Release Boundary - Benchmark

The Finding

Why You Should Care

What Was Tested

Reproduce It Yourself

The Framework: 6 Rules

Source Hash (Ground Truth)

Evidence

Caveats

Repo Layout

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages