SOUL Grader — Setup Guide¶

Source: cobibean/soul-grader-skill (25⭐, community skill) Category: Agent Identity & Governance

Grade, review, rewrite, or approve a Hermes Agent SOUL.md identity file using a narrow, research-backed rubric. The bundled grading standard is the only normative source — no generic prompt-engineering advice or personal taste. Scores 11 categories out of 100 with automatic fail conditions for secrets, false claims, and missing safety gates.

Installation¶

npx skills add cobibean/soul-grader-skill

Or clone directly for manual inspection:

git clone https://github.com/cobibean/soul-grader-skill.git

Prerequisites¶

Requirement	Details
Hermes Agent	Any version. SOUL.md loading behavior is consistent across releases.
Target SOUL.md	A SOUL.md file to grade (local path or inline text).
Adjacent files (optional)	`CLAUDE.md`, `AGENTS.md`, manifests, or operator guides — only needed for contradiction checks and artifact separation scoring.

Key Capabilities¶

Core Features¶

Capability	How to Trigger	Notes
Full SOUL grade	"Grade this SOUL.md" or "Review my agent SOUL"	Returns 11-category score table, automatic blockers, drift risks, suggested wording
Quick grade	"Quick grade this SOUL"	Returns score/verdict, deployability, biggest issue, top 3 fixes
SOUL rewriting	"Rewrite this weak SOUL" or "Make this SOUL stronger"	Patches from bundled research artifacts, not generic advice
Fleet grading	"Grade all my agent SOULs"	Uses fleet workflow reference — active/retired classification, contradiction checks
Deployability check	"Is this SOUL ready to deploy?"	Runs automatic fail scan first (secrets, false claims, missing gates)
Deep research	"Do a SOUL research project"	Swarm deliverable pattern — independent research lanes, static HTML review surface

Grading Rubric (11 Categories, 100 Points)¶

Category	Points	What It Evaluates
Mission clarity	15	Names who/what the agent serves and what outcome matters
Identity + negations	12	What the agent is and what it must not become
Core thesis	10	Durable decision lens about the user/domain/problem
Optimization hierarchy	10	Ranks tradeoffs instead of listing virtues
Hard constraints	10	3–5 true filters with approval/override semantics
Soft preferences	8	Separates scoring signals from bans
Authority + escalation	10	Allowed / ask-before / never boundaries
Voice + truthfulness	10	Tone, vocabulary, never-claims, evidence thresholds
Success / artifacts	8	Durable/verifiable completion definitions
Artifact separation	5	Keeps commands, workflows, secrets elsewhere
Runtime hygiene	2	Fits Hermes loading behavior, avoids hidden assumptions

Automatic Fail Conditions¶

Secrets, tokens, passwords, API keys in SOUL
False or unverified claims of access, deployment state, or authority
Ungated spend, publishing, outreach, destructive edits
Cross-client data/credential contamination
Assuming YAML frontmatter is hidden from Hermes (it's visible prompt text)
Contradictions with nearby operating files or approval policy

Verdict Bands¶

Score	Verdict	Meaning
90–100	Excellent	Production-grade; keep reviewed as scope changes
75–89	Operational	Usable; patch missing layers before high-risk autonomy
60–74	Scaffold	Serviceable draft; needs constraints, negations, or success artifacts
0–59	Needs rewrite	Rewrite from mission/constraints upward
Any auto-fail	Not deployable	Blocker must be resolved before deployment

CorpusIQ Use Cases¶

Use Case	How
New agent onboarding	Grade every new agent's SOUL.md before deployment — catch missing gates, false claims, and secret leaks
Agent identity drift detection	Re-grade SOUL.md monthly; compare scores to catch mission drift, constraint erosion, or scope creep
Client agent handoff	Grade client-facing agent SOULs before delivery — ensures isolation, approval, and credential language is correct
SOUL authoring	Use the wording standards and rubric to author strong SOUL.md files from scratch — operational language, no vibes
Fleet audit	Run fleet-wide grading across all deployed Hermes agents — classify active/retired, catch contradictions, generate remediation reports

Troubleshooting¶

Issue	Fix
Bundled references missing	The grader stops and reports "source bundle missing" — re-clone the skill or check `references/` directory
Frontmatter treated as hidden	Hermes native SOUL loading injects SOUL as prompt text; YAML is visible. Move secrets to env vars or config
Session cache ignores SOUL fix	A corrected SOUL may not affect a running session — restart Hermes or start a new session
Adjacent file contradictions	If CLAUDE.md or AGENTS.md conflicts with SOUL, the grader flags it — decide which file owns the rule

Verification¶

# Verify skill installed
hermes skills list | grep soul-grader

# Quick functional test — load the grading standard reference
# (run inside a Hermes session)
skill_view(name="soul-grader", file_path="references/soul-md-grading-standard.md")

← Skills Catalog | Home → Powered by CorpusIQ