Creating Skills

Relevant source files

This page covers the methodology, format, and process for authoring new skills in the superpowers library. It explains how the TDD discipline is applied to process documentation, what makes a skill effective, and how to deploy and verify one before committing it.

For details on how existing skills are found and invoked by agents, see Finding and Invoking Skills. For the full reference on any individual built-in skill, see Key Skills Reference.

What This Process Is

Creating a skill is not writing documentation—it is Test-Driven Development applied to process documentation. The same Iron Law from test-driven-development applies here without modification:

NO SKILL WITHOUT A FAILING TEST FIRST

The canonical implementation of this process lives in `skills/writing-skills/SKILL.md`

TDD Mapping

Every concept in the TDD cycle has a direct equivalent in skill creation.

TDD Concepts Mapped to Skill Creation

TDD Concept	Skill Creation Equivalent
Test case	Pressure scenario dispatched to a subagent
Production code	The `SKILL.md` document
Watch test fail (RED)	Agent violates rule without the skill (baseline)
Watch test pass (GREEN)	Agent complies with skill present
Refactor	Find new rationalizations, add explicit counters, re-test
Write test first	Run baseline scenario before writing the skill
Minimal code	Skill addresses only the specific failures from baseline

Sources: `skills/writing-skills/SKILL.md32-44

Skill Types

Skills fall into three categories. Each type requires a different testing strategy (see Testing Skills with Pressure Scenarios).

Type	Definition	Examples
Technique	Concrete method with sequential steps	`condition-based-waiting`, `root-cause-tracing`
Pattern	Mental model for a class of problems	`flatten-with-flags`, `test-invariants`
Reference	API docs, syntax guides, command references	Office docs, library API guides

Sources: `skills/writing-skills/SKILL.md63-71

Directory Layout

Each skill lives in its own subdirectory under skills/. The only required file is SKILL.md.

skills/
  skill-name/
    SKILL.md              # Required — frontmatter + content
    supporting-file.*     # Optional — only for heavy reference or reusable tools

Supporting files are warranted only when reference material exceeds ~100 lines or when providing a reusable script/template. Everything else stays inline.

Sources: `skills/writing-skills/SKILL.md74-91

The RED-GREEN-REFACTOR Cycle for Skills

Skill Authoring Lifecycle

Sources: `skills/writing-skills/SKILL.md533-560

`SKILL.md` Structure and Code Entities

The diagram below maps the sections of a SKILL.md file to the code constructs and conventions that govern them.

SKILL.md Anatomy

Sources: `skills/writing-skills/SKILL.md93-137 `skills/writing-skills/SKILL.md140-197

Claude Search Optimization (CSO)

The description field in the YAML frontmatter controls whether an agent loads the skill at all. It must describe only triggering conditions, never the workflow inside the skill.

Why this matters: If the description summarizes the workflow, the agent may follow the description as a shortcut and skip the skill body entirely. This was confirmed by testing: a description mentioning "code review between tasks" caused agents to do one review instead of two, because they never read the flowchart that required a two-stage process.

Pattern	Example	Verdict
Triggering conditions only	`Use when executing implementation plans with independent tasks`	✅
Workflow summary	`Use when executing plans - dispatches subagent per task with code review between tasks`	❌
Too abstract	`For async testing`	❌
First person	`I can help you with async tests`	❌
Technology-specific (non-specific skill)	`Use when tests use setTimeout/sleep`	❌

The description should also include keyword coverage: error messages, symptom words, tool names, and synonyms that an agent would search for when encountering the relevant problem.

For full CSO guidance, see Claude Search Optimization (CSO).

Sources: `skills/writing-skills/SKILL.md140-197

Testing Strategies by Skill Type

Different skill types require different testing approaches. The table below maps each type to its test design and success criteria.

Skill Type	Test Approach	Success Criteria
Discipline-enforcing	Pressure scenarios with combined stressors (time + sunk cost + exhaustion)	Agent follows rule under maximum pressure
Technique	Application scenarios + variation/edge-case scenarios	Agent applies technique correctly to new scenario
Pattern	Recognition scenarios + counter-examples	Agent identifies when/how to apply the pattern
Reference	Retrieval scenarios + gap testing	Agent finds and correctly uses reference information

Sources: `skills/writing-skills/SKILL.md395-443

Bulletproofing Against Rationalization

Discipline-enforcing skills must actively resist the rationalizations agents produce under pressure. The writing-skills skill documents the concrete techniques:

Close every loophole explicitly — forbid specific workarounds by name, not just the general rule.
Address spirit-vs-letter arguments — add the phrase "Violating the letter of the rules is violating the spirit of the rules" early in the skill body.
Build a rationalization table — collect verbatim excuses from baseline testing and add a counter for each.
Create a Red Flags list — enumerate specific phrases or behaviors that indicate the agent is rationalizing.
Update CSO for violation symptoms — include in the description the conditions where a violation is about to happen.

Common Rationalizations and Their Reality

Excuse	Reality
"Skill is obviously clear"	Clear to you ≠ clear to other agents. Test it.
"Testing is overkill"	Untested skills always have issues. 15 min testing saves hours.
"I'll test if problems emerge"	Problems mean agents can't use the skill. Test before deploying.
"Academic review is enough"	Reading ≠ using. Test application scenarios.
"No time to test"	Deploying untested skill wastes more time fixing later.

Sources: `skills/writing-skills/SKILL.md459-523

The Iron Law and the DELETE Rule

The Iron Law applies to new skills and edits to existing skills.

NO SKILL WITHOUT A FAILING TEST FIRST

If code is written before the test, TDD requires deleting it and starting over. The same applies here:

Write skill before running baseline? Delete it. Start over.
Edit skill without re-testing? Same violation.

No exceptions:

Not for "simple additions"
Not for "just adding a section"
Not for "documentation updates"
Do not keep untested changes as "reference"
Delete means delete

Sources: `skills/writing-skills/SKILL.md374-393

Skill Discovery Flow

This diagram shows how an agent discovers and loads a skill, tracing through the code entities involved.

Skill Discovery: Natural Language to Code Entity Path

Sources: `skills/writing-skills/SKILL.md635-645 `skills/writing-skills/SKILL.md93-100

Deployment Checklist Summary

The full checklist is detailed in Skill Creation Checklist. The phases are:

RED Phase

Create pressure scenarios with 3+ combined pressures (for discipline skills)
Run scenarios without the skill — document baseline behavior verbatim
Identify patterns in rationalizations

GREEN Phase

Name uses only letters, numbers, hyphens
YAML frontmatter valid, max 1024 chars total
Description starts with Use when..., triggers only, third person
Skill body addresses the specific baseline failures from RED
Scenarios pass with skill present

REFACTOR Phase

Identify new rationalizations from testing
Add explicit counters and Red Flags list
Build rationalization table
Re-test until bulletproof

Deployment

Commit and push to fork
Contribute via PR if broadly useful (see Contributing Skills)

Sources: `skills/writing-skills/SKILL.md596-633

Child Pages

Page	What It Covers
Test-Driven Development for Skills	Full RED-GREEN-REFACTOR cycle adapted for skill authoring
SKILL.md Format and Structure	Required YAML fields and Markdown section conventions
Testing Skills with Pressure Scenarios	How to construct effective pressure scenarios
Claude Search Optimization (CSO)	How the `description` field controls discoverability
Skill Creation Checklist	Deployment checklist for each phase
Contributing Skills	Fork and pull request process for the superpowers-skills repository

Creating Skills

Relevant source files

For details on how existing skills are found and invoked by agents, see Finding and Invoking Skills. For the full reference on any individual built-in skill, see Key Skills Reference.

What This Process Is

Creating a skill is not writing documentation—it is Test-Driven Development applied to process documentation. The same Iron Law from test-driven-development applies here without modification:

NO SKILL WITHOUT A FAILING TEST FIRST

The canonical implementation of this process lives in `skills/writing-skills/SKILL.md`

TDD Mapping

Every concept in the TDD cycle has a direct equivalent in skill creation.

TDD Concepts Mapped to Skill Creation

TDD Concept	Skill Creation Equivalent
Test case	Pressure scenario dispatched to a subagent
Production code	The `SKILL.md` document
Watch test fail (RED)	Agent violates rule without the skill (baseline)
Watch test pass (GREEN)	Agent complies with skill present
Refactor	Find new rationalizations, add explicit counters, re-test
Write test first	Run baseline scenario before writing the skill
Minimal code	Skill addresses only the specific failures from baseline

Sources: `skills/writing-skills/SKILL.md32-44

Skill Types

Skills fall into three categories. Each type requires a different testing strategy (see Testing Skills with Pressure Scenarios).

Type	Definition	Examples
Technique	Concrete method with sequential steps	`condition-based-waiting`, `root-cause-tracing`
Pattern	Mental model for a class of problems	`flatten-with-flags`, `test-invariants`
Reference	API docs, syntax guides, command references	Office docs, library API guides

Sources: `skills/writing-skills/SKILL.md63-71

Directory Layout

Each skill lives in its own subdirectory under skills/. The only required file is SKILL.md.

skills/
  skill-name/
    SKILL.md              # Required — frontmatter + content
    supporting-file.*     # Optional — only for heavy reference or reusable tools

Supporting files are warranted only when reference material exceeds ~100 lines or when providing a reusable script/template. Everything else stays inline.

Sources: `skills/writing-skills/SKILL.md74-91

The RED-GREEN-REFACTOR Cycle for Skills

Skill Authoring Lifecycle

Sources: `skills/writing-skills/SKILL.md533-560

`SKILL.md` Structure and Code Entities

The diagram below maps the sections of a SKILL.md file to the code constructs and conventions that govern them.

SKILL.md Anatomy

Sources: `skills/writing-skills/SKILL.md93-137 `skills/writing-skills/SKILL.md140-197

Claude Search Optimization (CSO)

The description field in the YAML frontmatter controls whether an agent loads the skill at all. It must describe only triggering conditions, never the workflow inside the skill.

Pattern	Example	Verdict
Triggering conditions only	`Use when executing implementation plans with independent tasks`	✅
Workflow summary	`Use when executing plans - dispatches subagent per task with code review between tasks`	❌
Too abstract	`For async testing`	❌
First person	`I can help you with async tests`	❌
Technology-specific (non-specific skill)	`Use when tests use setTimeout/sleep`	❌

The description should also include keyword coverage: error messages, symptom words, tool names, and synonyms that an agent would search for when encountering the relevant problem.

For full CSO guidance, see Claude Search Optimization (CSO).

Sources: `skills/writing-skills/SKILL.md140-197

Testing Strategies by Skill Type

Different skill types require different testing approaches. The table below maps each type to its test design and success criteria.

Skill Type	Test Approach	Success Criteria
Discipline-enforcing	Pressure scenarios with combined stressors (time + sunk cost + exhaustion)	Agent follows rule under maximum pressure
Technique	Application scenarios + variation/edge-case scenarios	Agent applies technique correctly to new scenario
Pattern	Recognition scenarios + counter-examples	Agent identifies when/how to apply the pattern
Reference	Retrieval scenarios + gap testing	Agent finds and correctly uses reference information

Sources: `skills/writing-skills/SKILL.md395-443

Bulletproofing Against Rationalization

Discipline-enforcing skills must actively resist the rationalizations agents produce under pressure. The writing-skills skill documents the concrete techniques:

Close every loophole explicitly — forbid specific workarounds by name, not just the general rule.
Address spirit-vs-letter arguments — add the phrase "Violating the letter of the rules is violating the spirit of the rules" early in the skill body.
Build a rationalization table — collect verbatim excuses from baseline testing and add a counter for each.
Create a Red Flags list — enumerate specific phrases or behaviors that indicate the agent is rationalizing.
Update CSO for violation symptoms — include in the description the conditions where a violation is about to happen.

Common Rationalizations and Their Reality

Excuse	Reality
"Skill is obviously clear"	Clear to you ≠ clear to other agents. Test it.
"Testing is overkill"	Untested skills always have issues. 15 min testing saves hours.
"I'll test if problems emerge"	Problems mean agents can't use the skill. Test before deploying.
"Academic review is enough"	Reading ≠ using. Test application scenarios.
"No time to test"	Deploying untested skill wastes more time fixing later.

Sources: `skills/writing-skills/SKILL.md459-523

The Iron Law and the DELETE Rule

The Iron Law applies to new skills and edits to existing skills.

NO SKILL WITHOUT A FAILING TEST FIRST

If code is written before the test, TDD requires deleting it and starting over. The same applies here:

Write skill before running baseline? Delete it. Start over.
Edit skill without re-testing? Same violation.

No exceptions:

Not for "simple additions"
Not for "just adding a section"
Not for "documentation updates"
Do not keep untested changes as "reference"
Delete means delete

Sources: `skills/writing-skills/SKILL.md374-393

Skill Discovery Flow

This diagram shows how an agent discovers and loads a skill, tracing through the code entities involved.

Skill Discovery: Natural Language to Code Entity Path

Sources: `skills/writing-skills/SKILL.md635-645 `skills/writing-skills/SKILL.md93-100

Deployment Checklist Summary

The full checklist is detailed in Skill Creation Checklist. The phases are:

RED Phase

Create pressure scenarios with 3+ combined pressures (for discipline skills)
Run scenarios without the skill — document baseline behavior verbatim
Identify patterns in rationalizations

GREEN Phase

Name uses only letters, numbers, hyphens
YAML frontmatter valid, max 1024 chars total
Description starts with Use when..., triggers only, third person
Skill body addresses the specific baseline failures from RED
Scenarios pass with skill present

REFACTOR Phase

Identify new rationalizations from testing
Add explicit counters and Red Flags list
Build rationalization table
Re-test until bulletproof

Deployment

Commit and push to fork
Contribute via PR if broadly useful (see Contributing Skills)

Sources: `skills/writing-skills/SKILL.md596-633

Child Pages

Page	What It Covers
Test-Driven Development for Skills	Full RED-GREEN-REFACTOR cycle adapted for skill authoring
SKILL.md Format and Structure	Required YAML fields and Markdown section conventions
Testing Skills with Pressure Scenarios	How to construct effective pressure scenarios
Claude Search Optimization (CSO)	How the `description` field controls discoverability
Skill Creation Checklist	Deployment checklist for each phase
Contributing Skills	Fork and pull request process for the superpowers-skills repository

Creating Skills

What This Process Is

TDD Mapping

Skill Types

Directory Layout

The RED-GREEN-REFACTOR Cycle for Skills

`SKILL.md` Structure and Code Entities

Claude Search Optimization (CSO)

Testing Strategies by Skill Type

Bulletproofing Against Rationalization

The Iron Law and the DELETE Rule

Skill Discovery Flow

Deployment Checklist Summary

Child Pages

On this page

Creating Skills

What This Process Is

TDD Mapping

Skill Types

Directory Layout

The RED-GREEN-REFACTOR Cycle for Skills

`SKILL.md` Structure and Code Entities

Claude Search Optimization (CSO)

Testing Strategies by Skill Type

Bulletproofing Against Rationalization

The Iron Law and the DELETE Rule

Skill Discovery Flow

Deployment Checklist Summary

Child Pages

On this page

Creating Skills

What This Process Is

TDD Mapping

Skill Types

Directory Layout

The RED-GREEN-REFACTOR Cycle for Skills

SKILL.md Structure and Code Entities

Claude Search Optimization (CSO)

Testing Strategies by Skill Type

Bulletproofing Against Rationalization

The Iron Law and the DELETE Rule

Skill Discovery Flow

Deployment Checklist Summary

Child Pages

On this page

Creating Skills

What This Process Is

TDD Mapping

Skill Types

Directory Layout

The RED-GREEN-REFACTOR Cycle for Skills

SKILL.md Structure and Code Entities

Claude Search Optimization (CSO)

Testing Strategies by Skill Type

Bulletproofing Against Rationalization

The Iron Law and the DELETE Rule

Skill Discovery Flow

Deployment Checklist Summary

Child Pages

On this page

`SKILL.md` Structure and Code Entities

`SKILL.md` Structure and Code Entities