Skill replay
What you'll find here
The three records that compose skill replay — fixture, observation, result — and the boundary that keeps replays reproducible without raw payloads.
Failures block promotion.
Failed replay results block skill promotion until reviewed.
Records
SkillReplayFixtureSkillReplayObservationSkillReplayResultreplay_skill_fixture — pass/fail status · reason · expected and observed outcome · missing output refs · unexpected output refs · telemetry id · receipt ids · timestamp.Fixture expectations
Redacted and evidence-backed.
Replay fixtures reference case files, worker results, telemetry, receipts, and evidence by id — never raw prompts, outputs, traces, stdout, stderr, payloads, responses, or credentials.