Skip to content

muddlebee/.agents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

Agentic Development Workflow

This is a full workflow from idea → shipped code → written-down context. Each stage uses a small, focused assistant plus reusable playbooks (agent-skills, awesome-copilot). The same setup works in Cursor, Claude Code, Codex, or anything that reads the repo’s instruction files.

Core repository structure + Agent configuration

Standard instruction files committed to the repo root. Every major agent runtime reads at least one - same conventions apply whether you're on Cursor, Claude Code, or Codex.

Path Purpose
AGENTS.md Top-level agent behavior, goals, boundaries
SPEC.md Feature spec, acceptance criteria, tasks
.github/workflows/ CI/CD pipeline definitions
.skills/ Skills installed via skills.sh
docs/ Raw sources + wiki (long-lived context)
test-results/ Test results

For language or framework-specific conventions (React, Next.js, Python, etc.), find and install the relevant community skill from skills.sh and reference it in your instruction files alongside the core agent skills.

MCP servers (tooling bridge).

Server Role
GitHub MCP Branches, PRs, issues, labels, PR comments (e.g. preview URL, E2E report)
Playwright MCP Drive the browser for real flows (pre-commit QA loop on local/preview, full E2E on staging)
Chrome DevTools MCP Console, network, runtime inspection on a live page
Custom / vendor Whatever your repo needs (e.g. Vercel, DB, Slack, company tools)

The Pipeline

Idea → Design → Spec → Code → Review → PR/Issues → CI/CD → E2E + debugging → Document/Wiki

1) Design

Agent: Designer

  • Establishes visual direction - branding, UI/UX, design system - before spec or code is written
  • Outputs (tokens, component names, screen flows) feed directly into SPEC.md

Tools used:

  • Superdesign - UI/UX + branding exploration; exports React/HTML/CSS, design tokens, and component specs

Skills used:

  • impeccable - builds on frontend-design with 7 domain references (typography, color/contrast, spatial, motion, interaction, responsive, UX writing).

Key commands:

  • /critique (UX review)
  • /audit (quality checks like performance, accessibility, etc.)
  • /normalize (design system alignment)

Anti-patterns it guards against: overused fonts (Inter/Arial) · gray text on color · pure black/gray neutrals · nested cards · bounce easing

2) Define + Plan

Agent: Planner

  • Refines raw idea → writes spec → breaks into atomic tasks with acceptance criteria

Skills used:

3) Build

Agent: Builder

  • Implements spec in small, testable slices - never one-shot

Skills used:

4) Self-Review (Pre-Commit Gate)

Agent: Reviewer

  • Runs a review pass before anything is staged - CI should never catch what this misses
  • For UI-facing work: Playwright QA loop on local or preview (Playwright MCP; details in playwright-interactive below) - inventory from spec + shipped behavior + PR claims; functional then visual pass; one persistent session; proof or documented gaps before commit

Skills used:

5) PR Creation

Agent: PR Writer (GitHub MCP + awesome-copilot)

  • Creates branch, writes PR description from spec, links issues - no manual writing

Skills used:

Issue resolution flow: agent reads open issue via GitHub MCP → generates fix branch → implements fix → opens PR with fixes #N

6) PR Review Loop

Agent: PR Reviewer (AI + CI)

  • Reviews like a staff engineer, enforces quality gates before merge

Skills used:

7) CI/CD + deploy

Pipeline: GitHub Actions + Vercel.

Skills used:

8) E2E Testing

Agent: Runner (Playwright MCP + Chrome DevTools MCP)

  • Executes real user flows on live preview · captures traces, logs, screenshots

Skills used:

Output: Structured test report posted back to PR as a review comment via GitHub MCP. Loop closes - acceptance criteria written in step 2, verified here.

9) Debugging + Failure Handling

Agent: Debugger

  • Investigates CI failures, test regressions, deploy issues autonomously

Skills used:

  • debugging-and-error-recovery - reproduce → isolate → hypothesize → fix → verify
  • Chrome DevTools MCP - live console + network inspection on the running preview

10) Document / Wiki

Agents: Feedback + Documenter — same job from two angles: turn what just happened into durable memory so the next session does not rediscover or undo it. Agents have no cross-session memory; the repo (AGENTS.md, wiki, ADRs) is the substitute.

Skills used:

Long-form capture (lives in docs/wiki/): specs + drift · ADRs · incident post-mortems · stable patterns · skills/MCP reference

Maintenance model: Inspired by Karpathy's LLM Wiki — raw sources in docs/raw/ (immutable), agent processes them into wiki pages; standing rules stay in AGENTS.md. Knowledge compounds instead of resetting.

Key Design Choices

  • Standard instruction files - AGENTS.md at the repo root so any agent runtime picks up the same conventions without re-prompting
  • Skill-driven - each step maps to a proven engineering workflow, not a one-off prompt
  • Small agents > one big agent - each phase has a clear input, output, and failure mode
  • Structured outputs everywhere - validated contracts at every boundary
  • Read before you write - every agent invocation starts with AGENTS.md + SPEC.md

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors