From bug to fix, automatically

Find bugs. Fix them. Ship clean code.

Oversight doesn't just review your PRs — it fixes them. Multi-agent code review, automated bug fixing, engagement tracking, and dispatch to Claude Code. From first observation to merged commit.

$ git clone https://github.com/CopilotKit/oversight && cd oversight

$ docker compose up

oversight.yaml

repos:
  - org: "CopilotKit"
    repo: "CopilotKit"
    review: true
    resolve: true

supabase:
  url: "https://xxx.supabase.co"
  key: "${SUPABASE_KEY}"

github:
  token: "${GITHUB_TOKEN}"

agents:
  review: 6
  model: "claude-opus-4-6"

terminal

End-to-End

Triage. Review. Fix. Merge.

Analyze

AI triages open PRs by risk, size, and urgency.

Review

6 specialized agents examine every angle of the diff.

Fix

Implements fixes, runs self-review until clean.

Push

Reorganizes commits by area of concern, pushes to branch.

See the resolve workflow →

terminal — oversight resolve

$ oversight resolve CopilotKit/CopilotKit#4301

⚡ Analyzing PR #4301...
  23 files changed, 847 additions, 312 deletions
  Risk: HIGH — streaming regression detected

⚠ Dispatching 6-agent review...
✓ code-reviewer        3 findings
✓ silent-failure-hunter 1 finding
✓ pr-test-analyzer     0 findings
✓ type-design-analyzer 0 findings
✓ comment-analyzer     0 findings
✓ code-simplifier      0 findings

● Findings: 3 critical, 1 high

⚡ Fixing issues...
  ✓ Race condition in flush()
  ✓ Missing backpressure check
  ✓ Stale buffer on reconnect

⚡ Self-review... 0 findings — clean
  Reorganizing commits... 3 by area of concern
  Pushing to branch... done

✓ PR #4301 ready to merge

Six agents. Every angle.
Zero blind spots.

🔍 code-reviewer

😈 silent-failure-hunter

✅ pr-test-analyzer

🔬 type-design-analyzer

💬 comment-analyzer

✂ code-simplifier

Standard 6-agent review for diffs under 2,000 lines. MSAL (module-scoped agent loops) for large, multi-module PRs.

agent dispatch

⚡ CR Round 1 — dispatching 6 agents
  diff: 1,247 lines across 8 files
  mode: standard (under 2,000 line threshold)

✓ Agent 1/6: code-reviewer
  Scope: full diff
  2 findings (1 critical, 1 high)

✓ Agent 2/6: silent-failure-hunter
  Scope: error paths, catch blocks
  1 finding (1 critical)

✓ Agent 3/6: pr-test-analyzer
  Scope: test coverage, assertions
  0 findings

✓ Agents 4-6: all clean

MSAL mode — large PRs

⚡ CR Round 1 — MSAL activated
  diff: 4,891 lines across 23 files
  mode: module-scoped (over 2,000 line threshold)

⚠ Partitioning into 4 modules:
  Module A: runtime/     (1,247 lines)
  Module B: sdk/         (1,103 lines)
  Module C: react-core/  (1,402 lines)
  Module D: shared/      (1,139 lines)

⚡ Dispatching 6 agents per module...
  24 total agent invocations
  Each module gets a fresh, unbiased review

✓ All modules reviewed — 5 findings total

Everything you need

🔧

Automated Bug Fixing

Finds issues AND implements the fix. Review, fix, self-review, push — fully automated from observation to merged commit.

📈

PR Triage

AI-ranked by risk, size, and urgency. Surfaces the most critical PRs first so nothing slips through the cracks.

🤖

Multi-Agent CR

6 specialized review agents examine every angle. MSAL for large diffs. Zero blind spots.

🔄

Resolve Workflow

Review, fix, self-review, push. The full loop from triage to clean commits, hands-free.

📝

Implementation Plans

Translates observations into granular execution steps. From "what's wrong" to "here's exactly how to fix it."

⚡

Dispatch to Claude Code

Automated plan execution via Claude Code. Oversight creates the plan, Claude Code implements it.

📈

Engagement Health

Active, stale, and blocked PR tracking. Know which PRs need attention before they rot.

🔗

Merge Conflict Resolution

Automated detection and fixing of merge conflicts. Keeps branches clean without manual intervention.

💬

CopilotKit Chat

Natural language analysis interaction. Ask questions about your PRs, get context-aware answers.

How oversight compares

Capability	Oversight	Manual Review	CodeRabbit	GitHub Copilot
Automated bug fixing	Built-in ✓	✗	✗	✗
Multi-agent review (6 agents)	Built-in ✓	1 reviewer	1 AI agent	1 AI agent
Engagement tracking	Built-in ✓	manual	✗	✗
Implementation plans	Built-in ✓	✗	✗	✗
Dispatch execution	Claude Code ✓	✗	✗	✗
Fork PR handling	Built-in ✓	✓	✓	✓
Self-hosted	Docker ✓	✓	SaaS only	SaaS only
Multi-repo	Built-in ✓	manual	✓	✓
Notion sync	Built-in ✓	✗	✗	✗
Customizable prompts	YAML config ✓	N/A	Limited	✗
CopilotKit integration	Native ✓	✗	✗	✗