Claude Code Community Meetup

AI Agents
& The Path to AGI

From Evolutionary Algorithms to Multi-Agent Swarms

Rach Pradhan
March 2026

About Me

BUILDER

Rach Pradhan

Researcher & serial builder. Creator of MerJS (Zig web framework), TurboAPI (7x faster FastAPI), and NanoBrew (fastest macOS package manager, 3ms installs).

2x exited founder.

RESEARCH

Credited In

ADAS (ICLR 2025) — Automated Design of Agentic Systems. Meta-agents that design better agents.

Omni-EPIC (ICLR 2025) — Open-ended AI generating increasingly complex tasks & environments in code.

Deeply embedded in the evolutionary algorithms & agentic AI space

What is an AI Agent?

$7.6B
AI agent market 2025
$47B
Projected by 2030
43%
CAGR growth rate

The Evolution

Chatbots
Pattern matching
LLMs
Text generation
Tool-Using LLMs
Function calling
Agents
Autonomous reasoning

The key shift: from responding to doing

An Evolutionary Lens

BIOLOGICAL

Natural Selection

Populations adapt through mutation, crossover, and selection pressure over generations

ARTIFICIAL (ADAS)

Agent Evolution

Meta-agents that iteratively design better agents in code — outperforming hand-designed systems across coding, science, and math

Same principle: selection pressure drives optimization

Part II

Claude Opus 4.6
Almost AGI?

1M context • Extended thinking • Tool use • Multi-turn reasoning

Opus 4.6 By The Numbers

82.1%
SWE-bench Verified
97.8%
HumanEval (coding)
91.3%
GPQA Diamond
1M
Context tokens
65%
Fewer tokens used
+144
Elo vs GPT-5.2
72.7%
OSWorld (computer use)

Sources: Anthropic, SWE-bench leaderboard, GPQA benchmark suite

What Opus Excels At

AGENTIC CODING

Long-Horizon Tasks

Handles complex multi-file refactors autonomously. Reaches peak performance in 4 iterations where others can't match after 10.

TOOL MASTERY

Hundreds of Tools

Tool Search jumps accuracy from 79.5% to 88.1%. Works across files, terminals, browsers, APIs seamlessly.

DEEP CONTEXT

1M Tokens

MRCR v2: 76% vs Sonnet's 18.5%. Retrieval and reasoning across massive codebases without losing the thread.

Middle Management is Cooked

Coordinate. Delegate. Summarize. Route information. Sound like a job description — or a prompt?

Task delegation
92%
Status reporting
95%
Code review triage
88%
Sprint planning
75%
People leadership
15%

McKinsey: 57% of U.S. work hours are automatable with current AI

The Real Job Impact

Cooked

  • × Ticket shufflers & status reporters
  • × Copy-paste code monkeys
  • × "Let me sync with the team" middlemen
  • × Manual QA on happy paths
  • × Meeting schedulers as a career
vs

Thriving

  • + System architects & designers
  • + Agent orchestrators (the new 10x)
  • + People leaders with real empathy
  • + Product thinkers & domain experts
  • + People who build the swarms
92M
Jobs displaced by 2030 (WEF)
170M
New jobs created (WEF)
300M
Jobs exposed (Goldman Sachs)
Part III

One Agent is Good.
A Swarm is Better.

What if you could run 5 Opus-level agents in parallel?

What is a Multi-Agentic Swarm?

THE PROBLEM

One Agent, One Thread

A single agent works sequentially. Read file A, then B, then C. For a 50-file refactor, you're waiting forever.

THE SOLUTION

Many Agents, In Parallel

A swarm decomposes the task, assigns pieces to specialized agents running simultaneously, then synthesizes one coherent result.

One person painting a house vs a coordinated crew finishing it in a day

What Does This Actually Mean?

How Orchestration Works

Opus as the brain. Sonnet as the hands. Haiku as the eyes.

STEP 1: DECOMPOSE

Orchestrator (Opus)

Reads your prompt. Understands the full codebase via CodeGraph. Breaks the task into independent sub-tasks with dependencies.

STEP 2: EXECUTE

Workers (Sonnet)

Up to 100 parallel agents, each with a specialized role (finder, fixer, reviewer). Zig threads — real parallelism, not async.

STEP 3: SYNTHESIZE

Synthesizer (Sonnet)

Merges all worker outputs into one coherent result. Resolves conflicts. Generates a unified report or PR.

Architecture matters more than agent count — centralized orchestration contains error amplification to 4.4x vs 17.2x for naive parallelism

Introducing DevSwarm

The Swarm Pattern

You
"Find all memory leaks"
Orchestrator
Opus decomposes
Worker 1
Worker 2
Worker 3
Worker N
Synthesizer
Combines results

8 Specialized Roles

Finder Sonnet

Search & locate

Reviewer Sonnet

Read-only review

Fixer Sonnet

Writable edits

Explorer Sonnet

Deep exploration

Architect Opus

System design

Orchestrator Opus

Decompose tasks

Synthesizer Sonnet

Combine outputs

Monitor Haiku

Lightweight checks

37 MCP Tools

One binary. Zero npm. Plugs into Claude Code, Codex, or Amp.

run_swarm
run_agent
run_task
review_fix_loop
decompose_feature
get_next_task
prioritize_issues
create_branch
create_pr
merge_pr
commit_with_context
blast_radius
find_callers
find_dependents
relevant_context
symbol_at
Agents Planning Git/PR Code Intel

The Review-Fix Loop

Evolutionary convergence, applied to code

Reviewer
Read-only
Fixer
Patch
Re-Review
Verify
Converge
0 issues

Repeats up to 5 iterations until the reviewer finds zero remaining issues

Like biological selection: each cycle improves fitness until convergence

Find Me on GitHub

R

@justrach

Creator of DevSwarm, MerJS, TurboAPI, NanoBrew & Codegraff

DevSwarm
MerJS
TurboAPI
NanoBrew
Codegraff

github.com/justrach

Sidequest

Codegraff.com

SIMD-accelerated CLI tools that replace Claude Code's built-in file, search, and edit tools. Written in Zig. Zero dependencies. $99/year — includes DevSwarm + CodeDB.

zigrep
SIMD search
zigread
Structural reader
zigpatch
Atomic patcher
zigcreate
File creator
zigdiff
Git diff + stats
zigmemo
Persistent memory
zigpar
Parallel ops
zigharness
Multi-agent
muonry
MCP daemon
codedb
Code intel DB
$ muonry — persistent MCP daemon
create  python/test.py  ‹ 339.0µs
  bytes:3024, lines:71
create  python/pyproject.toml  ‹ 373.0µs
  bytes:371, lines:14
patch  src/handler.zig  ‹ 0.7ms
  L42-58 replaced, 16 lines
outline  src/main.zig  ‹ 245.0µs
  12 symbols, 191 tokens (vs 6,826 raw)
0.7ms
Avg op latency (muonry)
260KB
Single binary each
0
Dependencies

codegraff.com    If it ever gets open sourced — everyone gets refunded!

90% Fewer Tokens

muonry's structural reads slash token cost vs raw file dumps

Operation Raw dump muonry Savings
ls dir84131
outline file6,82619197%
symbol read6,8261,38379%
search88880%
edit response6,82616997%
TOTAL20,6501,96290%

Typical code-editing workflow: ls → outline → symbol read → search → edit

Raw dump = cat entire file into context. muonry = structural reads (outline, symbol, patch-and-return).

DEMO TIME

Let's See It Live

DevSwarm in action — spawning a multi-agent swarm from Claude Code

run_swarm
Orchestrate
Parallel workers
Results

These Slides? MerJS.

This entire deck is a single .zig file, served by merjs at 115K req/s

MERJS

Zero Node.js

Next.js-style framework in Zig. File-based routing, SSR, comptime HTML builder. 260KB binary. Zero node_modules.

PERFORMANCE

115K req/s

0.39ms avg latency. <5ms cold start. Vs Next.js: ~2K req/s, ~77ms latency, ~300MB node_modules.

github.com/justrach/merjs

Thank You.

Questions? Let's talk agents, swarms, and Zig.

DevSwarm QR
DevSwarm
MerJS QR
MerJS
Codegraff QR
Codegraff
GitHub QR
@justrach
Rach Pradhan
@rachpradhan
Claude Code Community
Slides built with MerJS
arrows / space