Half the multi-agent systems trending right now are 1 agent with 3 expensive context switches.
Cyril's piece is the cleanest map of the 4-agent shape circulating. Worth saving. Here's what breaks when you actually run it.
6 production failure modes to plan for before you split into 4 agents:
The handoff tax compounds.
Each agent re-reads the CLAUDE.md, the brief, the prior output. By the time the Distribution Agent fires, you've paid for the same context 4 times. Token cost goes 4x. So does context drift.The debug surface multiplies, not divides.
One bad output in a single-agent system is one prompt to fix. Same bug across 4 agents is tracing which handoff dropped the signal. Operations logs make this bearable. 90% of teams ship without them.Evals don't add up. They multiply.
4 agents means evals per agent, plus integration evals, plus regression evals on the handoff format. 50 examples per agent is 200 minimum to start. Most teams ship with zero and call it shipping.Latency is the silent killer.
4 sequential model calls is 4x p99 latency unless you've genuinely parallelized. The "agents work in parallel where the workflow allows" line hides a hard infra problem most solo builders won't solve in a weekend.Most "agents" are skills wearing a costume.
A research step that runs once with a prompt template isn't an agent. It's a tool call with a name. The naming doesn't change the cost or the behavior. It just makes the org chart look impressive.The orchestrator becomes the new bottleneck.
Every routing decision goes through it. Every failure recovery goes through it. The thing you built to coordinate is the thing you can't debug when something goes sideways at 2am.
The catch:
The 4-agent shape isn't wrong. It's the wrong starting point. Build one strong agent with 3 sharp skills. Watch where it actually breaks on real traffic. Split into 2 only when you have a logged failure pattern that requires isolation. Architecture earned from failures beats architecture copied from a post every time.
If your 4-agent system works the same when collapsed to one agent plus three skills, that's not architecture.
That's vocabulary.
CyrilXBT (@cyrilXBT)
x.com/i/article/205246749202…
— https://nitter.net/cyrilXBT/status/2054037093785928157#m