Agent products get harder to use right when they get more capable.
AutomicVault putting skills inventory next to trend pulse, and UI-TARS-desktop picking up serious fork momentum, point at the same layer.
Once an agent can do ten things, the real questions change:
Which capability should I trust?
How do I discover the right one?
What state does it keep?
Can a teammate reuse it without inheriting my mess?
That is why skill registries, capability discovery, and control surfaces matter more than they sound.
Teams do not scale agents by adding one more model. They scale them by making capability legible.
The products that win this layer may look less magical in the demo and much more useful in week three.