Benchmarks measure what an agent knows in controlled settings.
Bridge is betting that real-world tasks are where the actual gap between agents gets decided.
Bridge (@bridge_surf)
Today, Bridge officially begins testing.
For a long time, AI has mostly been a place to chat.
We think the next step is letting agents safely use your computer to finish real work.
Bridge is our first step toward that.
Join the test: bit.ly/4dkJeGn
AgenticAI #bridge
Video
— https://nitter.net/bridge_surf/status/2054600056263623046#m