Benchmarks measure what an agent knows in controlled settings.

Bridge is betting that real-world tasks are where the actual gap between agents gets decided.

Bridge (@bridge_surf)

Today, Bridge officially begins testing.

For a long time, AI has mostly been a place to chat.
We think the next step is letting agents safely use your computer to finish real work.

Bridge is our first step toward that.

Join the test: bit.ly/4dkJeGn

AgenticAI #bridge

Video

— https://nitter.net/bridge_surf/status/2054600056263623046#m