The old Gemini API worked like a vending machine. Prompt in, answer out, no visibility into what happened between.
This update gives you a structured timeline of every step the model took: thinking, searching, tool calls, final output. Each one labeled and separated. You can now build automations that react to individual steps instead of parsing a blob.
The real reason for the change: Google is preparing the API for mid-flight steering (redirecting the model while it's working) and async tool calls (the model keeps processing while waiting on external data). Both are coming. The schema had to change first.
If you built anything on the Gemini Interactions API, the old format breaks June 6. Check your integrations.
Google AI Studio (@GoogleAIStudio)
we’re evolving the gemini interactions api to support rich, multi-step agentic workflows
instead of strict "user" and "model" roles, every action (from thinking to tool calls) is now represented as its own step
— https://nitter.net/GoogleAIStudio/status/2052487438967140700#m