Key Takeaways
- A hidden model selector in Google App v17.18.22 reveals seven previously unreported AI model options for Gemini Live voice conversations.
- Three of the codenames — “Capybara,” “Nitrogen” and a “personalization” variant — do not appear in any prior Google documentation. Two new “RC2” models appeared overnight.
- The feature is currently hidden behind a server-side flag and appears to be an internal testing tool, but the underlying infrastructure could support a consumer-facing model picker in the future.
Google has been testing multiple unknown AI models within Gemini Live, hinting at a possible upgrade to the voice-controlled chatbot in time for Google I/O 2026 later this month.
While investigating an unrelated unreleased feature in the Google app code, a previously unseen settings cog button caught my eye. Enabling the button in the code revealed a new model selector menu that allows the user to change the AI model Gemini Live uses to power its voice chats.
The menu, currently hidden behind a server-side flag, reveals seven AI model options I haven’t seen before, including codenames “Capybara,” “Nitrogen” and a specialized personalization model. The displayed menu options are as follows:
- Default
- A2A_Rev25_RC2
- A2A_Rev25_RC2_Thinking
- A2A_Rev23_P13n
- A2A_Nitrogen_Rev23
- A2A_Capybara
- A2A_Capybara_Exp
- A2A_Native_Input
Of these, “A2A_Rev25_RC2” and “A2A_Rev25_RC2_Thinking” appeared overnight on May 8, showing that Google now has two new audio-to-audio models at the Release Candidate 2 stage, nearing production readiness. The presence of a Thinking model is particularly interesting, as it suggests a variant with enhanced reasoning capabilities may soon be available.
What The Code Reveals
Currently, Gemini Live uses only one model — Gemini 3.1 Flash Live, a native input model designed to process raw audio and video streams directly. The existence of multiple new models strongly suggests that Google is trying out some alternatives internally before a public release.
No public information exists for these models, but there are a few small clues in the naming. Here, “A2A” most likely stands for Audio-to-Audio, Google’s term for models that process speech and audio directly, rather than converting them to text first.
The P13n Variant
In the screenshot above we see a model labeled “P13n,” a shortened form of the word “personalization.” This hints at a specialized Gemini variant with additional personalization and behavioral features baked directly into the model.
Why This Matters
While the regular Gemini interface currently offers users a choice between Fast, Thinking and Pro models, Gemini Live currently offers no such option.
Switchable models would allow the company to provide a more powerful voice assistant to customers willing to pay for it, or perhaps allow users to trade Gemini Live’s snappy responses for more thoughtful ones that take a little longer.
What We Don’t Know
Nitrogen And Capybara Variants
Neither Capybara nor Nitrogen appear in any prior Google documentation. However, terms like “Rev25” and “Exp” suggest that the company has already been through several revisions of these models and likely has both stable and experimental versions of the Capybara model under test.
The list of available models is delivered by Google’s servers, meaning the company can add or remove models without an app update.
We don’t know at this point whether Google is about to bring model selection to Gemini Live users, or whether this is purely an internal testing tool. The model selector interface, as it stands, remains unpolished and isn’t ready for release.
My testing confirms that the selected model name is transmitted to Google’s servers when a voice session begins, but it remains unclear whether functionally different models are actually served in response.
I’ll be watching Google I/O 2026 closely to see whether Capybara gets a public name.


