The models are both built on Google Gemini, a multimodal foundation model that can process text, voice, and image data to ...