Gemini 2.0 Flash is now available as an experimental preview release through the Gemini Developer API and Google AI Studio. The model introduces new features and enhanced core capabilities:
- Multimodal Live API: This new API helps you create real-time vision and audio streaming applications with tool use.
- Speed and performance: Gemini 2.0 has a significantly improved time to first token (TTFT) over 1.5 Flash.
- Quality: Better performance across most benchmarks than Gemini 1.5 Pro.
- Improved agentic capabilities: Gemini 2.0 delivers improvements to multimodal understanding, coding, complex instruction following, and function calling.
- New modalities: Gemini 2.0 introduces native image generation and controllable text-to-speech capabilities.