Model Comparison
Authorgoogle
Context Length1M
Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.
Pricing
Input$0.10 / M tokens
Output$0.40 / M tokens
Images$0.026 / K
Endpoint Features
Quantization– –
Max Tokens (input + output)1M
Max Output Tokens8K
Stream cancellation– –
Supports Tools– –
No Prompt Training
Reasoning– –