Model Comparison

Author
google
Context Length1M

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.

Provider

Pricing

Input$0.10 / M tokens
Output$0.40 / M tokens
Images$0.026 / K

Endpoint Features

Quantization– –
Max Tokens (input + output)1M
Max Output Tokens8K
Stream cancellation– –
Supports Tools– –
No Prompt Training
Reasoning– –