Gemini 3.1 Flash-Lite
6.8/10
Provisional, 0 practitioner reviews
Context window
1,000,000 tokens
Input tokens per dollar
1,780,000
Output tokens per dollar
445,000
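The tokens-per-dollar figures above can be inverted into the more common "dollars per 1M tokens" form. The sketch below does only that arithmetic on the two numbers listed on this page. The function names and the sample request size are illustrative assumptions, not part of any provider API.

```python
# Figures copied from this page; everything else here is illustrative.
INPUT_TOKENS_PER_DOLLAR = 1_780_000
OUTPUT_TOKENS_PER_DOLLAR = 445_000


def dollars_per_million(tokens_per_dollar: float) -> float:
    """Convert a tokens-per-dollar figure into dollars per 1M tokens."""
    if tokens_per_dollar <= 0:
        raise ValueError("tokens_per_dollar must be positive")
    return 1_000_000 / tokens_per_dollar


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the dollar cost of one request from the listed rates."""
    return (input_tokens / INPUT_TOKENS_PER_DOLLAR
            + output_tokens / OUTPUT_TOKENS_PER_DOLLAR)


input_price = dollars_per_million(INPUT_TOKENS_PER_DOLLAR)    # roughly 0.56
output_price = dollars_per_million(OUTPUT_TOKENS_PER_DOLLAR)  # roughly 2.25

# Hypothetical agent step: 10,000 prompt tokens and 1,000 generated tokens.
example_cost = request_cost(10_000, 1_000)

print(f"input:  ${input_price:.2f} per 1M tokens")
print(f"output: ${output_price:.2f} per 1M tokens")
print(f"example request: ${example_cost:.4f}")
```

Under these listed rates, output tokens cost four times as much as input tokens, so agent workloads with long generations are dominated by the output side.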
Dimension Breakdown
Reliability of function/tool calling — correct schema adherence and parameter extraction
Price per token relative to output quality for agent tasks
Response time — time to first token and total generation time under load
Uptime, rate limit headroom, and error rates in production
Long-context coherence and instruction following across multiple turns
Summary
Google's ultra-budget model, scoring 34/60 on the Artificial Analysis Intelligence Index. Listed as the fastest model available (329 tokens/s), at $0.56 per 1M tokens.
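To put the 329 tokens/s figure in practical terms, the sketch below estimates how long a response of a given length would take to generate. It assumes the quoted rate is a sustained output rate and ignores time to first token, which this page does not report. The function name is hypothetical.

```python
# Throughput figure copied from the summary above.
LISTED_TOKENS_PER_SECOND = 329.0


def generation_seconds(output_tokens: int,
                       tokens_per_second: float = LISTED_TOKENS_PER_SECOND) -> float:
    """Estimate generation time, excluding time to first token."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Hypothetical response lengths for illustration.
for n in (250, 1_000, 4_000):
    print(f"{n:>5} tokens: about {generation_seconds(n):.1f} s")
```

Real latency under load may differ, since rate limits and queueing are not captured by a single throughput number.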
Practitioner Reviews
No reviews yet
Related Models
- GPT-5.5 Instant (OpenAI, 10/10)
- Claude Opus 4.7 (Anthropic, 9.5/10)
- Gemini 3.1 Pro (Google, 9.5/10)
Last updated: May 8, 2026