Gemini 3.1 Flash-Lite
Google - gemini-3.1-flash-lite
8.6/10
Provisional 0 practitioner reviews Source verifiedContext window
1,048,576 tokens
Input price
$0.25/1M
Output price
$1.5/1M
Checked
May 16, 2026
Pricing caveat: Standard paid-tier text/image/video pricing; batch and flex are lower.
Decision Notes
Best for
- - High-volume classification, translation, extraction, and lightweight support agents
- - RAG workflows where long context matters more than frontier reasoning depth
- - Cost-sensitive multimodal apps that still need tool calling and structured output
Avoid when
- - Deep reasoning or high-risk decisions that need a frontier model
- - Complex coding agents with many brittle tool decisions
Dimension Breakdown
Reliability of function/tool calling — correct schema adherence and parameter extraction
Price per token relative to output quality for agent tasks
Response time — time to first token and total generation time under load
Uptime, rate limit headroom, and error rates in production
Long-context coherence and instruction following over turns
Share Your Experience
Have production experience with Gemini 3.1 Flash-Lite? Add the first real review to improve confidence in this score.
Submit a ReviewTop Use Cases
Summary
A cost-efficient Gemini 3.1 option for high-volume, low-latency agent workloads. It is a practical baseline before paying for a frontier model.
Sources
Practitioner Reviews
No reviews yet
Be the first to share your experience with Gemini 3.1 Flash-Lite.
Related Models
- GPT-5.4 mini (OpenAI, 8.6/10)
- Claude Opus 4.7 (Anthropic, 8.4/10)
- DeepSeek V4 Pro (DeepSeek, 8.4/10)
Last updated: May 16, 2026