Gemini 3.1 Flash-Lite

Google

6.8/10

Provisional 0 practitioner reviews

Data source: This is a provisional model-fit score based on public benchmark signals, provider pricing/context data, and manual curation. It is not yet backed by approved practitioner reviews.

Context window

1,000,000 tokens

Input tokens per dollar

1,780,000

Output tokens per dollar

445,000

Dimension Breakdown

Tool Calling 6/10

Reliability of function/tool calling — correct schema adherence and parameter extraction

Cost Efficiency 10/10

Price per token relative to output quality for agent tasks

Latency 10/10

Response time — time to first token and total generation time under load

API Reliability 7/10

Uptime, rate limit headroom, and error rates in production

Context Quality 6/10

Long-context coherence and instruction following over turns

Share Your Experience

Have production experience with Gemini 3.1 Flash-Lite? Add the first real review to improve confidence in this score.

Submit a Review

Top Use Cases

General RAG

Summary

Google's ultra-budget model with 34/60 on Artificial Analysis Intelligence Index. Fastest model available (329 tokens/s) at just $0.56/1M tokens.

Sources

Practitioner Reviews

No reviews yet

Be the first to share your experience with Gemini 3.1 Flash-Lite.

Related Models

Last updated: May 8, 2026