Model · Google · Text
Gemini 3.1 Flash-Lite
Google's low-latency Gemini 3-series workhorse for straightforward multimodal tasks at scale. It is designed for high-frequency agent routing, extraction, translation, and summarization work.
- Modality
- Text
- License
- Proprietary (Proprietary)
- Context window
- 1,048,576 tokens
- Released
- May 7, 2026
- Last verified
- May 10, 2026
- Runs locally
- No
- Also handles
- vision, speech-to-text
Strengths
- 1M-token context window at the cheapest Gemini 3 tier
- Supports text, image, video, audio, and PDF inputs
- Good fit for routing, extraction, summarization, and high-volume translation
Weaknesses
- Less depth than Gemini 3.1 Pro on open-ended work
- Closed weights
Try it
| Where | Type | Notes |
|---|---|---|
| Google AI Studio | hosted-api | Free tier / paid scale-up |
| Vertex AI | hosted-api | GCP enterprise |
Used in solutions
Version history
- Gemini 3.1 Flash-Lite May 2026 Current
- Gemini 3.1 Flash Live Preview May 2026
- Gemini 3.1 Pro Preview May 2026
- Gemini 3 Flash Preview May 2026
Official sources
- Model docs docs
Change log
- — Added after verification of the Gemini 3 family rollout in current Google model docs.
Esc