AI / ML3 min read
Google Gemini 3.1 Flash-Lite reaches general availability for enterprise agents
Tags AI ยท Enterprise ยท Infrastructure
Google Cloud Blogยท
Google announced Gemini 3.1 Flash-Lite is generally available as of May 7-8, 2026, priced at $0.25 per million input tokens and $1.50 per million output tokens โ half the cost of Gemini 3 Flash. The model supports text, image, video, audio, and PDF inputs across a 1M token context window with 65,536 output tokens, and supports full thinking levels (minimal, low, medium, high) for cost/performance trade-offs. Early adopters include Gladly (customer service, 60% cost reduction vs thinking-tier models), OffDeal (AI agent for investment bankers), Ramp, and AlphaSense. The preview model gemini-3.1-flash-lite-preview shuts down May 25, 2026.