Gemini 3.1 Pro debuts with 1M context window and agentic reliability

0
18

Gemini 3.1 Pro debuts with 1M context window and agentic reliability

Google released Gemini 3.1 Pro on Thursday. The model is available as a preview with a general release planned for the future. This iteration represents a significant advancement over the previous Gemini 3 release from November. The company shared benchmark data highlighting the model’s improved capabilities in professional task performance.

Independent benchmarks such as “Humanity’s Last Exam” demonstrate that Gemini 3.1 Pro performs significantly better than its predecessor. The previous version, Gemini 3, was already regarded as a highly capable AI tool upon its November release. These new statistics indicate a measurable jump in performance metrics. Google positioned the new model as one of the most powerful LLMs currently available.

Brendan Foody, CEO of AI startup Mercor, commented on the model’s performance using his company’s evaluation system. “Gemini 3.1 Pro is now at the top of the APEX-Agents leaderboard,” Foody stated in a social media post. The APEX system is designed to measure how well new AI models perform real professional tasks. Foody added that the model’s results demonstrate “how quickly agents are improving at real knowledge work.”

The release occurs amid intensifying competition within the AI industry. OpenAI and Anthropic have recently released new models focused on agentic work and multi-step reasoning. These companies continue to develop increasingly powerful LLMs designed for complex workflows. The market for advanced AI models remains active as major technology firms push for improvements in reasoning capabilities.

Video: Google

Featured image credit