TL;DR Gemini 3.1 Pro achieves 77.1% on ARC-AGI-2 logic testing. Model keeps a 1M token context and expands output to 65k tokens. New custom tools endpoint improvesTL;DR Gemini 3.1 Pro achieves 77.1% on ARC-AGI-2 logic testing. Model keeps a 1M token context and expands output to 65k tokens. New custom tools endpoint improves

Google Rolls Out Gemini 3.1 Pro Upgrade With Strong Reasoning Gains

2026/02/20 06:38
Okuma süresi: 4 dk

TL;DR

  • Gemini 3.1 Pro achieves 77.1% on ARC-AGI-2 logic testing.
  • Model keeps a 1M token context and expands output to 65k tokens.
  • New custom tools endpoint improves file actions and coding agents.
  • Preview rolls out across Gemini app, Vertex AI, and developer tools.

Google has released Gemini 3.1 Pro, an updated model designed to improve complex reasoning, planning, and tool use across consumer and enterprise services. The company said the model more than doubles the ARC-AGI-2 score achieved by Gemini 3 Pro, delivering stronger performance in areas that require problem solving rather than simple text generation. The update is now rolling out in the Gemini app, Vertex AI, NotebookLM, and through developer tools.

Gemini 3.1 Pro reached a verified 77.1% on the ARC-AGI-2 benchmark. The benchmark measures a model’s ability to reason through new logic patterns not contained in training data. Google said the improvement supports agent-driven workloads, which depend on stable long-form reasoning across many steps in a task.

The release follows last week’s Gemini 3 Deep Think update, which targeted scientific and engineering use cases. Google said the new model builds on that work while offering wider access for developers and enterprise users.

Gemini 3.1 Pro Expanded Context Window and Output Capacity

Gemini 3.1 Pro supports a one million token input context window. This allows users to load full code repositories, research datasets, or long documents into a single request. Google said the model can maintain stable reasoning across files and data segments when the content spans hundreds of thousands of tokens.

The model also introduces a 65,000 token output window. This supports long-form generation, including technical manuals, structured reports, or multi-file code output. Google said this wider output window reduces task fragmentation, as large outputs can complete in a single response.

The company said these upgrades support developers who build autonomous agents. These agents often need to read large collections of files, move through complex directories, or generate long technical results.

Improved Benchmarks Across Logic, Coding, and Science

Google reported gains across several internal and external benchmarks. The model scored 94.1 percent on GPQA Diamond, which tests scientific reasoning. It reached 92.6 percent on MMMLU for multimodal understanding. The model also posted strong results on coding tests, including SWE-Bench Verified and LiveCodeBench Pro.

The company said the gains come from refinements in how the model allocates reasoning tokens. The structure is designed to reduce errors during long-horizon tasks and produce more stable outputs across dependent steps.

Google said the model can handle scientific workflows that need grounded reasoning or calculations. It can also support engineering teams that require robust code generation and complex debugging.

New Tools and Updated Agent Workflows

With this release, Google introduced a specialized endpoint called gemini-3.1-pro-preview-customtools. The endpoint is optimized for developers who use file system navigation, code search, and structured tool calls. The model is tuned to prioritize local tools, reducing the chance of unnecessary external searches.

The update also integrates with Google Antigravity, the company’s agent development platform. Developers can set a “medium” thinking level for tasks that need balanced depth and latency. Google said this option helps teams manage reasoning budgets while maintaining accuracy.

The Interactions API also includes a breaking change. The field total_reasoning_tokens is now named total_thought_tokens. Google said the change supports thought signatures, which preserve reasoning context for multi-turn workflows.

Pricing, Access, and Deployment Across Google Products

Pricing for Gemini 3.1 Pro Preview remains the same as the earlier model. Input tokens cost $2 per million for prompts under 200,000 tokens and $4 per million for larger prompts. Output tokens cost $12 per million for shorter prompts and $18 per million for longer prompts. Context caching remains available for workloads that require repeated calls.

The model is accessible through the Gemini API, Google AI Studio, Android Studio, and the Gemini CLI. Enterprise users can access the model through Vertex AI and Gemini Enterprise. Consumers can use the model in the Gemini app and NotebookLM with higher limits for paid subscribers.

Google said the preview period will allow the company to refine model behavior and safety before general availability. The company added that Gemini 3.1 Pro is positioned as a foundation for agentic AI systems that must reason through long tasks and work across complex environments.

The post Google Rolls Out Gemini 3.1 Pro Upgrade With Strong Reasoning Gains appeared first on CoinCentral.

Piyasa Fırsatı
Ucan fix life in1day Logosu
Ucan fix life in1day Fiyatı(1)
$0.0005642
$0.0005642$0.0005642
+1.58%
USD
Ucan fix life in1day (1) Canlı Fiyat Grafiği
Sorumluluk Reddi: Bu sitede yeniden yayınlanan makaleler, halka açık platformlardan alınmıştır ve yalnızca bilgilendirme amaçlıdır. MEXC'nin görüşlerini yansıtmayabilir. Tüm hakları telif sahiplerine aittir. Herhangi bir içeriğin üçüncü taraf haklarını ihlal ettiğini düşünüyorsanız, kaldırılması için lütfen [email protected] ile iletişime geçin. MEXC, içeriğin doğruluğu, eksiksizliği veya güncelliği konusunda hiçbir garanti vermez ve sağlanan bilgilere dayalı olarak alınan herhangi bir eylemden sorumlu değildir. İçerik, finansal, yasal veya diğer profesyonel tavsiye niteliğinde değildir ve MEXC tarafından bir tavsiye veya onay olarak değerlendirilmemelidir.

Ayrıca Şunları da Beğenebilirsiniz

Trading time: Tonight, the US GDP and the upcoming non-farm data will become the market focus. Institutions are bullish on BTC to $120,000 in the second quarter.

Trading time: Tonight, the US GDP and the upcoming non-farm data will become the market focus. Institutions are bullish on BTC to $120,000 in the second quarter.

Daily market key data review and trend analysis, produced by PANews.
Paylaş
PANews2025/04/30 13:50
Why LYNO’s Presale Could Trigger the Next Wave of Crypto FOMO After SOL and PEPE

Why LYNO’s Presale Could Trigger the Next Wave of Crypto FOMO After SOL and PEPE

The post Why LYNO’s Presale Could Trigger the Next Wave of Crypto FOMO After SOL and PEPE appeared on BitcoinEthereumNews.com. Cryptocirca has never been bereft of hype cycles and fear of missing out (FOMO). The case of Solana (SOL) and Pepe (PEPE) is one of the brightest examples that early investments into the correct projects may yield the returns that are drifting. Today there is an emerging rival in the limelight—LYNO. LYNO is in its presale stage, and already it is being compared to former breakout tokens, as many investors are speculating that LYNO will be the next big thing to ignite the market in a similar manner. Early Bird Presale: Lowest Price LYNO is in the Early Bird presale and costs only $0.050 for each token; the initial round will rise to $0.055. To date, approximately 629,165.744 tokens have been sold, with approximately $31,458.287 of that amount going towards the $100,000 project goal.  The crypto presales allow investors the privilege to acquire tokens at reduced prices before they become available to the general market, and they tend to bring substantial returns in the case of great fundamentals. The final goal of the project: 0.100 per token. This gradual development underscores increasing investor confidence and it brings a sense of urgency to those who wish to be first movers. LYNO’s Edge in a Competitive Market LYNO isn’t just another presale token—it’s a powerful AI-driven cross-chain arbitrage platform designed to deliver real utility and long-term growth. Operating across 15+ blockchains, LYNO’s AI engine analyzes token prices, liquidity, volume, and gas fees in real-time to identify the most profitable trade routes. It integrates with bridges like LayerZero, Wormhole, and Axelar, allowing assets to move instantly across networks, so no opportunity is missed.  The platform also includes community governance, letting $LYNO holders vote on protocol upgrades and fee structures, staking rewards for long-term investors, buyback-and-burn mechanisms to support token value, and audited smart…
Paylaş
BitcoinEthereumNews2025/09/18 16:11
Nvidia’s Strategic Masterstroke: Deepening Early-Stage Ties with India’s Booming AI Startup Ecosystem

Nvidia’s Strategic Masterstroke: Deepening Early-Stage Ties with India’s Booming AI Startup Ecosystem

BitcoinWorld Nvidia’s Strategic Masterstroke: Deepening Early-Stage Ties with India’s Booming AI Startup Ecosystem NEW DELHI, INDIA – October 2025: Nvidia Corporation
Paylaş
bitcoinworld2026/02/20 09:30