Claude's new dynamic filtering feature cuts input tokens by 24% while improving search accuracy. Opus 4.6 hits 61.6% on BrowseComp benchmark. (Read More)Claude's new dynamic filtering feature cuts input tokens by 24% while improving search accuracy. Opus 4.6 hits 61.6% on BrowseComp benchmark. (Read More)

Anthropic Upgrades Claude AI Web Search Tools With 11% Accuracy Boost

2026/02/18 02:34
3 min read

Anthropic Upgrades Claude AI Web Search Tools With 11% Accuracy Boost

Caroline Bishop Feb 17, 2026 18:34

Claude's new dynamic filtering feature cuts input tokens by 24% while improving search accuracy. Opus 4.6 hits 61.6% on BrowseComp benchmark.

Anthropic Upgrades Claude AI Web Search Tools With 11% Accuracy Boost

Anthropic has rolled out a significant upgrade to Claude's web search capabilities, with the AI assistant now writing and executing code on the fly to filter search results before processing them. The improvement delivers an average 11% accuracy gain while consuming 24% fewer input tokens, according to the company's internal benchmarks.

The update, released alongside Claude Opus 4.6 and Sonnet 4.6, addresses a persistent challenge in AI-powered web search: context window bloat. Traditional search tools pull entire HTML files into memory, much of it irrelevant noise that degrades response quality and burns through tokens.

How Dynamic Filtering Works

Rather than reasoning over raw HTML dumps, Claude now dynamically generates code to post-process query results. The system keeps relevant data and discards the rest before anything hits the context window. Think of it as the AI building its own custom search scraper in real-time.

Anthropic tested the approach on two industry benchmarks. On BrowseComp—which measures an agent's ability to hunt down deliberately hard-to-find information across multiple websites—Opus 4.6 jumped from 45.3% to 61.6% accuracy. Sonnet 4.6 climbed from 33.3% to 46.6%.

DeepsearchQA, which tests systematic multi-step research with many correct answers, showed similar gains. Opus 4.6's F1 score rose from 69.8% to 77.3%, while Sonnet 4.6 improved from 52.6% to 59.4%.

Real-World Validation

Quora's Poe platform, which serves millions of users across 200+ AI models, has already tested the upgrade internally. "The model behaves like an actual researcher, writing Python to parse, filter, and cross-reference results rather than reasoning over raw HTML in context," said Gareth Jones, the company's Product and Research Lead. Quora found Opus 4.6 with dynamic filtering achieved the highest accuracy against other frontier models on their internal evaluations.

Token Economics Get Complicated

Cost implications vary by use case. Price-weighted tokens decreased for Sonnet 4.6 across both benchmarks, but actually increased for Opus 4.6—the more powerful model sometimes writes more complex filtering code. Anthropic recommends developers benchmark against their specific query patterns before deployment.

Dynamic filtering ships enabled by default for the new web search and web fetch tools on the Claude API. The company also graduated several related tools to general availability: code execution sandboxes, persistent memory across conversations, programmatic tool calling, and dynamic tool discovery.

For developers building search-heavy applications—think research assistants, citation verification tools, or competitive intelligence bots—the upgrade could meaningfully cut operational costs while improving output quality. The API documentation is live now on Claude's developer platform.

Image source: Shutterstock
  • anthropic
  • claude ai
  • web search
  • machine learning
  • api tools
Market Opportunity
Boost Logo
Boost Price(BOOST)
$0.0001176
$0.0001176$0.0001176
-20.00%
USD
Boost (BOOST) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Best Altcoins to Buy Now – Noomez ($NNZ) Defies the Crash

Best Altcoins to Buy Now – Noomez ($NNZ) Defies the Crash

The post Best Altcoins to Buy Now – Noomez ($NNZ) Defies the Crash appeared on BitcoinEthereumNews.com. Crypto Presales Wondering what are the best altcoins to buy right now? Noomez ($NNZ) stands out as the deflationary presale built to rise when markets fall. The best altcoins to buy now are the ones built to last when the market turns red. With volatility rising and traders looking for safety, many are asking what are the best altcoins to buy right now as they search for projects that can protect value during uncertainty. A handful are showing real strength through utility, deflation, and adoption. These tokens are structured to stay valuable even in correction phases. Among them, Noomez ($NNZ) stands out.  Its stage-based price system and automatic burns create measurable growth regardless of market sentiment, a design that rewards early entries before the next price jump hits. 5 Altcoins Built to Hold Value in Any Market Market uncertainty has pushed investors to look beyond speculation and toward structure. These five altcoins combine scarcity, utility, and real adoption, traits that can keep portfolios balanced even when prices dip.  And one, Noomez ($NNZ), is already proving that design can outperform sentiment. 1. Noomez ($NNZ) Noomez ($NNZ) has quickly become the next altcoin to explode, thanks to its built-in deflationary mechanics.  Now deep into Stage 2 at $0.0000123, the token has already climbed 23% from its launch price, with over 107 holders and $17,487 raised.  Every presale stage ends with a token burn and automatic price increase, creating scarcity that strengthens even when the wider market dips. The project’s structure, 280 billion fixed supply, 66% APY staking, 6–12-month vesting, and locked liquidity, makes it stand out as a long-term hedge, not just a short-term play.  With less than two days left before Stage 3 activates, new buyers are moving fast to secure entries before the next price floor resets higher. 2. Quant…
Share
BitcoinEthereumNews2025/11/08 20:10
Stripe-Backed Bridge Secures U.S. National Trust Banking License

Stripe-Backed Bridge Secures U.S. National Trust Banking License

The payment giant's stablecoin subsidiary is the latest crypto-native company to secure a banking license.
Share
Coinstats2026/02/18 05:28
Revolutionary Trio Accelerates Development To Dominate 2027 Market

Revolutionary Trio Accelerates Development To Dominate 2027 Market

The post Revolutionary Trio Accelerates Development To Dominate 2027 Market appeared on BitcoinEthereumNews.com. Apple AI Wearables: Revolutionary Trio Accelerates
Share
BitcoinEthereumNews2026/02/18 05:46