Together Evaluations now benchmarks proprietary AI models from OpenAI, Anthropic, and Google against open-source alternatives, claiming 10x cost savings. (Read Together Evaluations now benchmarks proprietary AI models from OpenAI, Anthropic, and Google against open-source alternatives, claiming 10x cost savings. (Read

Together AI Opens Evaluations to OpenAI, Anthropic, Google Models

2026/02/03 04:01
2 min read
For feedback or concerns regarding this content, please contact us at [email protected]

Together AI Opens Evaluations to OpenAI, Anthropic, Google Models

Lawrence Jengar Feb 02, 2026 20:01

Together Evaluations now benchmarks proprietary AI models from OpenAI, Anthropic, and Google against open-source alternatives, claiming 10x cost savings.

Together AI Opens Evaluations to OpenAI, Anthropic, Google Models

Together AI has expanded its Evaluations platform to support direct benchmarking against proprietary models from OpenAI, Anthropic, and Google—a move that could reshape how enterprises make AI infrastructure decisions.

The update, announced February 3, enables side-by-side comparisons between open-source models and closed-source alternatives including GPT-5, Claude Sonnet 4.5, and Gemini 2.5 Pro. For AI-focused crypto projects and decentralized compute networks, this creates a standardized framework for proving cost-efficiency claims.

What's Actually New

Together Evaluations now accepts models from three major providers as both evaluation targets and judges:

OpenAI: GPT-5, GPT-5.2
Anthropic: Claude Sonnet 4.5, Claude Haiku 4.5, Claude Opus 4.5
Google: Gemini 2.5 Pro, Gemini 2.5 Flash

The platform also supports any OpenAI Chat Completions-compatible URL, meaning self-hosted and decentralized inference endpoints can plug directly into the benchmarking system.

The Cost Argument Gets Data

Together AI published accompanying research showing fine-tuned open-source judges (GPT-OSS 120B, Qwen3 235B) outperforming GPT-5.2 as evaluators—62.63% accuracy versus 61.62%—while running at reportedly 10x lower cost and 15x higher speed.

That's a specific, testable claim. For decentralized AI networks competing on inference pricing, having a neutral benchmarking platform that accepts custom endpoints could prove valuable for customer acquisition.

The company, founded in 2020 and known for research innovations like FlashAttention-3, has positioned itself as infrastructure-agnostic. Its platform already offers access to over 200 open-source models with claimed 4x faster inference and 11x lower cost compared to GPT-4o, according to December 2024 benchmarks.

Why This Matters for Crypto AI

Several blockchain-based AI projects—from decentralized GPU marketplaces to inference networks—have struggled to prove their cost advantages aren't just marketing. A third-party evaluation framework that accepts any compatible endpoint changes that dynamic.

The Evaluations API runs on Together's Batch API at roughly 50% lower cost than real-time inference, making large-scale model comparisons economically viable for smaller teams.

Together AI remains a private company with no associated token. But its tooling increasingly touches the infrastructure layer where crypto AI projects compete—and now those projects have a standardized way to benchmark against the incumbents they're trying to displace.

Image source: Shutterstock
  • together ai
  • ai infrastructure
  • llm benchmarking
  • open source ai
  • enterprise ai
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Spot Bitcoin ETFs Face Outflows Despite Strong March Inflows

Spot Bitcoin ETFs Face Outflows Despite Strong March Inflows

Spot Bitcoin ETFs continue to attract attention as market dynamics shift rapidly. Recent data shows a short term pullback in investor activity. However, the broader
Share
Coinfomania2026/03/21 18:45
Strategy CEO: If Morgan Stanley allocates 2% to Bitcoin, it will bring in approximately $160 billion in funds.

Strategy CEO: If Morgan Stanley allocates 2% to Bitcoin, it will bring in approximately $160 billion in funds.

PANews reported on March 21 that, regarding Morgan Stanley's second revised S-1 filing for a spot Bitcoin ETF, Strategy CEO Phong Le stated that Morgan Stanley
Share
PANews2026/03/21 17:58
Fed’s 25bps cut sparks Bitcoin repricing: October breakout ahead?

Fed’s 25bps cut sparks Bitcoin repricing: October breakout ahead?

The post Fed’s 25bps cut sparks Bitcoin repricing: October breakout ahead? appeared on BitcoinEthereumNews.com. Journalist Posted: September 18, 2025 Key Takeaways How is BTC reacting to the Fed’s rate cut? Bitcoin is grinding +0.72%, range-bound, with flows measured and a potential long squeeze in play. What’s setting up Bitcoin for year-end? Dovish Fed signals, seasonal tailwinds, and aligned macro flows keep BTC primed for a potential ATH. No parabolic moves, just Bitcoin [BTC] grinding +0.72% intraday as the FOMC delivers its first 25 bps cut of 2025. The tape is cautious, with range-bound action signaling traders are sitting tight. What’s the takeaway? Market participants are still sizing up Q4, with Fed Chair Powell’s mixed signals on future rate cuts keeping flows measured, as Matt Mena, Crypto Research Strategist at 21Shares, told AMBCrypto. “The cut itself was widely priced in – what mattered more was the Fed’s updated dot plot. Futures markets had been discounting only a 50% chance of 4–5 cuts through the end of next year.” He added, “While today’s 25bps cut provided the spark, it is the path implied by the dots – more than the cut itself – that may set the stage for Bitcoin to challenge new highs into year-end.” Fed’s dot plot shapes BTC’s long-term positioning Bitcoin traders are leaning on the Fed’s dot plot to size up positioning.  According to the latest projections, the Fed is signaling two more 25bps cuts by year-end, pushing the target range down to 3.50%–3.75% from 4.00%–4.25%. In short, Bitcoin’s long-term positioning remains dovish. Powell’s inflation caution capped the short-term squeeze, keeping the tape range-bound. Yet the dot plot shows most Fed officials leaning toward two more cuts, keeping BTC positioned to grind toward new highs by year-end. “The dots leaned more dovish, signaling the Fed is open to accelerating the pace of easing if conditions demand it. That repricing risk is now…
Share
BitcoinEthereumNews2025/09/18 22:27