Alibaba Group Holding Limited (BABA) stock soars as new AI pooling tech slashes Nvidia GPU use by 82%

2025/10/18 20:15

TLDR

  • Alibaba slashes GPU usage 82% with Aegaeon, fueling AI at massive scale.
  • Aegaeon cuts AI model-switching latency by 97%, boosting performance.
  • One Nvidia H20 GPU now runs 7 LLMs at once in Alibaba’s AI upgrade.
  • Alibaba Cloud improves GPU efficiency with token-level auto-scaling.
  • Aegaeon powers China’s AI goals while cutting reliance on Nvidia chips.

Alibaba Group Holding Limited (BABA) closed at $167.05, up 1.19%, after the company introduced a GPU pooling system for AI model serving.

The company introduced a computing pooling solution that cut Nvidia GPU usage by 82% in model-serving operations. This advance positions Alibaba Cloud ahead in the race to optimize AI deployment at scale.

Aegaeon boosts efficiency, cuts GPU dependency

Alibaba Cloud, the cloud computing arm of the Hangzhou-based firm, implemented a new system called Aegaeon to boost AI efficiency. The solution allows a single Nvidia H20 GPU to serve up to seven large language models concurrently. During internal testing, it cut the number of GPUs required from 1,192 to 213, a drop of about 82%.
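The 82% headline figure follows directly from the two GPU counts quoted above. A minimal check of that arithmetic, using only the numbers reported in the article (the function name is illustrative):

```python
# GPU counts quoted in the article for the internal test.
gpus_before = 1192
gpus_after = 213


def reduction_percent(before: int, after: int) -> float:
    """Share of units no longer needed, as a percentage of the starting count."""
    return (before - after) / before * 100


saved = gpus_before - gpus_after
print(f"GPUs freed: {saved}")
print(f"Reduction: {reduction_percent(gpus_before, gpus_after):.1f}%")
```

The result rounds to the 82% cited in the headline.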

Aegaeon works by auto-scaling at the token level during inference across concurrent AI workloads. This enables dynamic resource reallocation, allowing the same GPU to switch between models partway through generating a response instead of waiting for a request to finish. It also cut model-switching latency by 97%.
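To make the idea of token-level scheduling concrete, here is a toy simulation in which one GPU is shared by several models and the scheduler may hand the GPU to a different model after every generated token. This is only a sketch under stated assumptions: the round-robin policy, the `Request` class, and the model names are hypothetical illustrations, not Aegaeon's actual algorithm, which the article does not describe in detail.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Request:
    model: str        # hypothetical model name
    tokens_left: int  # output tokens still to generate (assumed >= 1)


def serve_token_level(requests):
    """Simulate one GPU that re-schedules after every token (round-robin).

    Returns the order in which models produced tokens and the number of
    times the GPU had to switch from one model to another.
    """
    queue = deque(requests)
    trace = []
    switches = 0
    active_model = None
    while queue:
        req = queue.popleft()
        if req.model != active_model:
            if active_model is not None:
                switches += 1  # a different model takes over the GPU
            active_model = req.model
        trace.append(req.model)  # one token generated for this request
        req.tokens_left -= 1
        if req.tokens_left > 0:
            queue.append(req)  # unfinished: rejoin the back of the queue
    return trace, switches


trace, switches = serve_token_level(
    [Request("model-a", 3), Request("model-b", 1), Request("model-c", 2)]
)
print(trace)
print("model switches:", switches)
```

In this toy run, six tokens cause five model switches. Scheduling at this granularity only pays off if each switch is cheap, which is why the reported 97% cut in switching latency matters.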

The solution was beta-tested for over three months in Alibaba Cloud’s Bailian marketplace. It handled dozens of models with up to 72 billion parameters without service degradation. Aegaeon has now been formally deployed in Alibaba’s model marketplace, which serves its proprietary Qwen models.

Model market insights and performance optimization

Alibaba Cloud found that only a small number of models are frequently used in real-world AI tasks. Despite this, many GPUs were allocated to rarely called models, resulting in low resource utilization. Data showed that 17.7% of GPUs served just 1.35% of total inference requests.
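The scale of that imbalance is easier to see as load per GPU. The short calculation below uses the two percentages from the article and assumes, purely for illustration, that every inference request costs the same amount of GPU work; the variable names are not from the source.

```python
# Shares quoted in the article.
gpu_share = 0.177        # fraction of GPUs assigned to rarely called models
request_share = 0.0135   # fraction of inference requests those GPUs served

# Load per GPU relative to the fleet-wide average (1.0 = average).
# Assumption: all requests are equally expensive.
tail_load = request_share / gpu_share
head_load = (1 - request_share) / (1 - gpu_share)

print(f"Rarely used models: {tail_load:.3f}x average load per GPU")
print(f"All other models:   {head_load:.3f}x average load per GPU")
print(f"Gap between the two groups: {head_load / tail_load:.1f}x")
```

Under that assumption, a GPU reserved for a rarely called model handled under a tenth of the average per-GPU request volume.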

With Aegaeon, the company addressed this imbalance by pooling GPUs across models and scaling them on demand. GPUs no longer sit idle while reserved for rarely used models, which raised throughput and hardware efficiency for enterprise deployments.

Peking University and Alibaba Cloud researchers co-authored a technical paper detailing the innovation, presented at SOSP 2025 in South Korea. The study underlined that serving concurrent workloads with traditional GPU methods incurred unnecessary costs. This breakthrough directly supports China’s goal of AI infrastructure modernization under resource constraints.

Nvidia’s role and China’s chip strategy shift

Nvidia developed the H20 GPU specifically for AI inference in China, complying with U.S. export restrictions. However, Chinese regulators recently launched a probe into possible backdoor security vulnerabilities in the chip. This scrutiny has affected the chip’s market position and adoption within China.

Chinese firms like Huawei and Cambricon are accelerating development of domestic GPUs to reduce foreign dependency. Nvidia’s CEO stated that the company’s market share for advanced AI chips in China has fallen to zero. This trend pushes local players to innovate and localize AI hardware supply chains.

Alibaba’s new approach strengthens its market position while aligning with national strategies for tech self-sufficiency. By reducing reliance on U.S. chips, Alibaba gains a stronger foothold in China’s evolving AI ecosystem. The stock’s rise reflects investor confidence in the cost savings and scalability the technology offers.

The post Alibaba Group Holding Limited (BABA) stock soars as new AI pooling tech slashes Nvidia GPU use by 82% appeared first on CoinCentral.
