The post DeepSeek reveals $294,000 as cost of training its AI model appeared on BitcoinEthereumNews.com.

DeepSeek reveals $294,000 as cost of training its AI model

China’s DeepSeek has claimed its flagship AI system, known as R1, was trained for just $294,000, which is a fraction of the sums believed to be spent by US competitors.

The details were published in a peer-reviewed paper in Nature this week and are likely to fuel further debate over Beijing’s ambitions in the global artificial intelligence race. The Hangzhou-based company said the reasoning-focused model was trained using 512 Nvidia H800 chips. This hardware was designed specifically for China after the US prohibited sales of the more powerful H100 and A100 processors.

The paper, which was co-authored by founder Liang Wenfeng, marks the first time the firm has disclosed such costs.

DeepSeek spends a fraction of what US models cost

In January, the release of DeepSeek’s cheaper AI tools destabilized global markets, triggering a sell-off in tech shares on fears that the tools could undercut established giants such as Nvidia and OpenAI.

Since then, however, Liang and his team have kept a low profile, surfacing only for sporadic product updates.

The reported $294,000 price tag stands in stark contrast to estimates from American firms.

OpenAI chief executive Sam Altman said in 2023: “Training foundational models cost much more than $100 million.” However, he did not give a specific breakdown.

Training large language models involves running banks of powerful chips for extended periods, consuming enormous amounts of electricity while processing text and code. Industry observers have long assumed the bill for such projects runs into the tens or even hundreds of millions.

That assumption is now being challenged. In a supplementary document, DeepSeek acknowledged that it owns A100 chips and used them in early development before moving full-scale training onto its H800 cluster. According to the firm, the final training stage ran for 80 hours.
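For a rough sense of scale, the reported chip count and run time can be multiplied into GPU-hours. The short sketch below does only that. It assumes that all 512 H800 chips ran for the full 80 hours of the final stage, which the article does not state, and it says nothing about the earlier stages or the A100 work. The function and variable names are illustrative.

```python
# Figures as reported in the article; full utilization of every chip
# for the whole final stage is an assumption made here, not a reported fact.
H800_CHIPS = 512
FINAL_STAGE_HOURS = 80


def gpu_hours(chips: int, hours: float) -> float:
    """Chip count times wall-clock hours, assuming every chip runs throughout."""
    return chips * hours


final_stage_gpu_hours = gpu_hours(H800_CHIPS, FINAL_STAGE_HOURS)
print(f"Final-stage compute: {final_stage_gpu_hours:,.0f} GPU-hours")
# prints "Final-stage compute: 40,960 GPU-hours"
```

The result covers the final stage only, so it should not be divided into the $294,000 total to infer a price per GPU-hour.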

Even though Nvidia has insisted that the Chinese startup has access only to its H800 processors, American officials remain sceptical. A few months ago, US sources told Reuters that DeepSeek illegally holds large volumes of H100 chips, which are banned from export to China.

Putting innovation under the microscope

R1 has drawn attention not only for its low training costs but also because it may be the first major model to undergo formal peer review.

“This is a very welcome precedent, and if we don’t have this norm of sharing, it becomes very hard to evaluate risks,” said Lewis Tunstall, a machine-learning engineer at Hugging Face who reviewed the Nature paper.

The review process prompted DeepSeek to clarify technical details, including how its model was trained and what safeguards were in place.

“Going through a rigorous peer-review process certainly helps verify the validity and usefulness of the model,” said Huan Sun, an AI researcher at Ohio State University.

According to the paper, DeepSeek’s key breakthrough was using a pure reinforcement learning approach instead of relying on human-curated reasoning examples. The model was rewarded for solving problems correctly and gradually developed its own problem-solving strategies.

The firm says this trial-and-error system allowed R1 to learn to verify its own work without copying human tactics.
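The idea of rewarding only correct outcomes can be illustrated with a toy example. The sketch below is not DeepSeek's algorithm and involves no language model. It is a minimal gradient-bandit loop, written for illustration, in which a hypothetical "policy" picks one of four made-up strategies for an addition problem and receives a reward of 1 only when the final answer is right. The strategy names, learning rate and step count are all assumptions.

```python
import math
import random

# Hypothetical strategies the toy policy can try on a problem (a, b).
STRATEGIES = {
    "add": lambda a, b: a + b,
    "subtract": lambda a, b: a - b,
    "multiply": lambda a, b: a * b,
    "first_only": lambda a, b: a,
}


def softmax(prefs):
    """Turn preference scores into a probability distribution."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train(steps=2000, lr=0.1, seed=0):
    """Learn strategy probabilities from outcome-only rewards."""
    rng = random.Random(seed)
    names = list(STRATEGIES)
    prefs = [0.0] * len(names)
    baseline = 0.0  # running average reward, used to reduce variance
    for _ in range(steps):
        # Operands start at 3 so that no wrong strategy lands on the right sum.
        a, b = rng.randint(3, 9), rng.randint(3, 9)
        probs = softmax(prefs)
        choice = rng.choices(range(len(names)), weights=probs)[0]
        answer = STRATEGIES[names[choice]](a, b)
        # Only the final answer is checked; no worked example is ever shown.
        reward = 1.0 if answer == a + b else 0.0
        advantage = reward - baseline
        baseline += 0.05 * (reward - baseline)
        for i in range(len(prefs)):
            grad = (1.0 if i == choice else 0.0) - probs[i]
            prefs[i] += lr * advantage * grad
    return dict(zip(names, softmax(prefs)))


for name, p in train().items():
    print(f"{name:>10}: {p:.3f}")
```

After training, nearly all of the probability should sit on the "add" strategy, even though the loop was never told which strategy is correct. That is the sense in which a reward on outcomes alone can shape behaviour.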

“This model has been quite influential,” Sun added. “Almost all reinforcement learning work in 2025 may have been inspired by R1 one way or another.”

DeepSeek denies copying claims

Soon after R1’s release, speculation swirled that DeepSeek had leaned on rival outputs, particularly from OpenAI, to accelerate training; however, the company has now flatly denied that charge.

In correspondence with referees, DeepSeek insisted that R1 did not copy reasoning examples generated by OpenAI. However, like most large language models, it was trained on internet text, so some AI-produced content was inevitably included. The explanation has convinced some reviewers.

“I cannot be 100% sure R1 was not trained on OpenAI examples. However, replication attempts by other labs suggest reinforcement learning is good enough on its own,” Tunstall said.

DeepSeek says R1 is built to excel at reasoning-heavy tasks such as coding and mathematics. Unlike most closed systems developed by US firms, it was released as an open-weight model, freely downloadable by researchers. On the AI community site Hugging Face, it has already been downloaded more than 10 million times.

The firm spent around $6 million developing the base model that R1 is built upon, but even with that added, its costs fall well short of the sums associated with rivals. For many in the field, that makes R1 attractive.
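The comparison can be made concrete with simple arithmetic on the figures quoted in this article. The sketch below adds the reported R1 training cost to the approximate base-model cost and sets the sum against $100 million. Because Altman said "much more than" that amount, it is treated here as a lower bound, so the resulting share is at most this large. The variable names are illustrative.

```python
# Figures as quoted in the article.
R1_TRAINING_USD = 294_000
BASE_MODEL_USD = 6_000_000      # "around $6 million", so approximate
ALTMAN_FLOOR_USD = 100_000_000  # "much more than $100 million", a lower bound

combined = R1_TRAINING_USD + BASE_MODEL_USD
share = combined / ALTMAN_FLOOR_USD

print(f"Combined: ${combined:,}")       # prints "Combined: $6,294,000"
print(f"Share of $100M: {share:.1%}")   # prints "Share of $100M: 6.3%"
```

The two sides are not measured in the same way, since neither company has published a full breakdown, so the percentage is an order-of-magnitude indication and not a like-for-like comparison.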

Sun and colleagues recently tested the system on scientific data tasks and found that it was not the most accurate but was among the best in terms of cost-to-performance.

 


Source: https://www.cryptopolitan.com/deepseek-reveals-cost-of-its-ai-model/
