DeepSeek’s mHC debut meets skepticism ahead of peer validation

With the costs of developing and maintaining AI rising and available hardware in limited supply, DeepSeek has presented a new plan for developing and scaling artificial intelligence (AI).

The China-based start-up believes it can build significantly better AI models without adding more chips or increasing power consumption. Although the proposed concept, Manifold-Constrained Hyper-Connections (mHC), has drawn significant attention from researchers in the field, it is generally considered to be at an early stage.

Further research will be required to determine the benefits of the approach in developing larger AI systems. A technical paper detailing the mHC concept was released last week and is co-authored by Liang Wenfeng, DeepSeek’s founder and CEO.

DeepSeek rethinks network design to scale AI

One of the main components of the work is a re-evaluation of how information is transferred between the various layers of a multi-layered neural network.

In a conventional deep network, each layer passes its processed output to the next layer. A Residual Network (ResNet) adds a shortcut: each layer's output is added to its unchanged input, so the original signal can travel through many layers intact. Developed by Kaiming He and colleagues at Microsoft Research approximately ten years ago, ResNets became the foundation of many of today's most advanced AI systems.
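To make the idea of a residual shortcut concrete, here is a minimal sketch in plain Python. The toy `layer` function and its single `weight` are assumptions made for illustration; real networks use learned weight matrices, and nothing here is taken from DeepSeek's paper.

```python
import math

def layer(x, weight):
    """A toy layer (hypothetical): scale each value, then squash it with tanh."""
    return [math.tanh(weight * v) for v in x]

def plain_block(x, weight):
    """Without a shortcut, the input is replaced by the layer output."""
    return layer(x, weight)

def residual_block(x, weight):
    """With a shortcut, the layer output is added to the unchanged input."""
    return [xi + fi for xi, fi in zip(x, layer(x, weight))]

x = [1.0, -2.0, 0.5]
# With a weight of zero the toy layer contributes nothing.
print(plain_block(x, 0.0))     # the original signal is lost
print(residual_block(x, 0.0))  # the original signal passes through unchanged
```

Even when a layer has learned nothing useful, the shortcut lets the input survive, which is a large part of why very deep networks became trainable.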

DeepSeek's concept builds on Hyper-Connections, which ByteDance introduced in 2024. Hyper-Connections allow information to travel along multiple routes through a network, rather than just one main path, which can speed up learning and enrich what the model learns.

However, these extra routes can also make training unstable, or cause it to fail completely.
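The multi-route idea can be sketched as several parallel copies ("lanes") of the hidden state that exchange information at every layer. This is a simplified illustration, not ByteDance's or DeepSeek's exact formulation: the names `hyper_block`, `mixing`, `read` and `write` are hypothetical, and in a real model these weights would be learned.

```python
def hyper_block(streams, mixing, read, write, layer_fn):
    """One layer acting on several parallel lanes of hidden state."""
    n = len(streams)
    width = len(streams[0])
    # Read: blend all lanes into a single input for the layer.
    layer_in = [sum(read[j] * streams[j][k] for j in range(n)) for k in range(width)]
    layer_out = layer_fn(layer_in)
    # Mix: let every lane exchange information with every other lane.
    mixed = [
        [sum(mixing[i][j] * streams[j][k] for j in range(n)) for k in range(width)]
        for i in range(n)
    ]
    # Write: add the layer output back onto each lane with its own weight.
    return [[mixed[i][k] + write[i] * layer_out[k] for k in range(width)] for i in range(n)]

def double(values):
    """A stand-in for a real layer (hypothetical)."""
    return [2.0 * v for v in values]

# With a single lane and unit weights this reduces to an ordinary residual block.
one_lane = hyper_block([[1.0, 2.0]], [[1.0]], [1.0], [1.0], double)

# With two lanes, the mixing matrix decides how much each lane borrows from the other.
two_lanes = hyper_block(
    [[1.0, 2.0], [3.0, 4.0]],
    mixing=[[0.5, 0.5], [0.5, 0.5]],
    read=[1.0, 0.0],
    write=[1.0, 0.0],
    layer_fn=double,
)
print(one_lane)
print(two_lanes)
```

The single-lane case shows that a residual connection is the special case of one route; adding lanes adds a mixing matrix whose entries the model is free to learn.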
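A rough way to see where the instability can come from: if the lanes are mixed by an unconstrained matrix at every layer, small imbalances compound with depth. The sketch below compares an arbitrary positive mixing matrix with a version whose rows and columns have been rescaled to sum to 1. That rescaling is one possible constraint, assumed here purely for illustration; the article does not describe how mHC constrains the mixing, and the matrix values are made up.

```python
def apply(matrix, vec):
    """Multiply a square matrix by a vector."""
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]

def normalise(matrix, rounds=50):
    """Alternately rescale rows and columns so each sums to about 1.

    Assumes every entry is positive, so no sum is ever zero.
    """
    m = [row[:] for row in matrix]
    n = len(m)
    for _ in range(rounds):
        m = [[v / sum(row) for v in row] for row in m]
        col_sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
        m = [[m[i][j] / col_sums[j] for j in range(n)] for i in range(n)]
    return m

def run(matrix, signal, depth):
    """Pass a signal through `depth` layers that all use the same mixing matrix."""
    for _ in range(depth):
        signal = apply(matrix, signal)
    return signal

free = [[1.2, 0.3], [0.4, 1.1]]  # hypothetical unconstrained mixing weights
ruled = normalise(free)          # same weights with the sum-to-1 rule imposed
start = [1.0, 1.0]

free_out = run(free, start, 60)
ruled_out = run(ruled, start, 60)
print(free_out)   # grows very large with depth
print(ruled_out)  # stays close to the starting scale
```

Because every row of `free` sums to 1.5, the signal grows by that factor at each layer, while the normalised matrix keeps the total signal constant however deep the stack becomes.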

According to Song Linqi of City University of Hong Kong, DeepSeek's research builds on an existing idea, in keeping with the company's habit of refining other companies' work rather than inventing something from the ground up.

Song compared ResNet to a one-lane expressway and Hyper-Connections to a multi-lane one, cautioning that multiple lanes without proper rules may lead to more collisions.

Professor Guo Song of the Hong Kong University of Science and Technology believes the paper may signal a shift in how AI research is done. Instead of continuing to make small modifications to existing model designs, he expects research to move towards new models built on theoretical foundations.

Researchers test mHC but raise practical concerns

Despite excitement over the mHC test results, experts have stressed that the research is far from complete. DeepSeek's tests used only four data paths, on models with 27 billion parameters.

Today's leading AI models are much larger, typically with hundreds of billions of parameters, whereas about 30 billion parameters was standard just a few years ago.

Guo echoed these concerns, saying that no one can yet conclude whether mHC will work at the frontier of AI technology. He added that the infrastructure mHC requires may be too demanding for smaller research institutions, and for companies that want to deploy it on mobile devices.

According to Cryptopolitan, DeepSeek rose to prominence with the release of its DeepSeek V3 large language model, followed only a couple of weeks later by its DeepSeek-R1 reasoning model.

In benchmark tests, both models matched or exceeded their competitors, despite being trained on only a fraction of the training data used for the competing language models.
