Your bank is using your money. You’re getting the scraps.WATCH FREE

DeepSeek’s latest AI upgrade challenges Alibaba’s Qwen3 and OpenAI’s o3 model

In this post:

  • DeepSeek rolled out R1-0528 as a major boost to its R1 model after Alibaba’s Qwen3 outperformed it.
  • R1-0528 now matches models from big tech firms like OpenAI o3, o4-mini, and Google’s Gemini 2.5-Pro.
  • The team distilled R1-0528’s chain-of-thought into a smaller Qwen3 8B base version that outperforms the original by 10%.

DeepSeek has rolled out R1-0528, a major upgrade to the Chinese start-up’s R1 reasoning model, which was released in January. The upgrade arrived just one month after Alibaba Group Holding’s Qwen3 beat the original DeepSeek R1 in LiveBench, an online benchmark for open-source artificial-intelligence models.

DeepSeek’s upgraded R1-0528 model now stands alongside leading AI models from OpenAI and Google in performance. The comeback shows how quickly China’s big technology firms and newer tech firms are pushing to improve their AI tools.

In its statement, DeepSeek said R1-0528 shows better reasoning and creative writing skills. The update also brings stronger coding ability. Most importantly, the company claims the model now produces 50% fewer “hallucinations.”

DeepSeek explained that the upgrades came from extra computing power invested during the post-training phase, when engineers fine-tune a model after the main training process. During the post-training phase, engineers aim to increase the model’s efficiency and enhance its accuracy and safety.

R1-0528 now ranks right behind OpenAI’s o3 and o4-mini

On LiveCodeBench, which measures AI model performance, R1-0528 now ranks just behind OpenAI’s o4-mini and o3 models.

“DeepSeek’s latest upgrade is sharper on reasoning, stronger on math and code, and closing in on top-tier models like Gemini and O3,” said Adina Yakefu, an AI researcher at Hugging Face.

She added that the new version shows “major improvements in inference and hallucination reduction” and proves the start-up is not merely catching up but actively competing.

See also  Edward Snowden Warns of Looming AI Revolution as Public Faith in Institutions Declines

The rapid progress came after Washington had restricted advanced chips and other technology exports to China. Yet Chinese firms continue to refine their systems. Earlier this month, Baidu and Tencent described ways they are making their models run more efficiently despite limited access to cutting-edge semiconductors.

Nvidia chief executive Jensen Huang criticized export controls on Wednesday. “The U.S. has based its policy on the assumption that China cannot make AI chips,” he said. “That assumption was always questionable, and now it’s clearly wrong. The question is not whether China will have AI. It already does.”

DeepSeek raised the performance of Alibaba’s Qwen3 8B model by 10%

DeepSeek also said it distilled the reasoning steps used in R1-0528 into Alibaba’s Qwen3 8B Base model. That process created a new, smaller model that surpassed Qwen3’s performance by more than 10%, according to the company. At the same time, the model was 30 times smaller.

“We believe the chain-of-thought from DeepSeek-R1-0528 will hold significant importance for academic research on reasoning models and industrial work on small models,” the firm stated.

According to Reuters, a DeepSeek representative told a WeChat group that the change was a “minor trial upgrade” that was already open for public testing. In response to fiercer competition, Google has discounted some Gemini access tiers, while OpenAI introduced the lower-cost o3 Mini model.

See also  Mistral releases a new AI audio model named Voxtral

Still letting the bank keep the best part? Watch our free video on being your own bank.

Share link:

Disclaimer. The information provided is not trading advice. Cryptopolitan.com holds no liability for any investments made based on the information provided on this page. We strongly recommend independent research and/or consultation with a qualified professional before making any investment decisions.

Most read

Loading Most Read articles...

Stay on top of crypto news, get daily updates in your inbox

Editor's choice

Loading Editor's Choice articles...

- The Crypto newsletter that keeps you ahead -

Markets move fast.

We move faster.

Subscribe to Cryptopolitan Daily and get timely, sharp, and relevant crypto insights straight to your inbox.

Join now and
never miss a move.

Get in. Get the facts.
Get ahead.

Subscribe to CryptoPolitan