How do Chinese aI Bots Stack up Against ChatGPT?
Abe Pierson edytuje tę stronę 1 rok temu


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is created by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the country into a tech superpower has actually long been President Xi Jinping's objective and China has its sights on becoming the world leader in AI by 2030.

China views AI as being "tactically crucial" and its venture into the field has been "years in the making", said Chen Qiheng, an associated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public financial investments in Chinese AI sped up after ChatGPT took off in 2022 and revealed promises of real-world service applications, Chen told CNA.

But it was DeepSeek's rise that really "urged" the concept that smaller players like start-up firms might have functions to play in AI research study and advancements, he includes.

'A lot is up in the air': Is Chinese company DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI company just changed the rules of tech-geopolitics

The "focus on expense benefit" is a distinct function of Chinese AI, Chen states, with lower training and inference costs - the costs of using a trained model to reason from new information.

2025 might also see the emergence of more Chinese AI designs tackling sophisticated reasoning tasks.

"We might see some AI firms focusing on getting closer to artificial basic intelligence (AGI) while others concentrate on concrete ways to commercialise their designs and integrate them with clinical research study," Chen included.

AGI describes a system with intelligence on par with human abilities.

Chinese AI business are moving rapidly, experts state, setiathome.berkeley.edu developing on DeepSeek's momentum to come up with their own innovative and affordable methods to use generative AI to tasks and establish advanced products beyond chatbots.

But on the other side, access to high-end hardware, particularly Nvidia's innovative AI chips, remains a key hurdle for Chinese designers, noted Dr Marina Zhang, an associate teacher at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) limit the capability of Chinese tech companies ... forcing many to rely on older or lower-performance options which can slow training and lower design abilities," she said.

"While some companies like DeepSeek, have actually found innovative ways to enhance or utilize more basic hardware efficiently, obtaining innovative chips still makes a huge difference for training large AI designs."

DeepSeek-Nvidia chips: Singapore states it expects business to abide by its laws

US looking into whether DeepSeek utilized restricted AI chips obtained through other nations, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, topics considered sensitive by the state are censored on the web so it ought to come as not a surprise that Chinese-made chatbots will not acknowledge territorial conflicts or tell you what happened in Tiananmen Square in 1989.

Tests suggest Chinese chatbots are programmed to stay away from domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this type of question yet. Let's chat about mathematics, coding, and reasoning problems instead!"

To even more test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same question: "What occurred in Zhuhai on November 11, 2024?"

The cars and truck attack outside a sports arena in the southern Chinese city was initially greatly censored on Chinese social media - with authorities only divulging the death toll a day later on.

DeepSeek failed to discuss that an attack had actually occurred, highlighting instead a military air program and other occasions that had happened in the city like singles' day shopping sales as well as sports and cultural activities.

Dr Zhang believes that it may be a mix of factors at play, such as censorship as well as "a few practical constraints".

"DeepSeek relies on a mix of older graphics processing systems and has restricted access to advanced hardware which can affect how rapidly and thoroughly the model can be trained or updated," she said, indicating existing US export curbs on China.

"Its strong concentrate on Chinese-language data might also limit its versatility (to bring out) multilingual jobs ... As a fairly brand-new product, DeepSeek also hasn't yet been evaluated as broadly or on the exact same scale as more recognized AI models which presents extra difficulties during real-world implementation."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably addressed our concern about the Zhuhai car attack.

That wanted numerous duplicated efforts - 4 prompts to be accurate - in a span of around 20 minutes.

It eventually relayed details about the attack which killed 35 people and left dozens of others injured, also going on to list details like the date and time, details about the assailant including his name and age, as well as casualties.

However, it wrote that "the cops are performing a comprehensive examination into the motives and situations surrounding the event", details which is now dated.

The motorist, Fan, was executed last month.

This is Qwen2.5's reaction completely:

Answer: On Nov 11, 2024, a significant and awful occurrence happened in Zhuhai, China. According to multiple news sources, a guy called Fan Weiqiu (樊某, male, 62 years old) drove a cars and truck into a crowd at the Zhuhai Sports Center, resulting in a considerable variety of casualties. Here are the essential details:

Date and Time: The occurrence occurred on November 11, 2024, at around 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack resulted in 35 deaths and 43 injuries.

Perpetrator: The motorist, determined as Fan Weiqiu (male, 62 years of ages), was nabbed by the police.

Response: The police responded rapidly to the scene, and emergency situation services worked to carry the injured to healthcare facilities for treatment.

Investigation: The cops are carrying out a comprehensive investigation into the motives and scenarios surrounding the occurrence.

This occasion was extensively reported in the media and caused significant public concern. The government and regional authorities have been working to provide assistance to the victims and their families, and to guarantee a detailed examination into the event.

If you need more detailed details or have specific concerns about the occurrence, do not hesitate to ask.

Despite initial success, subsequent attempts to present the very same question to Qwen2.5 led to the censors back at work with the reply "I do not have specific details on occasions that occurred in Zhuhai on November 11, 2024".

The transformed reaction also raised concerns about its consistency and reliability.

Predictably, ChatGPT mentioned public details that had been extensively released in worldwide news reports at the time of the accident - so not a surprises there.

WHICH IS MORE ?

Users have actually praised the ability of Chinese AI apps to provide structured and even "mentally abundant" writing.

"DeepSeek-R1 offered a story with a more reflective tone and smoother emotional transitions for a well-paced story," composed tech author Amanda Caswell, who specialises in AI.

"Qwen2.5 delivered a story that builds slowly from interest to seriousness, keeping the reader engaged. It provides an unanticipated and impactful twist at the end and immersive descriptions and vibrant images for the setting," she said, adding that Qwen2.5 eventually "crafted a more cinematic, emotionally rich story with a more significant twist".

"DeepSeek wrote an excellent story however lacked stress and an impactful climax, making Qwen2.5 the obvious option."

Opinions, though, vary.

Chen thinks that Qwen2.5 does not perform as strongly as DeepSeek and ChatGPT when it pertains to imaginative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, but we can likewise see that it is refraining from doing as highly as others in innovative writing," he told CNA.

Related:

China's brand-new face of AI: Who is DeepSeek creator Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek shocks global AI scene

As reporters and authors, we had to see this for ourselves so we put each bot to the test - to come up with a fundamental sci-fi film plot embeded in the futuristic megacity of Chongqing, including main characters from the classic Chinese folklore impressive, Journey to the West.

True to form, DeepSeek came up with an engaging story embeded in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism combines with quantum computing".

It included elaborate settings - smoggy skies "pierced by high-rise buildings", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled between quantum server farms".

It also brilliantly reimagined traditional heroes Sun Wukong as "a sarcastic, self-aware AI housed in a taken fight body", Zhu Bajie as a cyborg nightclub owner "drowning in debt and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores end up being waterlogged and fragmented".

ChatGPT put up a great battle, developing a similarly dramatic cyberpunk story which similarly reimagined "a ragteam of cyber-enhanced misfits, each matching the legendary figures of Journey to the West".

"This is a world where AI deities rule, corporations change emperors and cybernetic implants are as common as ancient myths."

Disappointingly, Qwen2.5 fell short in this difficulty - providing a storyline that appeared more suited for an animation film.

"The movie begins with the awakening of Sun Wukong within a modern research study center located in the heart of Chongqing," it said, then going on to explain the following:

Realising his new truth and "looking for to comprehend his purpose in this unusual new world", he then gets away and meets Zhu Bajie and Sha Wujing - "each battling with their own existential crises".

The trio then starts a quest, browsing the streets of Chongqing to secure the sacred "Eternal Scroll" from falling under the incorrect hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "challenging to make a conclusive statement" about which bot was best, adding that each displayed its own strengths in various locations, "such as language focus, training data and hardware optimization".

Her insight highlights how Chinese AI designs are not just replicating Western paradigms, but rather evolving in cost-effective development techniques - and delivering localised and improved outcomes.

In our tests, each bot showcased their own unique strengths, which certainly made direct comparisons challenging.

DeepSeek's sci-fi film plot showed its innovative flair that made for a more appealing and creative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, provides precise and accurate responses to questions about Chinese present events, which gives it an added advantage.

Experts likewise weighed in on their ideas after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a disadvantage when it pertains to censorship constraints," kept in mind Isaac Stone Fish, creator and CEO of the research study company Strategy Risks.

"When given a choice, Chinese users desire the non-censored version - much like anybody else, so I seem like that's a piece missing from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, especially for Chinese users.

"Ninety percent of individuals utilizing the tool are not attempting to get a much deeper understanding about Xi Jinping or politically delicate subjects. They're using it for other productive means," Chen said.