Tencent and Alibaba’s AI models understand Chinese better than humans, new rankings show


Rivalling models from the two Chinese tech giants have achieved record-high scores on the Chinese Language Understanding Evaluation benchmark. While machines can do better than humans in some language tasks, researchers say AI still has a long way to go before it can truly understand languages. — SCMP

Artificial intelligence (AI) models from Chinese tech giants Tencent Holdings and Alibaba Group Holding understand the Chinese language better than humans, according to a benchmark test measuring natural language processing (NLP).

The two rival models have achieved record-high scores on the Chinese Language Understanding Evaluation (CLUE) benchmark, which is a set of tasks designed to evaluate how well a machine can understand and respond to Chinese text similar to how humans do.

This marks the first time that AI models have scored better than humans on CLUE since the benchmark was established by dozens of researchers three years ago.

Tencent’s “Hunyuan AI model” came first with a score of 86.918, followed by Alibaba’s AliceMind with a score of 86.685, according to Monday’s rankings on CLUE’s website. Both ranked higher than humans, which have been given a score of 86.678.

AI models from Chinese smartphone maker Oppo and food delivery giant Meituan ranked fourth and fifth, respectively.

“Although the rankings change a lot, the top scorers had never surpassed humans until now,” said Alibaba Cloud – the cloud computing arm of the ecommerce giant – in an article published on Friday. It added that the latest results mean that “the Chinese language understanding of the AI model has reached a new level”.

Alibaba owns the South China Morning Post.

Alibaba’s Tmall Genie smart speaker. Photo: Handout

China’s Big Tech firms have been working to improve their NLP technology, which is used to support voice-enabled virtual assistants, such as Alibaba’s AliGenie and Tencent’s Xiaowei, as well as other functions like machine translation and spam detection.

Earlier this year, Chinese internet search giant Baidu said that its bot Du Xiaoxiao wrote an article that scored higher than most students on China’s notoriously difficult national college entrance examination, known locally as gaokao, although it mistakenly used an internet slang term.

Still, some researchers say most AI models still have a long way to go before they can truly understand the intricacies of languages.

Last year, scientists at Auburn University in the US state of Alabama and the research unit of American software company Adobe found that many AIs that performed better than humans on certain comprehension tasks were unable to tell when words in a sentence were randomly shuffled. – South China Morning Post

Follow us on our official WhatsApp channel for breaking news alerts and key updates!
   

Next In Tech News

Social media platform X withholds some political posts in India after election commission order
Portugal's far-right Chega vows legal action over 10-year Facebook curbs
AMD introduces AI chips for business laptops and desktops
Startup Rivos raises $250 million to develop RISC-V AI chips
Bain proposes Japan's Kioxia IPO to clear $5.8 billion loan refinance
Meta oversight board reviews handling of AI-created celebrity porn
UK starts drafting AI regulations for most powerful models
UK plans talks with Big Tech to limit online harm for teens
Nissan says it will make next-generation EV batteries by 2028
UK to criminalise the creation of intimate deepfake images

Others Also Read