Alibaba debuts AI model that can process video, audio on phones


Alibaba said in a statement that its new Qwen2.5-Omni-7B system demonstrated particularly high performance in speech understanding and generation. — Reuters

Alibaba Group Holding Ltd has released a new artificial intelligence model in its Qwen series that the company says can process text, pictures, audio and video, and is efficient enough to run directly on mobile phones and laptops.

The company said it expects that the new model, now publicly available on Hugging Face and GitHub, will be used to build so-called AI agents that can, for example, help a visually impaired person navigate their environment through real-time audio descriptions. 

Follow us on our official WhatsApp channel for breaking news alerts and key updates!

Next In Tech News

Exclusive-Tesla board made $3 billion via stock awards that dwarfed tech peers
Electricity is now holding back growth across the global economy
North Korean leader's sister sports Chinese foldable phone
STMicro has shipped 5 billion chips for Starlink in past decade; that could double by 2027
Tech support scammers stole US$85,000 from him. His bank declined to refund him.
Analysis-Old meets new economy: AI boom to supercharge European banks' rally
Humanoid robots take center stage at Silicon Valley summit, but scepticism remains
Asahi CEO mulls new cybersecurity unit as disruption drags on
China's smaller manufacturers look to catch the automation wave
From Zelda to Civ VI: understanding game complexity

Others Also Read