Alibaba debuts AI model that can process video, audio on phones


Alibaba said in a statement that its new Qwen2.5-Omni-7B system demonstrated particularly high performance in speech understanding and generation. — Reuters

Alibaba Group Holding Ltd has released a new artificial intelligence model in its Qwen series that the company says can process text, pictures, audio and video, and is efficient enough to run directly on mobile phones and laptops.

The company said it expects that the new model, now publicly available on Hugging Face and GitHub, will be used to build so-called AI agents that can, for example, help a visually impaired person navigate their environment through real-time audio descriptions. 

Follow us on our official WhatsApp channel for breaking news alerts and key updates!

Next In Tech News

Waymo discusses raising billions at over $100 billion valuation, the Information reports
Hacking group ‘ShinyHunters’ threatens to expose premium users of sex site Pornhub
X Corp sues social media startup over bid to claim 'Twitter' brand
US threatens countermeasures after EU fine on Musk's X
Bank of Canada wants stablecoins to be backed by high-quality liquid assets
Factbox-From trend to mainstay: AI to cement its place at the core of 2026 investment strategies
Data and AI firm Databricks valued at $134 billion in latest funding round
Business leaders agree AI is the future. They just wish it worked right now
Review: Defend a moving city in 'Monsters Are Coming' for PC and Xbox
Chip crunch to curb smartphone output in 2026, researcher says

Others Also Read