Alibaba debuts AI model that can process video, audio on phones


Alibaba said in a statement that its new Qwen2.5-Omni-7B system demonstrated particularly high performance in speech understanding and generation. — Reuters

Alibaba Group Holding Ltd has released a new artificial intelligence model in its Qwen series that the company says can process text, pictures, audio and video, and is efficient enough to run directly on mobile phones and laptops.

The company said it expects that the new model, now publicly available on Hugging Face and GitHub, will be used to build so-called AI agents that can, for example, help a visually impaired person navigate their environment through real-time audio descriptions. 

Follow us on our official WhatsApp channel for breaking news alerts and key updates!

Next In Tech News

Robinhood to enter Indonesia with brokerage, crypto trader acquisition
Artificially intelligent: The evolving threat of deepfakes
Trump says he'll be involved in review of Netflix-Warner Brothers deal
Scale of social media use in pre-school children ‘deeply alarming’
Opinion: Are QR codes computer-friendly?
Pick your handle: WhatsApp preparing reservation queue for usernames
'Kirby Air Riders': A 'Mario Kart' alternative for the Switch 2
Meta delays release of Phoenix mixed-reality glasses to 2027, Business Insider reports
Opinion: How can you tell if something’s been written by ChatGPT? Let’s delve
'Stealing from a thief': How ChatGPT helped Delhi man outsmart scammer, make him 'beg' for forgiveness

Others Also Read