
A group of researchers from DAMO Academy has unveiled a new audiovisual language model called Video-LLaMA. The model improves on previous vision-LLMs by tackling two challenges in video understanding. - SCMP
Alibaba Group Holding’s in-house research unit is making progress with its own large language models (LLMs), as Chinese Big Tech companies continue to pile into the artificial intelligence (AI) space in an attempt to come up with a rival to OpenAI’s ChatGPT.
A group of researchers from DAMO Academy unveiled a new audiovisual language model called Video-LLaMA, which is designed to understand both the visual and auditory content in videos, in a research paper published last week on arXiv, an online scientific paper repository.