
Former Alibaba scientist Yang Hongxia is at the forefront of an effort to evolve AI beyond DeepSeek's breakthroughs. — SCMP
A renowned professor at Hong Kong Polytechnic University (PolyU) and former artificial intelligence (AI) scientist at Chinese tech giants ByteDance and Alibaba Group Holding is seeking to work with experts across different fields to develop “affordable” domain-specific models.
Yang Hongxia, who joined PolyU’s Department of Computing last year after decades in the technology industry, is at the forefront of an effort to use the capabilities of large language models (LLMs) in specialised applications. Her efforts come as Chinese companies, spurred by the success of start-up DeepSeek, move to open-source their AI models, giving greater access to the tech.
“While current LLMs have made impressive strides in general intelligence, they still fall short in specific domains in fields such as manufacturing and biochemistry,” Yang said in an interview with the South China Morning Post. Alibaba owns the Post.
“This gap exists because much of the relevant data for these fields hasn’t been incorporated into AI model development, as they cannot be crawled from the general web,” she said. Yang added that general-purpose models require adjustments to fit specialised domains.

Yang is leading the establishment of an AI academy, which aims to drive fundamental scientific breakthroughs. The team, primarily composed of students from domestic universities including PolyU, Zhejiang University and Harbin Institute of Technology, is working on “democratising AI development”.
They aim to provide a platform where domain experts can train small AI models using entry-level graphics processing units (GPUs), available through high-performance computing centres in Hong Kong and the provinces of Zhejiang and Guangdong.
Earlier this month, the team published papers illustrating some of its progress, introducing training pipelines designed to minimise computing costs while enabling small models to perform competitive reasoning tasks within specialised fields. “It’s a domain-specific continual pre-training infrastructure,” as Yang put it, likening it to a cloud service that is both cost-effective and accessible.
This approach allows small models – ranging from 1 billion to 3 billion parameters, compared with the hundreds of billions in large models like the 671 billion-parameter DeepSeek-R1 – to complete training and achieve state-of-the-art reasoning capabilities in under 6,000 GPU hours.
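The continual pre-training idea can be pictured with a short, hypothetical sketch: take a small open base model and keep training it on domain text with the standard next-token objective. The model name, toy corpus and hyperparameters below are illustrative assumptions, not details taken from the team’s papers.

```python
# Illustrative sketch only: continual pre-training of a small (~1B-parameter)
# open model on domain-specific text. Model name, corpus and hyperparameters
# are assumptions, not details from Yang's papers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2.5-1.5B"          # assumed small base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model.train()

# Hypothetical domain corpus (e.g. manufacturing or biochemistry documents).
domain_texts = [
    "Process sheet: anneal the alloy at 450 C for two hours before rolling.",
    "The enzyme catalyses hydrolysis of the ester bond at physiological pH.",
]

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

for text in domain_texts:                  # one pass over the toy corpus
    batch = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
    # Standard causal-LM objective: the labels are the input ids themselves.
    outputs = model(**batch, labels=batch["input_ids"])
    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```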
“Specialised fields sometimes have been sidelined in AI development,” Yang said, citing challenges such as different data processing methods and limited access to extensive GPU resources. Her team is currently working on a cancer foundation model in collaboration with top hospitals in Zhejiang and Beijing.
MIT Technology Review’s 2025 list of breakthrough technologies highlights the increasing focus on small models in AI: “As the marginal gains for new high-end models trail off, researchers are figuring out how to do more with less. For certain tasks, smaller models that are trained on more focused data sets can now perform just as well as larger ones – if not better.”
“This approach also maximises the utility of less advanced, heterogeneous computing resources, allowing domestic chips to be more effectively used for small model training,” Yang said.
Yang’s team has pioneered a new machine learning paradigm called “model over models”, which integrates multiple domain-specific models into a single larger pivot model.

The team’s latest paper introduces InfiFusion, an efficient training pipeline focused on small models that outperforms leading models, including Alibaba’s Qwen-2.5-14B-Instruct and Microsoft’s Phi-4, on 11 widely used benchmarks spanning reasoning, coding, maths and instruction-following tasks. InfiFusion achieves these superior results with just 160 H800 GPU hours, a fraction of the millions typically required for traditional LLM training, according to the paper.
“When we have enough domain-specific models and resources, we expect to see emergent capabilities beyond data and test-time scaling,” Yang said. She likened the “model over models” paradigm to learning through “textbooks” (domain-specific models) rather than directly from data.
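One way to picture the “model over models” idea is a toy distillation loop in which several frozen domain-specific “teacher” models are fused into one pivot model by matching its output distribution to theirs. The sketch below is a hypothetical illustration that assumes a shared vocabulary and uses stand-in networks; it does not reproduce the actual InfiFusion procedure or loss.

```python
# Toy "model over models" illustration: distil several frozen domain-specific
# teachers into a single pivot model. Hypothetical sketch, not InfiFusion.
import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB, HIDDEN = 1000, 64

def tiny_lm():
    # Stand-in for a language model: embeds tokens and predicts the next one.
    return nn.Sequential(nn.Embedding(VOCAB, HIDDEN), nn.Linear(HIDDEN, VOCAB))

# Frozen domain-specific source models (e.g. manufacturing, biochemistry).
teachers = [tiny_lm().eval() for _ in range(3)]
for t in teachers:
    t.requires_grad_(False)

pivot = tiny_lm()                            # the model being fused into
optimizer = torch.optim.AdamW(pivot.parameters(), lr=1e-3)

tokens = torch.randint(0, VOCAB, (8, 32))    # fake domain token batch

for _ in range(100):
    # Average the teachers' predictive distributions token by token.
    with torch.no_grad():
        teacher_probs = torch.stack(
            [F.softmax(t(tokens), dim=-1) for t in teachers]
        ).mean(dim=0)
    student_logp = F.log_softmax(pivot(tokens), dim=-1)
    # KL divergence pulls the pivot towards the combined teachers.
    loss = F.kl_div(student_logp, teacher_probs, reduction="batchmean")
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```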
Regarding DeepSeek, Yang said the team behind the models that shook Wall Street last month had made significant breakthroughs in both the pre-training and post-training phases. These include 8-bit floating point (FP8) mixed-precision computing, which significantly improves computational efficiency and resource usage while maintaining model performance, as well as improved reinforcement learning techniques. AI models commonly use 32-bit or 16-bit precision.
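To make the precision point concrete, here is a minimal, hypothetical sketch assuming PyTorch 2.1 or later: a tensor of weights is scaled into the FP8 (E4M3) range, stored at one byte per value instead of four, and dequantized for use. It illustrates the storage and accuracy trade-off only, not DeepSeek’s actual mixed-precision training recipe.

```python
# Minimal FP8 sketch, assuming PyTorch 2.1+ float8 dtypes. Illustrates the
# precision trade-off only; not DeepSeek's mixed-precision training recipe.
import torch

def quantize_fp8(x: torch.Tensor):
    # Scale so the largest magnitude maps near the FP8 E4M3 max (~448).
    scale = x.abs().max() / 448.0
    x_fp8 = (x / scale).to(torch.float8_e4m3fn)   # 1 byte per value
    return x_fp8, scale

def dequantize_fp8(x_fp8: torch.Tensor, scale: torch.Tensor):
    return x_fp8.to(torch.float32) * scale

weights = torch.randn(1024, 1024)                  # pretend layer weights
w_fp8, scale = quantize_fp8(weights)
recovered = dequantize_fp8(w_fp8, scale)

print("bytes per value:", w_fp8.element_size())    # 1, vs 4 for float32
print("max abs error:", (weights - recovered).abs().max().item())
```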
Yang’s team plans to focus on low-bit pre-training in the future.
She also praised DeepSeek for offering greater transparency than many other model developers on the market, which gives industries across various sectors a clearer path to engaging with the AI ecosystem. DeepSeek announced last week that it would make five of its code repositories open source to accelerate development. – South China Morning Post