SAI大家说 | AI主题学术讲座(总第三十期)


SAI 大家说
School of Artificial Intelligence
SEMINAR INFORMATION

Topic
Towards Sustainable Multimodal Intelligence: Data-Centric Efficiency in the Open World
Time
04:00-05:00 PM
Date
Apr. 21st, 2026
Zoom meeting
https://cuhk-edu-cn.zoom.us/j/3535854285?omn=91224963845
Meeting ID:3535854285
Speaker
Dr. ZHANG Wei
(SenseTime Group Limited)
Host
Prof. Satoshi NAKAMURA
(School of Artificial Intelligence, The Chinese University of Hong Kong, Shenzhen)

ABOUT THE SPEAKER


Dr. ZHANG Wei
SenseTime Group Limited
Dr. ZHANG Wei Wayne currently serves as an Executive Director of Research at SenseTime Group Limited, responsible for the research and development of SenseNova Large Language Models. He led a team to develop the SenseNova LLM series, including China’s first model matching GPT-4 Turbo capabilities, ranking first in SuperCLUE three times. He has published over 70 papers in top-tier international academic journals and conferences in the field of artificial intelligence, including TPAMI, NeurIPS, ICLR, CVPR, and ACL.
According to Google Scholar, his work has been cited over 9,000 times, with an h-index of 39. From 2023 to 2025, he was consecutively named among the top 2% of global scientists by Stanford University. He served as an adjunct faculty of Qing Yuan Research Institute, Shanghai Jiao Tong University, and an EXCO member of AI Specialist Group of Hong Kong Computer Society. Dr. ZHANG received the B.Eng. degree in electronic engineering from Tsinghua University, and the Ph.D. degree in information engineering from The Chinese University of Hong Kong, advised by Prof. TANG Xiaoou.
ABSTRACT

The rapid scaling of artificial intelligence has catalyzed unprecedented advancements, yet the prevailing “more data, more compute” paradigm is increasingly hitting unsustainable economic and environmental ceilings. As we approach the limits of data availability and energy efficiency, the next frontier in AI is not merely larger models, but smarter, more efficient, and more adaptable systems. This talk introduces a research agenda focused on building intelligent systems that achieve high performance through deep data insights rather than brute-force scaling.
The speaker will propose a new paradigm for sustainable multimodal intelligence that addresses two critical bottlenecks: data efficiency and open-world adaptation. First, he will address data efficiency through a principled data curation framework, validated by his work on the SenseNova LLM series, which achieves state-of-the-art performance using only 18% of the data volume required by flagship open-source models. Second, he will tackle open-world adaptation by presenting robust, training-free methods for out-of-distribution detection and open-vocabulary segmentation. These innovations bridge the gap between foundational research and large-scale industrial deployment, with proven impact in complex domains such as manufacturing and remote sensing. The speaker will conclude by discussing his future research trajectory toward self-evolving, unified multimodal agents—a necessary evolution for AI to become truly reliable, scalable, and capable of navigating real-world complexity.

All of you are warmly welcome!

夜雨聆风