MPhil Student in Artificial Intelligence, The Hong Kong University of Science and Technology (HKUST), Guangzhou. Advised by Prof. Hui Xiong and Prof. Xuming Hu.
I research efficient long video understanding with Multimodal Large Language Models (MLLMs), focusing on query-aware keyframe selection, token-efficient prompting, and multimodal reasoning for Video-QA. My recent work has been published at CVPR and NeurIPS.
Email: shaoguangwang9@gmail.com | Google Scholar: profile | GitHub: @shaoguangwang