About me

I’m Saining Zhang, a Ph.D. student in the College of Computing and Data Science at Nanyang Technological University, supervised by Prof. Hanwang Zhang. I am currently working as a research intern on MLLM at Huawei Singapore. My research interest includes computer vision, computer graphics and embodied AI.

Previously, I received my bachelor’s degree from Beijing Institute of Technology in 2024, where I was supervised by Prof. Hao Zhao from the Institute for AI Industry Research (AIR), Tsinghua University.

News

🎉 [11.2025] One paper got accepted to 3DV 2026!
🎉 [09.2025] One paper got accepted to NeurIPS 2025! Congrats to Nan!
🎉 [06.2025] One paper got accepted to ICCV 2025!
🎉 [06.2025] One paper got accepted to IROS 2025!
🎉 [04.2025] Our new technical report, Selftok, is now available. Congrats to all team members!

Publications [Google Scholar]

* denotes equal contributions, † denotes corresponding author, ‡ denotes project lead.

Light-X : Generative 4D Video Rendering with Camera and Illumination Control

Light-X : Generative 4D Video Rendering with Camera and Illumination Control

Tianqi Liu, Zhaoxi Chen, Zihao Huang, Shaocong Xu, Saining Zhang, Chongjie Ye, Bohan Li, Zhiguo Cao, Wei Li, Hao Zhao†, Ziwei Liu†
[Project page] [Paper] [Arxiv] [Code]

Unleashing and Benchmarking the Interleaved Cross-modality Comprehension and Generation

Unleashing and Benchmarking the Interleaved Cross-modality Comprehension and Generation

Wei Chow*, Jiachun Pan*, Yongyuan Liang, Mingze Zhou, Liyu Jia, Saining Zhang, Xue Song, Siliang Tang, Juncheng Li, Fengda Zhang†, Weijia Wu†, Hanwang Zhang, Tat-Seng Chua
[Project page] [Paper] [Arxiv] [Code]

GaussianArt: Unified Modeling of Geometry and Motion for Articulated Objects

GaussianArt: Unified Modeling of Geometry and Motion for Articulated Objects

Licheng Shen*, Saining Zhang*‡, Honghan Li*, Peilin Yang, Zihao Huang, Zongzheng Zhang, Hao Zhao†
3DV 2026
[Project page] [Paper] [Arxiv] [Code]

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

Unifying Appearance Codes and Bilateral Grids for Driving Scene Gaussian Splatting

Nan Wang, Yuantao Chen, Lixing Xiao, Weiqing Xiao, Bohan Li, Zhaoxi Chen, Chongjie Ye, Shaocong Xu, Saining Zhang, Ziyang Yan, Pierre Merriaux, Lei Lei, Tianfan Xue, Hao Zhao†
NeurIPS 2025
[Project page] [Paper] [Arxiv] [Code]

GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting

GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting

Saining Zhang*, Baijun Ye*, Minghui Qin*, Moonjun Gong, Shaoting Zhu, Zebang Shen, Luan Zhang, Lu Zhang, Hao Zhao, Hang Zhao†
ICCV 2025
[Project page] [Paper] [Arxiv]

CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting

CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting

Haoran Xu*, Saining Zhang*‡, Peishuo Li*, Baijun Ye, Xiaoxue Chen, Huan-ang Gao, Jv Zheng, Xiaowei Song, Ziqiao Peng, Run Miao, Jinrang Jia, Yifeng Shi, Guangqi Yi, Hang Zhao, Hao Tang, Hongyang Li, Kaicheng Yu, Hao Zhao†
IROS 2025, Oral
[Paper] [Arxiv] [Code]

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Selftok Team (Saining Zhang: Core contributor)
Technical report, an extended version of DDT-LLaMA
[Project page] [Paper] [Arxiv] [Code]

Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding

Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding

Weiqing Xiao*, Hao Huang*, Chonghao Zhong*, Yujie Lin, Nan Wang, Xiaoxue Chen, Zhaoxi Chen, Saining Zhang, Shuocheng Yang, Pierre Merriaux, Lei Lei, Hao Zhao†
[Project page] [Paper] [Arxiv] [Code]

Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty

Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty

Saining Zhang*, Baijun Ye*, Xiaoxue Chen, Yuantao Chen, Zongzheng Zhang, Cheng Peng, Yongliang Shi, Hao Zhao†
BMVC 2024
[Project page] [Paper] [Arxiv] [Code]

A Dual-Direction Attention Mixed Feature Network for Facial Expression Recognition

A Dual-Direction Attention Mixed Feature Network for Facial Expression Recognition

Saining Zhang, Yuhang Zhang, Ye Zhang, Yufei Wang, Zhigang Song†
Electronics, 2023
[Paper] [Code]

Awards

Internships

Service

I served / was delegated as Reviewer for NeurIPS 2025, IROS 2025, ACMMM 2024 and BMVC 2024.