I am a Ph.D. student at the School of Artificial Intelligence, Beihang University, and a member of CoLab, advised by Prof. Si Liu, with an expected graduation date of 2027.06. I am also a research intern at Shanghai AI Laboratory, where I am supervised by Prof. Jiangmiao Pang and Prof. Jifeng Dai. Prior to my Ph.D., I spent two years in the master’s program at Beihang University under the supervision of Prof. Lu Sheng, and I received my B.Eng. from Beihang University. My research focuses on Embodied AI and Multimodal Large Language Models, and I also have interest and experience in Reinforcement Learning.

Open to Opportunities: I am currently seeking 2027 new-graduate full-time opportunities in Embodied AI and MLLM. Feel free to reach out to me via email or WeChat.

🔥 News

🎉 2026.06: RoboInter++ was released as a world-action-modeling extension of RoboInter for robotic manipulation and embodied world modeling.
🎉 2026.05: LLaVA-CMoE was accepted to ICML 2026.
🎉 2026.02: Data, benchmarks, and models from RoboInter were open-sourced.
🎉 2026.02: As team leader, I led our team to the Second Place Award (2/62) in the RoCo Challenge @ AAAI 2026: Robotic Collaborative Assembling for Human-Centered Manufacturing.
🎉 2026.01: RoboInter was accepted to ICLR 2026.
🎉 2025.11: VaccineRAG was accepted to AAAI 2026.
🎉 2025.09: InternVLA-M1 was released as a technical report.
🎉 2025.03: AirStar was accepted to ACM MM 2025.
🎉 2025.01: OpenUAV was accepted to ICLR 2025.
🎉 2024.09: CooHOI was accepted to NeurIPS 2024 as a Spotlight paper.
🎉 2024.03: I joined Shanghai AI Laboratory as a research intern.
🎉 2024.01: Octavius was accepted to ICLR 2024.
🎉 2023.05: I joined NIO as a research intern.
🎉 2023.03: VL-SAT was accepted to CVPR 2023 as a Highlight paper.
🎉 2021.09: I joined SenseTime as a research intern.

Publications

Equal-contribution authors are in bold.

2026

Tech Report

RoboInter++: A Holistic Intermediate Representation Suite for Robotic Manipulation and Embodied World Modeling
Ziqin Wang, Hao Li, Weijun Wang, Junhao Cai, Jia Zeng, Jiangmiao Pang, Yilun Chen, Si Liu
Technical Report

Code

ICLR 2026

RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation
Ziqin Wang, Hao Li, Zi-han Ding, Shuai Yang, Yilun Chen, Yang Tian, Xiaolin Hu, Tai Wang, Dahua Lin, Feng Zhao, Si Liu, Jiangmiao Pang
ICLR 2026

Paper Code

ICML 2026

LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models
Ziqin Wang, Hengyuan Zhao, Qixin Sun, Kaiyou Song, Yilin Li, Xiaolin Hu, Qingpei Guo, Si Liu
ICML 2026

Paper

AAAI 2026

VaccineRAG: Boosting Multimodal Large Language Models’ Immunity to Harmful RAG Samples
Qixin Sun, Ziqin Wang, Hengyuan Zhao, Yilin Li, Kaiyou Song, Linjiang Huang, Xiaolin Hu, Qingpei Guo, Si Liu
AAAI 2026

Paper

2025

Tech Report

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
InternVLA-M1 Team
Technical Report, 2025

Paper Code

ICLR 2025

Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology
Xiangyu Wang, Ziqin Wang, Donglin Yang, Hohin Kwan, Jinyu Chen, Wenjun Wu, Hongsheng Li, Yue Liao, Si Liu
ICLR 2025

Paper Project Page

ACM MM 2025

UAVClaw: Hi AirStar, Guide Me to the Badminton Court.
Ziqin Wang, Jinyu Chen, Xiangyi Zheng, Qinan Liao, Linjiang Huang, Si Liu
ACM MM 2025

Paper Code

2024

NeurIPS 2024 Spotlight

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
Ziqin Wang, Jiawei Gao, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang
NeurIPS 2024, Spotlight

Paper Project Page

ECCV 2024

Asynchronous Large Language Model Enhanced Planner for Autonomous Driving
Yuan Chen, Zi-han Ding, Ziqin Wang, Yan Wang, Lijun Zhang, Si Liu
ECCV 2024

Paper Code

ICLR 2024

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE
Zeren Chen, Ziqin Wang, Zhen Wang, Huayang Liu, Zhenfei Yin, Si Liu, Lu Sheng, Wanli Ouyang, Yu Qiao, Jing Shao
ICLR 2024

Paper Project Page

2023

CVPR 2023 Highlight

VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
Ziqin Wang, Bowen Cheng, Lichen Zhao, Dong Xu, Yang Tang, Lu Sheng
CVPR 2023, Highlight

Paper Code

Research and Industry Experience

Shanghai AI Laboratory *Research Intern, 2024.06 - 2026.06*

Worked on multi-agent collaboration, robotic manipulation, embodied world models, and VLA systems. Contributed to the InternVLA-M1, RoboInter, and RoboInter++ projects, and co-first-authored a NeurIPS Spotlight paper.

NIO *Research Intern, 2023.09 - 2024.05*

Worked on point-cloud perception foundation models for autonomous driving and improved cone detection recall while optimizing inference speed for deployment.

SenseTime OpenGVLab *Research Intern, 2022.05 - 2023.09*

Participated in InternVL pretraining and CLIP reproduction and optimization, and explored multimodal large models with mixture-of-experts architectures.

SenseTime *Algorithm Intern, 2021.05 - 2022.05*

Worked on liveness detection algorithms, covering data collection, model iteration, and practical system improvement for real-world deployment.

Education

Beihang University *2023.09 - Present*

Ph.D. student, School of Artificial Intelligence.

Beihang University *2021.09 - 2023.06*

M.Eng., School of Software.

Beihang University *2017.09 - 2021.06*

B.Eng., School of Software.

Projects

AirStar
LLM-Agent UAV Assistant
An LLM-agent-based UAV assistant system that integrates skill selection and tool calling to support campus navigation, photo capture, following, and visual question answering for UAVs.

Code Project Page

ScholarMind
Research Productivity Tool
An intelligent tool for improving research reading efficiency, including paper crawling, content analysis, report generation, and research idea summarization.

Code

Honors and Awards

Second Place Award (2/62), RoCo Challenge @ AAAI 2026: Robotic Collaborative Assembling for Human-Centered Manufacturing, Team Leader
Grand Prize, Beihang "Feng Ru Cup" Creativity Track
Ranked 7/180 during undergraduate study
First-Place Academic Scholarship, Beihang University
First-Class Scholarship for Ph.D. Freshmen, Beihang University

Academic Service

Reviewer for CVPR, ICCV, ECCV, ICLR, NeurIPS, ICML, and ACM MM