I am currently a Ph.D. student at the School of Artificial Intelligence, Beihang University, and a member of CoLab, advised by Prof. Si Liu, with an expected graduation date of 2027.06. I am also a research intern at Shanghai AI Laboratory, where I am supervised by Prof. Jiangmiao Pang and Prof. Jifeng Dai. Prior to my Ph.D., I spent two years in the master’s program at Beihang University under the supervision of Prof. Lv Sheng, and I received my B.Eng. from Beihang University. My research interests lie in Embodied AI and Multimodal Large Language Models, I also have research interests and experience in Reinforcement Learning.

Open to Opportunities I am currently seeking 2027 new-graduate full-time opportunities in Embodied AI and MLLM. Feel free to reach me out via email or WeChat.

πŸ”₯ News

  • πŸŽ‰ 2026.06: RoboInter++ was released as a world-action-modeling extension of RoboInter for robotic manipulation and embodied world modeling.
  • πŸŽ‰ 2026.05: LLaVA-CMoE was accepted to ICML 2026.
  • πŸŽ‰ 2026.02: Data, benchmarks, and models from RoboInter were open-sourced.
  • πŸŽ‰ 2026.02: As team leader, I led our team to the Second Place Award (2/62) in the RoCo Challenge @ AAAI 2026: Robotic Collaborative Assembling for Human-Centered Manufacturing.
  • πŸŽ‰ 2026.01: RoboInter was accepted to ICLR 2026.
  • πŸŽ‰ 2025.11: VaccineRAG was accepted to AAAI 2026.
  • πŸŽ‰ 2025.09: InternVLA-M1 was released as a technical report.
  • πŸŽ‰ 2025.03: AirStar was accepted to ACM MM 2025.
  • πŸŽ‰ 2025.01: OpenUAV was accepted to ICLR 2025.
  • πŸŽ‰ 2024.09: CooHOI was accepted to NeurIPS 2024 as a Spotlight paper.
  • πŸŽ‰ 2024.03: I joined Shanghai AI Laboratory as a research intern.
  • πŸŽ‰ 2024.01: Octavius was accepted to ICLR 2024.
  • πŸŽ‰ 2023.05: I joined NIO as a research intern.
  • πŸŽ‰ 2023.03: VL-SAT was accepted to CVPR 2023 as a Highlight paper.
  • πŸŽ‰ 2021.09: I joined SenseTime as a research intern.

Publications

Equal-contribution authors are in bold.
2026
Tech Report
RoboInter++ teaser

RoboInter++: A Holistic Intermediate Representation Suite for Robotic Manipulation and Embodied World Modeling
Ziqin Wang, Hao Li, Weijun Wang, Junhao Cai, Jia Zeng, Jiangmiao Pang, Yilun Chen, Si Liu
Technical Report

ICLR 2026
RoboInter teaser

RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation
Ziqin Wang, Hao Li, Zi-han Ding, Shuai Yang, Yilun Chen, Yang Tian, Xiaolin Hu, Tai Wang, Dahua Lin, Feng Zhao, Si Liu, Jiangmiao Pang
ICLR 2026

ICML 2026
LLaVA-CMoE teaser

LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models
Ziqin Wang, Hengyuan Zhao, Qixin Sun, Kaiyou Song, Yilin Li, Xiaolin Hu, Qingpei Guo, Si Liu
ICML, 2026

AAAI 2026
VaccineRAG teaser

VaccineRAG: Boosting Multimodal Large Language Models’ Immunity to Harmful RAG Samples
Qixin Sun, Ziqin Wang, Hengyuan Zhao, Yilin Li, Kaiyou Song, Linjiang Huang, Xiaolin Hu, Qingpei Guo, Si Liu
AAAI 2026

2025
Tech Report
InternVLA-M1 teaser

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
InternVLA-M1 Team
Technical Report, 2025

ICLR 2025
OpenUAV teaser

Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology
Xiangyu Wang, Ziqin Wang, Donglin Yang, Hohin Kwan, Jinyu Chen, Wenjun Wu, Hongsheng Li, Yue Liao, Si Liu
ICLR 2025

ACM MM 2025
AirStar teaser

UAVClaw: Hi AirStar, Guide Me to the Badminton Court.
Ziqin Wang, Jinyu Chen, Xiangyi Zheng, Qinan Liao, Linjiang Huang, Si Liu
ACM MM 2025

2024
NeurIPS 2024 Spotlight
CooHOI teaser

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
Ziqin Wang, Jiawei Gao, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang
NeurIPS 2024, Spotlight

ECCV 2024
AsyncDriver teaser

Asynchronous Large Language Model Enhanced Planner for Autonomous Driving
Yuan Chen, Zi-han Ding, Ziqin Wang, Yan Wang, Lijun Zhang, Si Liu
ECCV 2024

ICLR 2024
Octavius teaser

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE
Zeren Chen, Ziqin Wang, Zhen Wang, Huayang Liu, Zhenfei Yin, Si Liu, Lu Sheng, Wanli Ouyang, Yu Qiao, Jing Shao
ICLR 2024

2023
CVPR 2023 Highlight
VL-SAT teaser

VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
Ziqin Wang, Bowen Cheng, Lichen Zhao, Dong Xu, Yang Tang, Lu Sheng
CVPR 2023, Highlight

Research and Industry Experience

Shanghai AI Laboratory *Research Intern, 2024.06 - 2026.06*

Worked on multi-agent collaboration, robotic manipulation, embodied world models, and VLA systems. Contributed to InternVLA-M1, RoboInter, and RoboInter++ related projects, including a co-first-author NeurIPS Spotlight paper.

NIO *Research Intern, 2023.09 - 2024.05*

Worked on point-cloud perception foundation models for autonomous driving and improved cone detection recall while optimizing inference speed for deployment.

SenseTime OpenGVLab *Research Intern, 2022.05 - 2023.09*

Participated in InternVL pretraining and CLIP reproduction and optimization, and explored multimodal large models with mixture-of-experts architectures.

SenseTime *Algorithm Intern, 2021.05 - 2022.05*

Worked on liveness detection algorithms, covering data collection, model iteration, and practical system improvement for real-world deployment.

Education

Beihang University *2023.09 - Present*

Ph.D. student, School of Artificial Intelligence.

Beihang University *2021.09 - 2023.06*

M.Eng., School of Software.

Beihang University *2017.09 - 2021.06*

B.Eng., School of Software.

Projects

AirStar visual

AirStar
LLM-Agent UAV Assistant
An LLM-agent-based UAV assistant system that integrates skill selection and tool calling to support campus navigation, photo capture, following, and visual question answering for UAVs.

ScholarMind visual

ScholarMind
Research Productivity Tool
An intelligent tool for improving research reading efficiency, including paper crawling, content analysis, report generation, and research idea summarization.

Honors and Awards

  • Second Place Award (2/62), RoCo Challenge @ AAAI 2026: Robotic Collaborative Assembling for Human-Centered Manufacturing, Team Leader
  • Grand Prize, Beihang "Feng Ru Cup" Creativity Track
  • Ranked 7/180 during undergraduate study
  • First-Place Academic Scholarship, Beihang University
  • First-Class Scholarship for Ph.D. Freshmen, Beihang University

Academic Service

  • Reviewer for CVPR, ICCV, ECCV, ICLR, NeurIPS, ICML, and ACM MM