Han Zhao 赵晗

About Me

Hi! I’m Han Zhao (赵晗). I hold my bachelor’s and master’s degrees in Control Science and Engineering from Beijing University of Posts and Telecommunications (BUPT) in 2020 and 2023, respectively. I am currently a third-year joint Ph.D. student in Computer Science and Technology at Zhejiang University and Westlake University (Machine Intelligence Lab, MiLAB), advised by Prof. Donglin Wang.

Research Interests

My current research interests include Embodied Artificial Intelligence, Foundation Models, Reinforcement Learning, and Robotics.

Specifically, I am interested in:

  • Foundation Models for Robotics: developing efficient and effective foundation models for robotics, including multi-modal large language models and vision-language-action models to enhance robots’ perception and decision-making capabilities.

  • Scalable Reinforcement Learning Algorithms: developing reinforcement learning algorithms that can effectively manage large-scale data and model capacity for robotic control. This includes methods such as offline reinforcement learning, imitation learning, and more, to enable robots to acquire scalable and generalizable skills.

Publications

Preprint

Han Zhao, Jiaxuan Zhang, Wenxuan Song, Pengxiang Ding, Donglin Wang*, "VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation". [ArXiv][project page]

Fuhao Li, Wenxuan Song, Han Zhao, Jingbo Wang, Pengxiang Ding, Donglin Wang, Long Zeng, Haoang Li*, "Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model". [ArXiv]

Shuanghao Bai, Wenxuan Song, Jiayi Chen, Yuheng Ji, Zhide Zhong, Jin Yang, Han Zhao, Wanqi Zhou, Wei Zhao, Zhe Li, Pengxiang Ding, Cheng Chi, Haoang Li, Chang Xu, Xiaolong Zheng, Donglin Wang, Shanghang Zhang*, Badong Chen*, "Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey". [ArXiv]

Yihao Wang, Pengxiang Ding, Lingxiao Li, Can Cui, Zirui Ge, Xinyang Tong, Wenxuan Song, Han Zhao, Wei Zhao, Pengxu Hou, Siteng Huang, Yifan Tang, Wenhui Wang, Ru Zhang, Jianyi Liu, Donglin Wang*, "VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model". [ArXiv]

Wenxuan Song, Ziyang Zhou, Han Zhao, Jiayi Chen, Pengxiang Ding, Haodong Yan, Yuxin Huang, Feilong Tang, Donglin Wang, Haoang Li*, "ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver". [ArXiv]

Can Cui, Pengxiang Ding, Wenxuan Song, Shuanghao Bai, Xinyang Tong, Zirui Ge, Runze Suo, Wanqi Zhou, Yang Liu, Bofang Jia, Han Zhao, Siteng Huang, Donglin Wang*, "OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation". [ArXiv]

Wenxuan Song, Jiayi Chen, Pengxiang Ding, Yuxin Huang, Han Zhao, Donglin Wang, Haoang Li*, "CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding". [ArXiv]

Wenxuan Song, Jiayi Chen, Wenxue Li, Xu He, Han Zhao, Can Cui, Pengxiang Ding, Shiyan Su, Feilong Tang, Donglin Wang, Xuelian Cheng, Zongyuan Ge, Xinhu Zheng, Zhe Liu, Hesheng Wang, Haoang Li, "RationalVLA: A Rational Vision-Language-Action Model with Dual System". [ArXiv]

Hongyin Zhang, Diyuan Shi, Zifeng Zhuang, Han Zhao, Zhenyu Wei, Feng Zhao, Sibo Gai, Shangke Lyu, Donglin Wang*, "Unlock Reliable Skill Inference for Quadruped Adaptive Behavior by Skill Graph". [ArXiv]

2025

Yang Liu, Ming Ma, Xiaomin Yu, Pengxiang Ding, Han Zhao, Mingyang Sun, Siteng Huang, Donglin Wang, "SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning".

Wenxuan Song, Jiayi Chen, Pengxiang Ding, Han Zhao, Wei Zhao, Zhide Zhong, Zongyuan Ge, Jun Ma, Haoang Li*, "Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding". [ArXiv]

Xinyang Tong, Pengxiang Ding, Yiguo Fan, Donglin Wang*, Wenjie Zhang, Can Cui, Mingyang Sun, Han Zhao, Hongyin Zhang, Yonghao Dang, Siteng Huang, Shangke Lyu, "QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning". [ArXiv] [project page]

Han Zhao, Wenxuan Song, Donglin Wang*, Xinyang Tong, Pengxiang Ding, Xuelian Cheng, Zongyuan Ge, "MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models". [ArXiv]

Wei Zhao, Pengxiang Ding, Min Zhang, Zhefei Gong, Shuanghao Bai, Han Zhao, Donglin Wang*, "VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation". [ArXiv]

Lei Guo*, Wenbo Xiong, Han Zhao, Yuan Song, Dongming Gan, "A nearly optimal adaptive saturation function tuning method for quasi-sliding mode control based on integral reinforcement learning". [paper]

Han Zhao, Min Zhang, Wei Zhao, Pengxiang Ding, Siteng Huang, Donglin Wang*, "Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference". [ArXiv] [project page] [zhihu] [github] [demo] [twitter@AK]

2024

Pengxiang Ding, Han Zhao, Wenjie Zhang, Wenxuan Song, Min Zhang, Siteng Huang, Ningxi Yang, Donglin Wang*, "QUAR-VLA: Vision-Language-Action Model for Quadruped Robots". [ArXiv]

Yang Liu, Pengxiang Ding, Siteng Huang, Min Zhang, Han Zhao, Donglin Wang*, "PiTe: Pixel-Temporal Alignment for Large Video-Language Model". [ArXiv]

Wenxuan Song, Han Zhao, Pengxiang Ding, Can Cui, Shangke Lyu, Yaning Fan, Donglin Wang*, "GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot". [ArXiv] [project page] [video]

Shangke Lyu, Xin Lang, Han Zhao, Hongyin Zhang, Pengxiang Ding, Donglin Wang*, "RL2AC: Reinforcement Learning-based Rapid Online Adaptive Control for Legged Robot Robust Locomotion".

2023

Shangke Lyu, Han Zhao, Donglin Wang*, "A Composite Control Strategy for Quadruped Robot by Integrating Reinforcement Learning and Model-Based Control". [paper]

Lei Guo*, Han Zhao, "Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations". [paper]

Lei Guo*, Han Zhao, "Model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on reinforcement learning". [paper]

2022

Han Zhao*, Lei Guo, Yuan Song, "System Modelling and Controller Design of a Variable Structure Two-Wheeled Robot Using Robust Adaptive Dynamic Programming". [paper]

Han Zhao*, Lei Guo, "Model-free Nearly Optimal Control of Constrained-Input Nonlinear Systems Based on Synchronous Reinforcement Learning". [paper]

Service

Reviewer for:

Journal

  • Knowledge-Based Systems (KBS)
  • IET Control Theory & Applications (IET-CTA)

Conference

  • International Conference on Learning Representations (ICLR)
  • IEEE/CVF International Conference on Computer Vision (ICCV)
  • AAAI Conference on Artificial Intelligence (AAAI)
  • IEEE International Conference on Robotics and Automation (ICRA)
  • IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
  • IEEE Conference on Decision and Control (CDC)