I am a Research Engineer at the Robotics Lab of the
Beijing Institute of General Artificial Intelligence (BIGAI),
where I work on building real-world reinforcement learning pipelines for humanoid robots. Previously, I received my M.S. degree from
Beihang University, where I was supervised by
Yaodong Yang. I have interned at Tencent AI Lab and was a visiting scholar at
Peking University and
Tsinghua University.
I obtained my B.S. degree from
South China Normal University.
My research interests lie in advancing reinforcement learning toward complex real-world settings, with the goal of compressing environment dynamics into reusable and generalizable intelligence.
I focus on reinforcement learning, world models, and humanoid learning, and I enjoy building robotic systems that can safely and efficiently explore and continuously learn in the real world.
Publications and preprints
Papers sorted by recency. Representative papers are highlighted.
Weidong Huang, Zhehan Li, Hangxin Liu, Biao Hou, Yao Su, Jingwen Zhang
Under Review
project page
Weidong Huang, Jingwen Zhang, Hangxin Liu, Yaodong Yang, Yao Su
Under Review
project page
X Zhang, C Ma, Y Huang, Weidong Huang, S Qi, SC Zhu, X Feng, Y Yang
Conference on Neural Information Processing Systems (NeurIPS), 2025
project page / arXiv
X Zhang, X Cai, B Liu, Weidong Huang, SC Zhu, S Qi, Y Yang
AAAI Conference on Artificial Intelligence (AAAI), 2025 (Oral)
arXiv
Weidong Huang, Jiaming Ji, Borong Zhang, Chunhe Xia and Yaodong Yang
International Conference on Learning Representations (ICLR), 2024 Poster
project page / OpenReview / arXiv
Jiaming Ji, Jiayi Zhou, Borong Zhang, Juntao Dai, Xuehai Pan, Ruiyang Sun, Weidong Huang, ...
Journal of Machine Learning Research (JMLR)
code ⭐ 1K+ / OpenReview / arXiv
Jiaming Ji, Borong Zhang, Xuehai Pan, Jiayi Zhou, Weidong Huang, Juntao Dai and Yaodong Yang
Conference on Neural Information Processing Systems (NeurIPS), 2023
code ⭐ 500+ / arXiv
Select open-source projects
Weidong Huang, Zhehan Li, Hangxin Liu, Biao Hou, Yao Su, Jingwen Zhang
code / project page
Under Review. Scalable RL framework integrating large-scale SAC pretraining and physics-informed world-model finetuning
Weidong Huang, Jiaming Ji, Borong Zhang, Chunhe Xia and Yaodong Yang
code / project page / OpenReview / arXiv
ICLR 2024 Poster. Tackling zero-cost performance within SafeRL, finding optimal policy while satisfying safety constraints
Jiaming Ji, Jiayi Zhou, Borong Zhang, Juntao Dai, Xuehai Pan, Ruiyang Sun, Weidong Huang, ...
code / arXiv
I lead the model-based RL module development, contributing over 13,000+ lines of code
Jiaming Ji, Borong Zhang, Xuehai Pan, Jiayi Zhou, Weidong Huang, Juntao Dai and Yaodong Yang
code / arXiv
I contributed to the development, focusing on the design and validation of safe RL tasks based on visual inputs