I am a PhD student at MMLab, The Chinese University of Hong Kong, supervised by Prof. Xiangyu Yue. I received my bachelor's degree from Wuhan University, advised by Prof. Zuchao Li and Prof. Lefei Zhang.
I am a core member of the InternAgent team at Shanghai AI Laboratory, working closely with Dr. Bo Zhang. My work focuses on leading large-scale mid-training for fundamental agent capabilities, and developing long-horizon deep research tasks.
My research focuses on agentic models that can execute complex long-horizon tasks in real-world scenarios. I am interested in building systems with strong reasoning and execution capabilities through scaling model-environment interaction. In addition, I am broadly interested in multimodal reasoning, vision-language representation learning, and unified multimodal models.
I am always open to potential discussions and collaborations. Please feel free to reach out by email if you are interested in my work or would like to explore possible collaborations.
Long-horizon task execution, deep research, scientific discovery, model-environment interaction
Novel thinking and interaction patterns for enhanced logical reasoning and planning capabilities
Multimodal reasoning, unified multimodal models, vision-language representation