I am a PhD student at MMLab, The Chinese University of Hong Kong, supervised by Prof. Xiangyu Yue. I received my bachelor's degree from Wuhan University, advised by Prof. Zuchao Li and Prof. Lefei Zhang.
I am a core member of the InternAgent team at Shanghai AI Laboratory, working closely with Dr. Bo Zhang. My work focuses on leading large-scale mid-training for fundamental agent capabilities, and developing long-horizon deep research tasks.
My research focuses on agentic foundation models, including both LLMs and VLMs, for executing complex, long-horizon tasks in real-world environments. I am also broadly interested in multimodal reasoning, vision-language representation learning, and unified multimodal models, where I have developed substantial research expertise.
I am always open to potential discussions and collaborations. Please feel free to reach out by email if you are interested in my work or would like to explore possible collaborations.
Agentic LLMs and VLMs for long-horizon task execution, deep research, and scientific discovery
Novel reasoning and interaction paradigms for enhanced logical reasoning, planning, and decision-making
Multimodal reasoning, unified multimodal models, vision-language representation