About me

I am currently a final-year undergraduate student at the College of Computer Science and Technology, Jilin University, advised by Professor Hongxia Xie. I have been admitted to pursue a Ph.D. degree in School of Intelligence Science and Technology, Nanjing University, advised by Professor Lan-Zhe Guo, in collaboration with the Shanghai Innovation Institute (SII).

Research Interests

Currently, I am focusing on Embodied Agent.

Publications

  • CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation
    Ruoxuan Zhang, Bin Wen, Hongxia Xie, Yi Yao, Songhan Zuo, Jian-Yu Jiang-Lin, Hong-Han Shuai, Wen-Huang Cheng
    In: Proceedings of the ACM International Conference on Multimedia, 2025.
    ACMMM 2025. CCF-A.
  • CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation
    Ruoxuan Zhang, Bin Wen, Hongxia Xie, Yi Yao, Songhan Zuo, Jian-Yu Jiang-Lin, Hong-Han Shuai, Wen-Huang Cheng
    In: Proceedings of the ACM International Conference on Multimedia, 2025.
    ACMMM 2025. CCF-A.
  • RecipeGen: A Step-Aligned Multimodal Benchmark for Real-World Recipe Generation
    Ruoxuan Zhang, Jidong Gao, Bin Wen, Hongxia Xie, Chenming Zhang, Hong-Han Shuai, Wen-Huang Cheng
    In: Proceedings of the ACM International Conference on Multimedia, 2025.
    ACMMM 2025. CCF-A.
  • EmoArt: A Multidimensional Dataset for Emotion-Aware Artistic Generation
    Cheng Zhang, Hongxia Xie, Bin Wen, Songhan Zuo, Ruoxuan Zhang, Wen-Huang Cheng
    In: Proceedings of the ACM International Conference on Multimedia, 2025.
    ACMMM 2025. CCF-A.
  • MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents
    Ruoxuan Zhang, Qiyun Zheng, Zhiyu Zhou, Ziqi Liao, Siyu Wu, Jian-Yu Jiang-Lin, Bin Wen, Hongxia Xie, Jianlong Fu, Wen-Huang Cheng
    In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025.
    CVPR 2026. CCF-A.
  • Design of UAV target detection network based on deep feature fusion and optimization with small targets in complex contexts
    Jianzheng Liu*, Bin Wen* (co-first author), Jiayi Xiao, Minghui Sun
    In: Neurocomputing, 2025.
    Neurocomputing. CCF-C.

Honor

  • National Scholarship, 2024
  • Outstanding Student, Jilin University, 2023, 2024, 2025
  • Outstanding Graduate, Jilin University, 2026

Teaching

  • Teaching Assistant for Introduction to Artificial Intelligence, Jilin University, 2025 Fall