Yibing Song

宋奕兵

Deputy Chief Engineer (Algorithm)
BYD Group

Email: yibingsong.cv at gmail dot com

Biography


I oversee the AI system design in BYD electric vehicles. Previously, I held positions in Academia (i.e., Fudan University as a faculty member) and Industry (i.e., Alibaba DAMO Academy, and Tencent AI Lab as a research scientist). I got my PhD/MPhil degrees from City University of Hong Kong during which I visited Adobe Research and UC Merced, and got my bachelor degree from University of Science and Technology of China. My expertise resides in computer vision and machine learning, with 60+ premier papers (i.e., CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, PAMI, IJCV) published and 12k+ citations gathered. Specifically, I am experienced in multi-modal AI, from model-centric, data-centric, and human-centric perspectives, with applications centered around computer vision. I am an IEEE senior member, and have been elected among the Top 2% Scientists worldwide by Stanford University.


Professional Activities


Senior / Lead Area Chairs: CVPR 2026, ICLR 2026
Area Chairs / Meta Reviewers: CVPR (2023-2025), ICCV (2023-2025), NeurIPS (2022-2025), ICML (2023-2025), ICLR (2022-2025)
Outstanding / Top Reviewers: CVPR (2018-2020), ECCV 2022, NeurIPS 2019


Shortlisted Publications   [More] [Citations]


CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-Step
Zheyuan Liu, Munan Ning, Qihui Zhang, Shuo Yang, Zhongrui Wang, Yiwei Yang, Xianzhe Xu, Yibing Song, Weihua Chen, Fan Wang, and Li Yuan
Advances in Neural Information Processing Systems (NeurIPS) 2025
Paper / Project
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
Guowei Xu, Peng Jin, Li Hao, Yibing Song, Lichao Sun, and Li Yuan,
IEEE/CVF International Conference on Computer Vision (ICCV) 2025
Paper / Project
Re-Aligning Language to Visual Objects with an Agentic Workflow
Yuming Chen, Jiangyan Feng, Haodong Zhang, Lijun Gong, Feng Zhu, Rui Zhao, Qibin Hou, Ming-Ming Cheng, and Yibing Song,
International Conference on Learning Representations (ICLR) 2025
Paper / Project
DiffusionDET: Diffusion Model for Object Detection
Shoufa Chen, Peize Sun, Yibing Song, and Ping Luo,
IEEE/CVF International Conference on Computer Vision (ICCV) 2023 (Best Paper Nominee)
Paper / Project
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Zhan Tong, Yibing Song, Jue Wang, and Limin Wang,
Advances in Neural Information Processing Systems (NeurIPS) 2022 (Spotlight)
Paper / Project / Hugging Face Repo
Ranked 8th in most influential NIPS 2022 papers / Ranked 39th in most cited 2022 AI papers