About Me

I am a second-year PhD student at the University of Central Florida, with Prof. Chen Chen as my advisor. I am currently interning at ByteDance-Seed with Jie Wu and Rui Wang to explore visual and multimodal RL.

Publications

[ByteDance Seed Technical Report]   Self-Forcing++: Towards Minute-Scale High-Quality Video Generation GitHub Stars

Justin Cui, Jie Wu, Ming Li, Tao Yang, Xiaojie Li, Rui Wang, Andrew Bai, Yuanhao Ban, Cho-Jui Hsieh

[ByteDance Seed Technical Report]   RewardDance: Reward Scaling in Visual Generation

Jie Wu, Yu Gao, Zilyu Ye, Ming Li, Liang Li, Hanzhong Guo, Jie Liu, Zeyue Xue, Xiaoxia Hou, Wei Liu, Yan Zeng, Weilin Huang

[ICCV'25]   SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing GitHub Stars

Ming Li, Xin Gu, Fan Chen, Xiaoying Xing, Longyin Wen, Chen Chen, Sijie Zhu

[ICLR'25]   Multi-Reward as Condition for Instruction-based Image Editing. GitHub Stars

Xin Gu, Ming Li, Libo Zhang, Fan Chen, Longyin Wen, Tiejian Luo, Sijie Zhu

[PR'25]   DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection. GitHub Stars

Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma

[ECCV'24]   ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback. GitHub Stars

Ming Li, Taojiannan Yang, Huafeng Kuang, Jie Wu, Zhaoning Wang, Xuefeng Xiao, Chen Chen

[ACM MM'24]   Frame Interpolation with Consecutive Brownian Bridge Diffusion. GitHub Stars

Zonglin Lyu, Ming Li, Jianbo Jiao, Chen Chen

[ICCV'23]   AlignDet: Aligning Pre-training and Fine-tuning in Object Detection. GitHub Stars

Ming Li*, Jie Wu*#, Xionghui Wang, Chen Chen#, Jie Qin, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan

[CVPR'23]   FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation. GitHub Stars

Jie Qin*, Jie Wu*#, Pengxiang Yan, Ming Li, Ren Yuxi, Xuefeng Xiao, Yitong Wang, Rui Wang, Shilei Wen, Xin Pan, Xingang Wang#

[ECCV'22]   Multi-granularity Distillation Scheme Towards Lightweight Semi-supervised Semantic Segmentation. GitHub Stars

Jie Qin, Jie Wu, Ming Li, Xuefeng Xiao, Min Zheng, Xingang Wang

[CVPR'25 Workshop, Computer Vision for Metaverse]   IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment.

Letian Zhang, Ming Li, Chen Chen, Jie Xu

Preprints

[arXiv'25]   Where do Large Vision-Language Models Look at when Answering Questions? GitHub Stars

Xiaoying Xing, Chia-Wen Kuo, Fuxin Li, Yulei Niu, Fan Chen, Ming Li, Ying Wu, Longyin Wen, Sijie Zhu

Internships

Honors