About Me
I am a second-year PhD student at the University of Central Florida, with Prof. Chen Chen as my advisor. I am currently interning at TikTok with Jie Wu and Rui Wang to explore visual reward fine-tuning. Before this, I conducted research on image editing with Sijie Zhu and Longyin Wen.
Publications
[ICLR'25] Multi-Reward as Condition for Instruction-based Image Editing.
Xin Gu, Ming Li, Libo Zhang, Fan Chen, Longyin Wen, Tiejian Luo, Sijie Zhu
[ECCV'24] ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
Ming Li, Taojiannan Yang, Huafeng Kuang, Jie Wu, Zhaoning Wang, Xuefeng Xiao, Chen Chen
[ACM MM'24] Frame Interpolation with Consecutive Brownian Bridge Diffusion.
Zonglin Lyu, Ming Li, Jianbo Jiao, Chen Chen
[ICCV'23] AlignDet: Aligning Pre-training and Fine-tuning in Object Detection.
Ming Li*, Jie Wu*#, Xionghui Wang, Chen Chen#, Jie Qin, Xuefeng Xiao, Rui Wang, Min Zheng, Xin Pan
[CVPR'23] FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation.
Jie Qin*, Jie Wu*#, Pengxiang Yan, Ming Li, Ren Yuxi, Xuefeng Xiao, Yitong Wang, Rui Wang, Shilei Wen, Xin Pan, Xingang Wang#
[ECCV'22] Multi-granularity Distillation Scheme Towards Lightweight Semi-supervised Semantic Segmentation.
Jie Qin, Jie Wu, Ming Li, Xuefeng Xiao, Min Zheng, Xingang Wang
[CVPR'25 Workshop, Computer Vision for Metaverse] IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment.
Letian Zhang, Ming Li, Chen Chen, Jie Xu
Preprints
[arXiv'25] SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li, Xin Gu, Fan Chen, Xiaoying Xing, Longyin Wen, Chen Chen, Sijie Zhu
[arXiv'25] Where do Large Vision-Language Models Look at when Answering Questions?
Xiaoying Xing, Chia-Wen Kuo, Fuxin Li, Yulei Niu, Fan Chen, Ming Li, Ying Wu, Longyin Wen, Sijie Zhu
[arXiv'23]
DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection.
Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma
Internships
- 2024.05 - Now, TikTok, ByteDance, San Jose, USA.
- 2022.01 - 2023.07, ByteDance, Shenzhen, China.
Honors
- NeurIPS 2024 Top Reviewers.
- 🏆 Champion of CVPR 2023 Long-form Video Understanding and Generation Challenge (Track 3: Question-driven Video Understanding).
- 🏆 Champion of CVPR 2022 AVA Accessibility Vision and Autonomy Challenge (Image Segmentation).
- ORCGS Doctoral Fellowship, the University of Central Florida. 2023.