Haobo Yuan

Research Associate @ MMLab, NTU


Hi, I am a research associate affiliated with MMLab@NTU, working with Prof. Chen Change Loy. I received degrees of B.E. (with honors, Hongyi Honors College, 2020) and M.S. (School of Computer Science, 2023) both in computer science and technology at Wuhan University. During my master’s study, I was fortunately supervised by Prof. Lefei Zhang.

Currently, I am working on research projects mainly about computer vision. Feel free to email me for any form of discussion.ūüĒ•


Jul 1, 2024 Open-Vocabulary SAM has been accepted by ECCV 2024.
Feb 27, 2024 OMG-Seg got accepted by CVPR 2024.
Jan 30, 2024 One paper (survey about open-vocabulary learning) got accepted by TPAMI.
Aug 16, 2023 Day 1 @ MMLab, NTU. My new voyage begins. ūüöĘ
Jul 14, 2023 One paper got accepted by ICCV 2023. ūüéČūüéČūüéČ
Jun 12, 2023 One paper finally got accepted by IEEE TIP (CCF-A journal). ūüėÖ
Jan 21, 2023 One paper got accepted by ICLR 2023 (Spotlight). ūüéČūüéČūüéČ
Sep 15, 2022 One paper got accepted by NeurIPS 2022. ūüéČūüéČūüéČ
Jul 4, 2022 One paper got accepted by ECCV 2022. ūüéČūüéČūüéČ
Oct 8, 2021 Our ūüéĶPolyphonicFormer won the of ICCV-2021 BMTT Challenge Video + Depth track.


  1. Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
    Haobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, and Chen Change Loy
    In ECCV, 2024. Milano, Italy.
  2. arXiv
    RAP-SAM:Towards Real-Time All-Purpose Segment Anything
    Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, and Ming-Hsuan Yang
    arXiv preprint, 2024.
  3. OMG-Seg: Is One Model Good Enough For All Segmentation?
    Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, and Chen Change Loy
    In CVPR, 2024. Seattle, WA, USA.
  4. Towards Open Vocabulary Learning: A Survey
    Jianzong Wu, Xiangtai Li, Shilin Xu, Haobo Yuan, Henghui Ding, Yibo Yang, Xia Li, Jiangning Zhang, Yunhai Tong, Xudong Jiang, Bernard Ghanem, and Dacheng Tao
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024.
  5. Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation
    Xiangtai Li, Haobo Yuan, Wenwei Zhang, Guangliang Cheng, Jiangmiao Pang, and Chen Change Loy
    In ICCV, 2023. Paris, France.
  6. Monocular Road Planar Parallax Estimation
    Haobo Yuan, Teng Chen, Wei Sui, Jiafeng Xie, Lefei Zhang, Yuan Li, and Qian Zhang
    IEEE Transactions on Image Processing, 2023.
  7. Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning
    Yibo Yang, Haobo Yuan, Xiangtai Li, Zhouchen Lin, Philip Torr, and Dacheng Tao
    In ICLR, 2023. Kigali, Rwanda. Spotlight.
  8. PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
    Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, and Dacheng Tao
    In ECCV, 2022. Tel Aviv, Israel.
    Winner method of the ICCV-2021 SemKITTI-DVPS Challenge.
  9. Multi-Task Learning with Multi-query Transformer for Dense Prediction
    Yangyang Xu, Xiangtai Li, Haobo Yuan, Yibo Yang, and Lefei Zhang
    IEEE Transactions on Circuits and Systems for Video Technology, 2023.
  10. Towards Theoretically Inspired Neural Initialization Optimization
    Yibo Yang, Hong Wang, Haobo Yuan, and Zhouchen Lin
    In NeurIPS, 2022. New Orleans, LA, USA.
  11. BOSSA: a decentralized system for proofs of data retrievability and replication
    Dian Chen, Haobo Yuan, Shengshan Hu, Qian Wang, and Cong Wang
    IEEE Transactions on Parallel and Distributed Systems, 2021.
  12. arXiv
    Neural Collapse Terminus: A Unified Solution for Class Incremental Learning and Its Variants
    Yibo Yang, Haobo Yuan, Xiangtai Li, Jianlong Wu, Lefei Zhang, Zhouchen Lin, Philip Torr, Dacheng Tao, and Bernard Ghanem
    arXiv pre-print, 2023.
  13. arXiv
    Transformer-based Visual Segmentation: A Survey
    Xiangtai Li, Henghui Ding, Wenwei Zhang, Haobo Yuan, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, and Chen Change Loy
    arXiv pre-print, 2023.
  14. arXiv
    PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
    Xiangtai Li, Shilin Xu, Yibo Yang, Haobo Yuan, Guangliang Cheng, Yunhai Tong, Zhouchen Lin, Ming-Hsuan Yang, and Dacheng Tao
    arXiv pre-print, 2023.
  15. arXiv
    Point Could Mamba: Point Cloud Learning via State Space Model
    Tao Zhang, Xiangtai Li, Haobo Yuan, Shunping Ji, and Shuicheng Yan
    arXiv pre-print, 2024.