Haobo Yuan

PhD Student @ UC Merced

yuanhaobo.jpg

Hi, I am a PhD student at UC Merced, working with Prof. Ming-Hsuan Yang. I received degrees of B.E. (with honors, Hongyi Honors College, 2020) and M.S. (School of Computer Science, 2023) both in computer science and technology at Wuhan University. During my master’s study, I was fortunately supervised by Prof. Lefei Zhang.

Currently, I am working on research projects mainly about computer vision. Feel free to email me for any form of discussion.🔥

News

Aug 15, 2024 Starting my PhD journey at UC Merced.
Jul 1, 2024 Open-Vocabulary SAM has been accepted by ECCV 2024.
Feb 27, 2024 OMG-Seg got accepted by CVPR 2024.
Jan 30, 2024 One paper (survey about open-vocabulary learning) got accepted by TPAMI.
Aug 16, 2023 Day 1 @ MMLab, NTU. My new voyage begins. 🚢
Jul 14, 2023 One paper got accepted by ICCV 2023. 🎉🎉🎉
Jun 12, 2023 One paper finally got accepted by IEEE TIP (CCF-A journal). 😅
Jan 21, 2023 One paper got accepted by ICLR 2023 (Spotlight). 🎉🎉🎉
Sep 15, 2022 One paper got accepted by NeurIPS 2022. 🎉🎉🎉
Jul 4, 2022 One paper got accepted by ECCV 2022. 🎉🎉🎉

Publications

  1. Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
    Haobo YuanXiangtai LiChong Zhou, Yining Li, Kai Chen, and Chen Change Loy
    In ECCV, 2024. Milano, Italy.
  2. OMG-Seg: Is One Model Good Enough For All Segmentation?
    Xiangtai LiHaobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, and Chen Change Loy
    In CVPR, 2024. Seattle, WA, USA.
  3. Transformer-based Visual Segmentation: A Survey
    Xiangtai Li, Henghui Ding, Haobo YuanWenwei Zhang, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, and Chen Change Loy
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024.
  4. Towards Open Vocabulary Learning: A Survey
    Jianzong Wu, Xiangtai LiShilin XuHaobo Yuan, Henghui Ding, Yibo Yang, Xia Li, Jiangning Zhang, Yunhai Tong, Xudong Jiang, Bernard Ghanem, and Dacheng Tao
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024.
  5. PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
    Xiangtai LiShilin XuYibo YangHaobo Yuan, Guangliang Cheng, Yunhai Tong, Zhouchen Lin, Ming-Hsuan Yang, and Dacheng Tao
    IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024.
  6. Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation
    Xiangtai LiHaobo YuanWenwei Zhang, Guangliang Cheng, Jiangmiao Pang, and Chen Change Loy
    In ICCV, 2023. Paris, France.
  7. Monocular Road Planar Parallax Estimation
    Haobo Yuan, Teng Chen, Wei Sui, Jiafeng Xie, Lefei Zhang, Yuan Li, and Qian Zhang
    IEEE Transactions on Image Processing, 2023.
  8. Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning
    Yibo YangHaobo YuanXiangtai Li, Zhouchen Lin, Philip Torr, and Dacheng Tao
    In ICLR, 2023. Kigali, Rwanda. Spotlight.
  9. PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation
    Haobo YuanXiangtai LiYibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, and Dacheng Tao
    In ECCV, 2022. Tel Aviv, Israel.
    Winner method of the ICCV-2021 SemKITTI-DVPS Challenge.
  10. Multi-Task Learning with Multi-query Transformer for Dense Prediction
    Yangyang Xu, Xiangtai LiHaobo YuanYibo Yang, and Lefei Zhang
    IEEE Transactions on Circuits and Systems for Video Technology, 2023.
  11. Towards Theoretically Inspired Neural Initialization Optimization
    Yibo Yang, Hong Wang, Haobo Yuan, and Zhouchen Lin
    In NeurIPS, 2022. New Orleans, LA, USA.
  12. BOSSA: a decentralized system for proofs of data retrievability and replication
    Dian Chen, Haobo Yuan, Shengshan Hu, Qian Wang, and Cong Wang
    IEEE Transactions on Parallel and Distributed Systems, 2021.
  13. arXiv
    Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
    Haobo YuanXiangtai LiLu QiTao ZhangMing-Hsuan Yang, Shuicheng Yan, and Chen Change Loy
    arXiv preprint, 2024.
  14. arXiv
    OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
    Tao ZhangXiangtai Li, Hao Fei, Haobo Yuan, Shengqiong Wu, Shunping Ji, Chen Change Loy, and Shuicheng Yan
    arXiv preprint, 2024.
  15. arXiv
    Point Could Mamba: Point Cloud Learning via State Space Model
    Tao ZhangXiangtai LiHaobo Yuan, Shunping Ji, and Shuicheng Yan
    arXiv pre-print, 2024.
  16. arXiv
    LLAVADI: What Matters For Multimodal Large Language Models Distillation
    Shilin XuXiangtai LiHaobo YuanLu Qi, Yunhai Tong, and Ming-Hsuan Yang
    arXiv preprint, 2024.
  17. arXiv
    RAP-SAM:Towards Real-Time All-Purpose Segment Anything
    Shilin XuHaobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, and Ming-Hsuan Yang
    arXiv preprint, 2024.
  18. arXiv
    Neural Collapse Terminus: A Unified Solution for Class Incremental Learning and Its Variants
    Yibo YangHaobo YuanXiangtai Li, Jianlong Wu, Lefei Zhang, Zhouchen Lin, Philip TorrDacheng Tao, and Bernard Ghanem
    arXiv pre-print, 2023.