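Basic Information
Huaibo Huang  Male  Master's Supervisor  Institute of Automation, Chinese Academy of Sciences
Email: huaibo.huang@cripac.ia.ac.cn
Address: 95 Zhongguancun East Road, Haidian District, Beijing
Postal code: 100190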

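Research Areas

Computer vision, multimodal understanding and generation, visual synthesis and security, image restoration and enhancement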

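Admissions Information

One master's student is admitted each year. Students who are self-motivated and committed to publishing high-quality papers and solving real research problems are welcome to get in touch.

Admissions Majors
081104 - Pattern Recognition and Intelligent Systems
081203 - Computer Application Technology
Admissions Directions
Pattern recognition, computer vision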

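Education and Work Experience

2019-06   University of Chinese Academy of Sciences   Ph.D.
2016-01   Beihang University   Master's degree
2012-07   Xi'an Jiaotong University   Bachelor's degree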

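Employment History
2021-04~present, Institute of Automation, Chinese Academy of Sciences, Associate Researcher
2019-07~2021-04, Institute of Automation, Chinese Academy of Sciences, Assistant Researcher
Academic Service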

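1. Council member, Beijing Society of Image and Graphics

2. Committee member, Visual Big Data Technical Committee, China Society of Image and Graphics

3. Editorial board member of the journals IEEE TIFS and IEEE TBIOM

4. Area chair for academic conferences including ICLR, ACM MM, and PRCV

5. Reviewer for journals and conferences including TPAMI, IJCV, TIP, NeurIPS, ICML, ICLR, CVPR, and ICCV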

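Awards

(1) Beijing Nova Program, provincial level, 2023

(2) Wu Wenjun Artificial Intelligence Science and Technology Award, Technological Invention Award, First Prize, other, 2023

(3) Youth Innovation Promotion Association of the Chinese Academy of Sciences, academy level, 2022

(4) Young Elite Scientists Sponsorship Program, Beijing Association for Science and Technology, other, 2020

(5) Beijing Outstanding Graduate, provincial level, 2019

(6) Chinese Academy of Sciences President's Award (Excellence Prize), academy level, 2019

(7) ICME Workshop Best Student Paper Award, other, 2019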

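Publications

In recent years, more than 90 papers have been published or accepted in leading international journals and conferences in artificial intelligence. Of these, 53 are CCF-A papers, including 3 in TPAMI, 8 in IJCV, 9 in NeurIPS, 14 in CVPR, and 5 in ICCV. One monograph has been published with Springer. Papers as first or corresponding author include 1 in TPAMI, 4 in IJCV, 6 in NeurIPS, 9 in CVPR, and 3 in ICCV.

For the full list of papers, see: personal homepage, Google Scholar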

Journal Papers
  1. Qihang Fan, Huaibo Huang, Mingrui Chen, Hongmin Liu, Ran He. Advancing Vision Transformer with Enhanced Spatial Priors. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2026. (Accepted) (IF: 23.6, CCF-A, top journal in artificial intelligence)
  2. Yuang Ai, Jie Cao, Ran He, Huaibo Huang. Uncertainty-Aware Source-Free Adaptive Image Restoration with State Space Augmentation. International Journal of Computer Vision (IJCV), 2026. (Accepted) (IF: 19.5, CCF-A, top journal in artificial intelligence)
  3. Jiayang Sun, Hongbo Wang, Jie Cao, Huaibo Huang. Marmot: Object-Level Self-Correction via Multi-Agent Reasoning. Machine Intelligence Research (MIR), 2026. (IF: 8.7, leading journal in machine intelligence)
  4. Junxian Duan, Hao Sun, Fan Ji, Kai Zhou, Zhiyong Wang, Huaibo Huang, Lianwen Jin. RealDTT: Towards A Comprehensive Real-World Dataset for Tampered Text Detection. International Journal of Computer Vision (IJCV), 2025. (IF: 19.5, CCF-A, top journal in artificial intelligence)
  5. Nan Gao, Jia Li, Huaibo Huang, Zhi Zeng, Ran He. InfoBFR: Real-World Blind Face Restoration via Information Bottleneck. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025. (IF: 8.4, CCF-B, leading journal in video processing)
  6. Junxian Duan, Siyu Liu, Yiming Hao, Huaibo Huang, Ran He. Dual Frequency-Guided Spatiotemporal Feature Learning for Face Forgery Detection. IEEE Transactions on Biometrics, Behavior, and Identity Science (TBIOM), 2025. (IF: 5.0, leading journal in biometrics)
  7. Junxian Duan, Yuang Ai, Jipeng Liu, Shenyuan Huang, Huaibo Huang, Jie Cao, Ran He. Test-time Forgery Detection with Spatial-Frequency Prompt Learning. International Journal of Computer Vision (IJCV), 2024. (IF: 19.5, CCF-A, top journal in artificial intelligence)
  8. Xiaoqiang Zhou, Chaoyou Fu, Huaibo Huang, Ran He. Dynamic Graph Memory Bank for Video Inpainting. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024. (IF: 8.4, CCF-B, leading journal in video processing)
  9. Xiaoqiang Zhou, Huaibo Huang, Zilei Wang, Ran He. RISTRA: Recursive Image Super-resolution Transformer with Relativistic Assessment. IEEE Transactions on Multimedia (TMM), 2024. (IF: 7.3, CCF-A, leading journal in multimedia)
  10. Huaibo Huang, Mandi Luo, Ran He. Memory Uncertainty Learning for Real-World Single Image Deraining. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023. (IF: 23.6, CCF-A, top journal in artificial intelligence)
  11. Chaoyou Fu, Xiang Wu, Yibo Hu, Huaibo Huang, Ran He. DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022. (IF: 23.6, CCF-A, top journal in artificial intelligence)
  12. Jianze Wei, Huaibo Huang, Yunlong Wang, Ran He, Zhenan Sun. Towards More Discriminative and Robust Iris Recognition by Learning Uncertain Factors. IEEE Transactions on Information Forensics and Security (TIFS), 2022. (IF: 6.8, CCF-A, top journal in information security)
  13. Jianze Wei, Yunlong Wang, Huaibo Huang, Ran He, Zhenan Sun, Xingyu Gao. Contextual Measures for Iris Recognition. IEEE Transactions on Information Forensics and Security (TIFS), 2022. (IF: 6.8, CCF-A, top journal in information security)
  14. Mandi Luo, Haoxue Wu, Huaibo Huang, Weizan He, Ran He. Memory-Modulated Transformer Network for Heterogeneous Face Recognition. IEEE Transactions on Information Forensics and Security (TIFS), 2022. (IF: 6.8, CCF-A, top journal in information security)
  15. Aijing Yu, Haoxue Wu, Huaibo Huang, Zhen Lei, Ran He. LAMP-HQ: A Large-Scale Multi-Pose High-Quality Database for NIR-VIS Face Recognition. International Journal of Computer Vision (IJCV), 2021. (IF: 19.5, CCF-A, top journal in artificial intelligence)
  16. Huaibo Huang, Aijing Yu, Zhenhua Chai, Ran He, Tieniu Tan. Selective Wavelet Attention Learning for Single Image Deraining. International Journal of Computer Vision (IJCV), 2021. (IF: 19.5, CCF-A, top journal in artificial intelligence)
  17. Xin Ma, Xiaoqiang Zhou, Huaibo Huang, Gengyun Jia, Zhenhua Chai, Xiaolin Wei. Contrastive Attention Network with the Dense Field Estimation for Face Completion. Pattern Recognition (PR), 2021. (IF: 8, CCF-B, leading journal in pattern recognition)
  18. Yi Li#, Huaibo Huang#, Jie Cao, Ran He, Tieniu Tan. Disentangled Representation Learning of Makeup Portraits in the Wild. International Journal of Computer Vision (IJCV), 2020, 128: 2166–2184. (Co-first author) (IF: 19.5, CCF-A, top journal in artificial intelligence)
  19. Xin Zheng, Yanqing Guo, Huaibo Huang, Yi Li, Ran He. A Survey to Deep Facial Attribute Analysis. International Journal of Computer Vision (IJCV), 2020. (IF: 19.5, CCF-A, top journal in computer vision)
  20. Xin Zheng, Huaibo Huang, Yanqing Guo, Ran He. BLAN: Bi-directional Ladder Attentive Network for Facial Attribute Prediction. Pattern Recognition (PR), 2020. (IF: 8, CCF-B, leading journal in pattern recognition)
  21. Huaibo Huang, Ran He, Zhenan Sun, Tieniu Tan. Wavelet Domain Generative Adversarial Network for Multi-scale Face Hallucination. International Journal of Computer Vision (IJCV), 127(6-7): 763-784, 2019. (IF: 19.5, CCF-A, top journal in artificial intelligence)
Conference Papers
  1. Qihang Fan, Yuang Ai, Huaibo Huang, Ran He. Random Wins All: Rethinking Grouping Strategies for Vision Tokens. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A, top conference in computer vision)
  2. Mingrui Chen, Hexiong Yang, Haogeng Liu, Huaibo Huang, Ran He. Think 360°: Evaluating the Width-centric Reasoning Capability of MLLMs Beyond Depth. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A, top conference in computer vision)
  3. Jiayang Sun, Pin Wang, Hongbo Wang, Xinyue Liu, Huaibo Huang, Ran He. Towards Fine-Grained Attribution: Instance-Aware Preference Optimization for Aligning Diffusion Models. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A, top conference in computer vision)
  4. Xinyue Liu, Jin Liu, Hongbo Wang, Ran He, Huaibo Huang. Think-Then-Generate: Structural Chain-of-Thought Reasoning for Consistent 3D Generation. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A, top conference in computer vision)
  5. Shiran Ge, Chenyi Huang, Yuang Ai, Qihang Fan, Huaibo Huang, Ran He. Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A, top conference in computer vision)
  6. Xing Cui, Yueying Zou, Zekun Li, Peipei Li, Xinyuan Xu, Xuannan Liu, Huaibo Huang, Ran He. T²Agent: A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search. AAAI Conference on Artificial Intelligence (AAAI), 2026. (Oral, CCF-A, top conference in artificial intelligence)
  7. Yuang Ai, Qihang Fan, Xuefeng Hu, Zhenheng Yang, Ran He, Huaibo Huang. DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling. Neural Information Processing Systems (NeurIPS), 2025. (Spotlight, CCF-A, top conference in artificial intelligence)
  8. Xuannan Liu, Zekun Li, Zheqi He, Pei Pei Li, Shuhan Xia, Xing Cui, Huaibo Huang, Xi Yang, Ran He. Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs. Neural Information Processing Systems (NeurIPS), 2025. (CCF-A, top conference in artificial intelligence)
  9. Yuguang Zhang, Qihang Fan, Huaibo Huang*. Vision Transformer with Sparse Scan Prior. ACM International Conference on Multimedia (ACM MM), 2025. (CCF-A, top conference in multimedia computing)
  10. Qihang Fan, Huaibo Huang*, Yuang Ai, Ran He. Rectifying Magnitude Neglect in Linear Attention. International Conference on Computer Vision (ICCV), 2025. (Highlight, CCF-A, top conference in computer vision)
  11. Qihang Fan, Huaibo Huang*, Mingrui Chen, Ran He. Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens. International Conference on Computer Vision (ICCV), 2025. (CCF-A, top conference in computer vision)
  12. Qihang Fan, Huaibo Huang*, Ran He. Breaking the Low-Rank Dilemma of Linear Attention. Computer Vision and Pattern Recognition (CVPR), 2025. (CCF-A, top conference in computer vision)
  13. Xuannan Liu, Zekun Li, Pei Pei Li, Huaibo Huang, Shuhan Xia, et al. MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs. International Conference on Learning Representations (ICLR), 2025. (CCF-A, top conference in artificial intelligence)
  14. Yuang Ai, Xiaoqiang Zhou, Huaibo Huang*, Xiaotian Han, Zhengyu Chen, Quanzeng You, Hongxia Yang. DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation. Neural Information Processing Systems (NeurIPS), 2024. (CCF-A, top conference in machine learning)
  15. Haogeng Liu, Quanzeng You, Xiaotian Han, Yongfei Liu, Huaibo Huang*, Ran He, Hongxia Yang. Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model. Neural Information Processing Systems (NeurIPS), 2024. (CCF-A, top conference in machine learning)
  16. Hongbo Wang, Jin Liu, Xiaoqiang Zhou, Jie Cao, Huaibo Huang*, Ran He. Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation. Neural Information Processing Systems (NeurIPS), 2024. (CCF-A, top conference in machine learning)
  17. Jin Liu, Huaibo Huang*, Jie Cao, Ran He. ZePo: Zero-Shot Portrait Stylization with Faster Sampling. ACM International Conference on Multimedia (ACM MM), 2024. (CCF-A, top conference in multimedia computing)
  18. Xuannan Liu, Pei Pei Li, Huaibo Huang, Zekun Li, Xing Cui, et al. FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs. ACM International Conference on Multimedia (ACM MM), 2024. (CCF-A, top conference in multimedia computing)
  19. Xing Cui, Zekun Li, Peipei Li, Huaibo Huang, Xuannan Liu, Zhaofeng He. InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser. European Conference on Computer Vision (ECCV), 2024. (CCF-B, top conference in computer vision)
  20. Tingkai Liu, Yunzhe Tao, Haogeng Liu, Qihang Fan, Ding Zhou, Huaibo Huang, Ran He, Hongxia Yang. DeVAn: Dense Video Annotation for Video-Language Models. Association for Computational Linguistics (ACL), 2024. (CCF-A, top conference in natural language processing)
  21. Yuang Ai, Huaibo Huang*, Xiaoqiang Zhou, Jiexiang Wang, Ran He. Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration. Computer Vision and Pattern Recognition (CVPR), 2024. (Corresponding author) (CCF-A, top conference in computer vision)
  22. Yuang Ai, Xiaoqiang Zhou, Huaibo Huang*, Lei Zhang, Ran He. Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer. Computer Vision and Pattern Recognition (CVPR), 2024. (Corresponding author) (CCF-A, top conference in computer vision)
  23. Qihang Fan, Huaibo Huang, Mingrui Chen, Hongmin Liu, Ran He. RMT: Retentive Networks Meet Vision Transformers. Computer Vision and Pattern Recognition (CVPR), 2024. (CCF-A, top conference in computer vision)
  24. Zi Wang, Huaibo Huang, Aihua Zheng, Ran He. Heterogeneous Test-time Training for Multi-modal Person Re-identification. AAAI Conference on Artificial Intelligence (AAAI), 2024. (CCF-A, top conference in artificial intelligence)
  25. Qihang Fan, Huaibo Huang, Xiaoqiang Zhou, Ran He. Lightweight Vision Transformer with Bidirectional Interaction. Neural Information Processing Systems (NeurIPS), 2023. (CCF-A, top conference in machine learning)
  26. Rui Wang, Pei Pei Li, Huaibo Huang, Chunshui Cao, Ran He, Zhaofeng He. Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification. Neural Information Processing Systems (NeurIPS), 2023. (CCF-A, top conference in machine learning)
  27. Xiaoqiang Zhou, Huaibo Huang, Zilei Wang, Jie Hu, Ran He, Tieniu Tan. MSRA-SR: Image Super-resolution Transformer with Multi-scale Shared Representation Acquisition. International Conference on Computer Vision (ICCV), 2023. (CCF-A, top conference in computer vision)
  28. Peipei Li, Rui Wang, Huaibo Huang, Ran He, Zhaofeng He. Pluralistic Aging Diffusion Autoencoder. International Conference on Computer Vision (ICCV), 2023. (CCF-A, top conference in computer vision)
  29. Huaibo Huang, Xiaoqiang Zhou, Jie Cao, Ran He, Tieniu Tan. Vision Transformer with Super Token Sampling. Computer Vision and Pattern Recognition (CVPR), 2023. (CCF-A, top conference in computer vision)
  30. Huaibo Huang, Xiaoqiang Zhou, Ran He. Orthogonal Transformer: An Efficient Vision Transformer Backbone with Token Orthogonalization. Neural Information Processing Systems (NeurIPS), 2022. (CCF-A, top conference in machine learning)
  31. Gengyun Jia, Huaibo Huang, Chaoyou Fu, Ran He. Rethinking Image Cropping: Exploring Diverse Compositions from Global Views. Computer Vision and Pattern Recognition (CVPR), 2022. (CCF-A, top conference in computer vision)
  32. Xin Xie, Yi Li, Huaibo Huang, Haiyan Fu, Wanwan Wang, Yanqing Guo. Artistic Style Discovery With Independent Components. Computer Vision and Pattern Recognition (CVPR), 2022. (CCF-A, top conference in computer vision)
  33. Huaibo Huang, Aijing Yu, Ran He. Memory Oriented Transfer Learning for Semi-Supervised Image Deraining. Computer Vision and Pattern Recognition (CVPR), 2021. (CCF-A, top conference in computer vision)
  34. Gege Gao, Huaibo Huang, Chaoyou Fu, Zhaoyang Li, Ran He. Information Bottleneck Disentanglement for Identity Swapping. Computer Vision and Pattern Recognition (CVPR), 2021. (CCF-A, top conference in computer vision)
  35. Peipei Li#, Huaibo Huang#, Yibo Hu, Xiang Wu, Ran He, Zhenan Sun. Hierarchical Face Aging through Disentangled Latent Characteristics. European Conference on Computer Vision (ECCV), 2020. (Co-first author) (CCF-B, top conference in computer vision)
  36. Jie Cao, Huaibo Huang, Yi Li, Jingtuo Liu, Ran He, Zhenan Sun. Informative Sample Mining Network for Multi-Domain Image-to-Image Translation. European Conference on Computer Vision (ECCV), 2020. (CCF-B, top conference in computer vision)
  37. Hao Zhu, Huaibo Huang, Yi Li, Aihua Zheng, Ran He. Arbitrary Talking Face Generation via Attentional Audio-Visual Coherence Learning. International Joint Conference on Artificial Intelligence (IJCAI), 2020. (CCF-B, top conference in artificial intelligence)
  38. Chaoyou Fu, Xiang Wu, Yibo Hu, Huaibo Huang, Ran He. Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. Neural Information Processing Systems (NeurIPS), 2019. (CCF-A, top conference in machine learning)
  39. Weikuo Guo, Huaibo Huang, Xiangwei Kong, Ran He. Learning Disentangled Representation for Cross-Modal Retrieval with Deep Mutual Information Estimation. ACM International Conference on Multimedia (ACM MM), 2019. (CCF-A, top conference in artificial intelligence)
  40. Xiang Wu, Huaibo Huang, Vishal Patel, Ran He, Zhenan Sun. Disentangled Variational Representation for Heterogeneous Face Recognition. AAAI Conference on Artificial Intelligence (AAAI), 2019. (CCF-A, top conference in artificial intelligence)
  41. Rui Wang, Huaibo Huang, Xufeng Zhang, Jixin Ma, Aihua Zheng. A Novel Distance Learning for Elastic Cross Modal Audio-Visual Matching. IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 2019. (Best Student Paper) (CCF-B, leading conference in computer graphics and multimedia)
  42. Huaibo Huang, Zhihang Li, Ran He, Zhenan Sun, Tieniu Tan. IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis. Neural Information Processing Systems (NeurIPS), 2018: 52-63. (CCF-A, top conference in machine learning)
  43. Huaibo Huang, Ran He, Zhenan Sun, Tieniu Tan. Wavelet-SRNet: A Wavelet-based CNN for Multi-scale Face Super Resolution. International Conference on Computer Vision (ICCV), 2017: 1698-1706. (CCF-A, top conference in computer vision)
Books
Yi Li, Huaibo Huang, Ran He, Tieniu Tan. Heterogeneous Facial Analysis and Synthesis. Springer, 2020.

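Research Projects

( 1 ) Research on multimodal visual content generation with joint privacy-utility optimization, Principal Investigator, national project, 2026-01--2029-12

( 2 ) Research on foundation models for image enhancement in open environments, Principal Investigator, local project, 2025-01--2027-12

( 3 ) Research on video processing algorithms based on video generation priors, Principal Investigator, domestic commissioned project, 2025-01--2025-12

( 4 ) Research on multimodal-fusion methods and applications for intelligent cataract diagnosis, Principal Investigator, local project, 2024-09--2026-09

( 5 ) Beijing Nova Program (Innovation Nova track), Principal Investigator, local project, 2023-10--2026-10

( 6 ) Research on MindSpore-based intelligent synthesis and detection of visual content, Principal Investigator, domestic commissioned project, 2022-11--2023-11

( 7 ) Youth Innovation Promotion Association of the Chinese Academy of Sciences, Principal Investigator, Chinese Academy of Sciences program, 2022-03--2025-12

( 8 ) Research on few-shot automatic video generation for complex scenes, Principal Investigator, domestic commissioned project, 2021-09--2022-08

( 9 ) Research on theory and methods for high-fidelity face image synthesis based on disentangled representation, Principal Investigator, national project, 2021-01--2023-12

( 10 ) Research on the stability of generative models for hybrid multimedia forgery, Principal Investigator, national project, 2020-07--2023-06

( 11 ) Face enhancement, rotation, and sketch conversion technology, Principal Investigator, domestic commissioned project, 2018-12--2024-06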