基本信息

张俊格 研究员 博士生导师 中国科学院自动化研究所
电子邮件: jgzhang@nlpr.ia.ac.cn
通信地址: 北京市海淀区中关村东路95号
北京市科技新星,中科院青促会会员
研究领域
博弈决策智能, 强化学习,多智能体系统,通用人工智能
通过对博弈决策、多智能体系统、强化学习以及大模型等的深入研究,探索AGI的可能路径是我们的长期研究梦想。
招生信息
希望学生具有优秀的编程(C/C++, Python)、数学、英语基础(如CET-6不低于550分)。
目前还有博士指标,希望学生数学基础优秀,如果在博弈论、算法博弈论、复杂系统、数理逻辑等方面有基础是显著加分项,欢迎有编程基础的数学、统计物理相关专业学生报考。
我们团队对学生的培养偏向基础学术研究,算力资源充沛,因此希望学生对AGI有着极强的使命感和热情,探索新一代人工智能基础理论、算法和应用,鼓励学生对大胆前沿课题开展自由研究,发表高水平、有影响力学术论文。
教育背景
2008-09--2013-06 中国科学院自动化研究所 博士
学历
博士研究生,导师为谭铁牛院士
学位
博士
工作经历
工作简历
2013-07~现在, 中国科学院自动化研究所, 助理研究员、副研究员、研究员
社会兼职
2021-03-30-2024-04-28,IEEE CIS Games Technical Committee, 委员
2018-08-01-今,中国图象图形学会青年工作委员会委员,
2017-09-01-今,中国自动化学会混合智能专业委员会委员,
2017-05-01-今,中关村管委会专家委员会委员,
2015-10-01-今,中国计算机学会计算机视觉专业委员会委员,
2018-08-01-今,中国图象图形学会青年工作委员会委员,
2017-09-01-今,中国自动化学会混合智能专业委员会委员,
2017-05-01-今,中关村管委会专家委员会委员,
2015-10-01-今,中国计算机学会计算机视觉专业委员会委员,
专利与奖励
奖励信息
(1) 庙算:人机对抗平台, 一等奖, 部委级, 2023(2) 基于结构化认知学习的图像语义理解理论与方法, 二等奖, 部委级, 2021(3) 北京市科技新星, , 省级, 2019(4) 中科院青年促进会成员, 部委级, 2019(5) AIIDE星际争霸AI竞赛国际季军, 其他, 2018(6) AIIDE星际争霸AI竞赛国际第四名, 其他, 2017(7) 中国人工智能学会优秀博士学位论文提名, 部委级, 2013(8) PASCAL VOC国际冠军, 其他, 2011
专利成果
[1] 黄凯奇, 尹奇跃, 张俊格, 徐沛. 基于深度强化学习网络构建对区域敏感的模型的方法. CN: CN114004370A, 2022-02-01.[2] 黄凯奇, 尹奇跃, 张俊格, 徐沛. 基于深度强化学习网络构建多样化搜索策略的模型的方法. CN: CN113962390A, 2022-01-21.[3] 张俊格, 白栋栋, 黄凯奇. 基于动作剪枝的推荐方法、装置、电子设备与存储介质. CN: CN113626720A, 2021-11-09.[4] 张俊格, 尹奇跃, 于彤彤. 一种基于多智能体的实时战略游戏对局方法. CN: CN112755538B, 2021-08-31.[5] 张俊格, 李庆明, 尹奇跃. 非平稳环境中去中心化多智能系统的决策方法. CN: CN112668721B, 2021-07-02.[6] 张俊格, 尹奇跃, 于彤彤. 通用的多智能体博弈算法. CN: CN112755538A, 2021-05-07.[7] 张俊格, 李庆明. 在多任务数据流中持续学习的方法及装置. CN: CN112698933A, 2021-04-23.[8] 张俊格, 李庆明, 尹奇跃. 通用的非平稳环境中去中心化多智能系统的决策方法. CN: CN112668721A, 2021-04-16.[9] 徐名源, 姚春凤, 冯柏岚, 黄凯奇, 张俊格, 陈晓棠, 李德榜. 一种对象检测模型的对抗扰动生成方法和装置. CN: CN109902705A, 2019-06-18.[10] 黄凯奇, 张俊格, 李德榜. 基于强化学习的图片自动裁剪的方法及装置. CN: CN108154464A, 2018-06-12.[11] 张俊格. 一种图像处理方法及系统. CN: CN107391505A, 2017-11-24.[12] 张俊格, 谭铁牛, 黄凯奇, 贾真. 基于百科知识语义增强的零样本分类方法、装置. CN: CN107292349A, 2017-10-24.[13] 黄凯奇, 张俊格, 付连锐. 二维图像人体关节点定位模型的构建方法及定位方法. CN: CN106548194A, 2017-03-29.[14] 黄凯奇, 徐冉, 张俊格. 基于双向递归卷积神经网络的图像超分辨率增强方法. CN: CN106127684A, 2016-11-16.[15] 黄凯奇, 任伟强, 王冲, 张俊格. 一种视觉目标检测与标注方法. CN: CN104217225A, 2014-12-17.[16] 黄凯奇, 任伟强, 张俊格. 一种基于数据与任务驱动的图像分类方法. CN: CN103984959A, 2014-08-13.[17] 吴娜, 陆京, 金永哲, 黄凯奇, 马丹, 张俊格. 基于视频的摔倒检测方法和设备. CN: CN103186902A, 2013-07-03.[18] 黄凯奇, 单言虎, 张俊格, 金永哲, 吴娜. 利用视频检测打架行为的方法和装置. CN: CN102750709A, 2012-10-24.
出版信息
发表论文
(1) Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks, AAAI Conference on Artificial Intelligence (AAAI), 2023, 通讯作者(2) 兵棋推演的智能决策技术与挑战, 自动化学报, 2023, 第 4 作者(3) Prioritized Tasks Mining for Multi-Task Cooperative Multi-Agent Reinforcement Learning, AAMAS, 2023, 第 3 作者(4) PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination, AAMAS, 2023, 通讯作者(5) Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks, IJCAI, 2023, 通讯作者(6) Deep Reinforcement Learning With Part-Aware Exploration Bonus in Video Games, IEEE TRANSACTIONS ON GAMES, 2022, 通讯作者(7) Offline reinforcement learning with representations for actions, INFORMATION SCIENCES, 2022, 通讯作者(8) Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning, 2022, 第 3 作者(9) Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning, AAAI, 2021, 第 3 作者(10) Universal adversarial perturbations against object detection, PATTERN RECOGNITION, 2021, 第 2 作者(11) 基于深度学习的跨模态检索综述, Survey on deep learning based cross-modal retrieval, 中国图象图形学报, 2021, 第 3 作者(12) 人机对抗智能技术, Intelligent technologies of human-computer gaming, 中国科学:信息科学, 2020, 第 3 作者(13) Learning to Learn Cropping Models for Different Aspect Ratio Requirements, IEEE Conference on Computer Vision and Pattern Recognition, 2020, (14) Composing Good Shots by Exploiting Mutual Relations, 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, 第 2 作者(15) Human-Machine Gaming Intelligence: A Review, Science China, 2020, 第 1 作者(16) Opponent Strategy Recognition In Real Time Strategy Game Using Deep Feature Fusion Neural Network, 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, 第 3 作者(17) Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping., IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY, 2019, 第 3 作者(18) SparseMask: Differentiable Connectivity Learning for Dense Image Prediction, 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, 第 2 作者(19) Mixed supervised object detection with robust objectness transfer, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 第 2 作者(20) GP-GAN: Towards Realistic High-Resolution Image Blending, PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, 第 3 作者(21) MVP-Net: Multi-view FPN with Position-Aware Attention for Deep Universal Lesion Detection, MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019, 第 3 作者(22) Few-Shot Image Recognition with Knowledge Transfer, 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, 第 3 作者(23) Multi-view clustering via joint feature selection and partially constrained cluster label learning, PATTERN RECOGNITION, 2019, 通讯作者(24) Transductive Zero-Shot Learning via Visual Center Adaptation, THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 通讯作者(25) Transductive Zero-Shot Learning with Visual Structure Constraint, ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 第 5 作者(26) Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 第 3 作者(27) Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning, THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 第 2 作者(28) DF(2)Net: A Discriminative Feature Learning and Fusion Network for RGB-D Indoor Scene Classification, THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 第 2 作者(29) Deep Semantic Structural Constraints for Zero-Shot Learning, THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 第 3 作者(30) Discriminative learning of latent features for zero-shot recognition, 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, 第 2 作者(31) Fast End-to-End Trainable Guided Filter, 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, 第 3 作者(32) ACM: Learning Dynamic Multi-agent Cooperation via Attentional Communication Model, ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 第 3 作者(33) Deep Semantic Structural Constraints for Zero-Shot Learning, 2018, 第 2 作者(34) Df^2net: Discriminative feature learning and fusion network for RGB-D indoor scene classification, AAAI, 2018, 第 1 作者(35) A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping, 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, 第 3 作者(36) DF2Net: A Discriminative Feature Learning and Fusion Network for RGB-D Indoor Scene Classification., 2018, 第 2 作者(37) SEMANTICS-GUIDED MULTI-LEVEL RGB-D FEATURE FUSION FOR INDOOR SEMANTIC SEGMENTATION, 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, 第 2 作者(38) Semantics-guided Multi-level RGB-D Feature Fusion for Indoor Semantic Segmentation, 2017, 第 2 作者(39) GRMA: Generalized Range Move Algorithms for the Efficient Optimization of MRFs, INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 第 2 作者(40) Local structured representation for generic object detection, FRONTIERS OF COMPUTER SCIENCE, 2017, 通讯作者(41) Encyclopedia Enhanced Semantic Embedding for Zero-Shot Learning, 2017, 第 4 作者(42) ORGM: Occlusion Relational Graphical Model for Human Pose Estimation, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 第 2 作者(43) FastLCD: Fast Label Coordinate Descent for the Efficient Optimization of 2D Label MRFs, INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 第 3 作者(44) 利用双通道卷积神经网络的图像超分辨率算法, 中国图象图形学报, 2016, 第 2 作者(45) ISEE Smart Home (ISH): Smart video analysis for home security, NEUROCOMPUTING, 2015, 通讯作者(46) Beyond Tree Structure Models: A New Occlusion Aware Graphical Model for Human Pose Estimation, 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, 第 2 作者(47) Mirrored Non-Maximum Suppression for Accurate Object Part Localization, PROCEEDING OF 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION, 2015, 第 1 作者(48) Large-Scale Weakly Supervised Object Localization via Latent Category Learning, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 第 4 作者(49) Learning Convolutional NonLinear Features for K Nearest Neighbor Image Classification, 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, 第 3 作者(50) Deformable Object Matching via Deformation Decomposition based 2D Label MRF, IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, 2014, 第 1 作者(51) Learning Convolutional NonLinear Features for K Nearest Neighbor Image Classification, PROC. INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION 2014, 2014, 第 1 作者(52) Improved Optimization Based on Graph Cuts for Discrete Energy Minimization, INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, 2014, 第 2 作者(53) Robust Object Recognition via Visual Pathway Feedback, ICPR, 2014, 第 1 作者(54) Deformable Object Matching via Deformation Decomposition based 2D Label MRF, 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, 第 2 作者(55) Recent Progress on Object Classification and Detection, PROC. IBEROAMERICAN CONGRESS ON PATTERN RECOGNITION, 2013, 第 1 作者(56) An adaptive combination of multiple features for robust tracking in real scene, IEEEINTERNATIONALCONFERENCEONCOMPUTERVISIONICCV, 2013, 第 3 作者(57) Exploring the Power of Kernel in Feature Representation for Object Categorization, INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, 2013, 第 1 作者(58) An Adaptive Combination of Multiple Features for Robust Tracking in Real Scene, 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, 第 3 作者(59) Data Decomposition and Spatial Mixture Modeling for Part based Model, SPRINGER BERLIN HEIDELBERG,2012, 2012, 第 2 作者(60) Semantic Windows Mining in Sliding Window Based Object Detection, INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2012, 第 2 作者(61) An Empirical Study of Visual Features for Part Based Model, PATTERN RECOGNITION,2011, 2011, 第 4 作者(62) Boosted Local Structured HOG-LBP for Object Localization, IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, 第 2 作者(63) Boosted Local Structured HOG-LBP for Object Localization, 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, 通讯作者(64) Offline Reinforcement Learning with Representations for Actions, INFORMATION SCIENCES, 第 3 作者
科研活动
科研项目
( 1 ) 博弈决策智能理论与技术研究, 主持, 省级, 2019-11--2022-12( 2 ) 中科院青年促进会项目, 主持, 部委级, 2019-01--2022-12( 3 ) 智能博弈**中的关键理论与技术研究, 主持, 国家级, 2019-01--2021-12( 4 ) 场景元素的时空演化分析与高层次事件检测, 主持, 国家级, 2016-07--2020-12( 5 ) 小样本条件下的物体检测研究, 主持, 国家级, 2019-01--2022-12( 6 ) 小样本博弈学习与可解释性建模, 主持, 部委级, 2020-10--2024-12( 7 ) 智能博弈决策AI训练学习与推理平台, 主持, 国家级, 2020-01--2022-06( 8 ) 面向图像分析的带噪声小样本学习技术, 主持, 院级, 2018-09--2021-12( 9 ) 自动化算子设计, 主持, 院级, 2019-04--2020-07