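Basic Information

Zhang Junge (张俊格), male, Researcher, doctoral supervisor, Institute of Automation, Chinese Academy of Sciences
Email: jgzhang@nlpr.ia.ac.cn
Address: 95 Zhongguancun East Road, Haidian District, Beijing
Postal code: 100190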


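Distinguished Core-Position Researcher, Chinese Academy of Sciences (中国科学院特聘核心岗位研究员)

Excellent Member, Youth Innovation Promotion Association, Chinese Academy of Sciences

Chinese Academy of Sciences Stable Support Program for Young Teams in Basic Research (中国科学院稳定支持基础研究青年团队)

Beijing Nova Program (北京市科技新星)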

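Research Areas

Game intelligence (博弈智能), reinforcement learning, multi-agent systems, large decision-making models (LLM-as-agent and beyond), artificial general intelligence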


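Admissions Information

Students are expected to have a strong foundation in programming (C/C++, Python), mathematics, and English (for example, a CET-6 score of at least 550).


One recommended-admission slot for a direct-entry PhD student, trained through the School of Frontier Interdisciplinary Sciences (前沿交叉科学学院) of the University of Chinese Academy of Sciences, is still available. Students with strong programming skills from applied mathematics, statistical physics, statistical mechanics, and related majors are welcome to apply.


The group trains doctoral students for basic AI theory and applied basic research. Computing resources are plentiful (nearly 400 cards), so students are expected to bring a strong sense of mission and enthusiasm for AGI, to work on important, first-rate problems, and to publish top-tier papers.


The group has long-standing collaborations with well-known central and state-owned enterprises and private companies, including China Energy (国家能源), State Power Investment Corporation (国家电投), State Grid (国家电网), CASC (航天科技), CASIC (航天科工), AVIC (航空工业), CSSC (中国船舶), China North Industries Group (中国兵器), CETC (中国电科), China Mobile, and Huawei, as well as with research institutes under the relevant ministries and commissions.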





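Education

2008-09--2013-06   Institute of Automation, Chinese Academy of Sciences   PhD
Education level
Doctoral student; advisor: Academician Tan Tieniu (谭铁牛)

Degree
PhD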

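Work Experience

Employment History
2013-07--present, Institute of Automation, Chinese Academy of Sciences, Assistant Researcher, Associate Researcher, Researcher
Professional Service
2021-03-29--2026-12-31, IEEE CIS Games Technical Committee, Member
2018-07-30--2025-12-30, Youth Working Committee, China Society of Image and Graphics, Member
2017-08-31--2026-12-31, Technical Committee on Hybrid Intelligence, Chinese Association of Automation, Member
2017-04-30--2024-12-31, Expert Committee, Zhongguancun Administrative Committee (中关村管委会), Member
2015-09-30--2027-12-31, Technical Committee on Computer Vision, China Computer Federation, Member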

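Patents and Awards

Awards
(1) Excellent Member, Youth Innovation Promotion Association, Chinese Academy of Sciences, ministerial level, 2023
(2) "Miaosuan: Human-Machine Confrontation Platform" (《庙算:人机对抗平台》), Chinese Institute of Command and Control, Science and Technology Progress Award, First Prize, ministerial level, 2023
(3) "Theory and Methods of Image Semantic Understanding Based on Structured Cognitive Learning" (《基于结构化认知学习的图像语义理解理论与方法》), China Society of Image and Graphics, Natural Science Award, Second Prize, ministerial level, 2021
(4) Beijing Nova Program (北京市科技新星), provincial level, 2019
(5) Third place (international), AIIDE StarCraft AI Competition, other, 2018
(6) Fourth place (international), AIIDE StarCraft AI Competition, other, 2017
(7) Nomination, Excellent Doctoral Dissertation, Chinese Association for Artificial Intelligence, ministerial level, 2013
(8) International champion, PASCAL VOC, other, 2011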

Publications


Papers

(1) BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning, AAAI, 2024, corresponding author
(2) Position: Foundation Agents as the Paradigm Shift for Decision Making, ICML, 2024, corresponding author
(3) TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient, AAAI, 2024, corresponding author
(4) Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models, AAMAS, 2024, corresponding author
(5) ProAgent: Building Proactive Cooperative AI with Large Language Models, AAAI, 2024, corresponding author
(6) Exemplar-based Continual Learning via Contrastive Learning, IEEE Transactions on Artificial Intelligence (TAI), 2024, corresponding author
(7) PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning, AAMAS, 2024, corresponding author
(8) Task-wise Prompt Query Function for Rehearsal-free Continual Learning, ICASSP, 2024, corresponding author
(9) ADMN: Agent-Driven Modular Network for Dynamic Parameter Sharing in Cooperative Multi-Agent Reinforcement Learning, IJCAI, 2024, corresponding author
(10) Population-Based Diverse Exploration for Sparse-Reward Multi-Agent Tasks, IJCAI, 2024, corresponding author
(11) Learning Individual Difference Rewards in Multi-Agent Reinforcement Learning, AAMAS, 2023, corresponding author
(12) Contrastive Correlation Preserving Replay for Online Continual Learning, IEEE CSVT, 2023, corresponding author
(13) Dynamic Equilibrium-Based Continual Learning Model with Disentangled Meta-features, IEEE SMC, 2023, corresponding author
(14) Leveraging Joint-action Embedding in Multi-agent Reinforcement Learning for Cooperative Games, TOG, 2023, corresponding author
(15) Squeezing More Past Knowledge for Online Class-Incremental Continual Learning, IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, corresponding author
(16) Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks, AAAI, 2023, corresponding author
(17) Intelligent Decision-Making Technologies and Challenges in Wargaming (兵棋推演的智能决策技术与挑战), Acta Automatica Sinica (自动化学报), 2023
(18) Prioritized Tasks Mining for Multi-Task Cooperative Multi-Agent Reinforcement Learning, AAMAS, 2023, corresponding author
(19) Learnable Weighting Mechanism in Model-based Reinforcement Learning (基于模型的强化学习中可学习的样本加权机制), Journal of Software (软件学报), 2023
(20) PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination, AAMAS, 2023, corresponding author
(21) Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks, IJCAI, 2023, corresponding author
(22) DecisionHoldem: Safe Depth-Limited Solving With Diverse Opponents for Imperfect-Information Games, 2022
(23) RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning, IJCNN, 2022
(24) Deep Reinforcement Learning With Part-Aware Exploration Bonus in Video Games, IEEE TRANSACTIONS ON GAMES, 2022, corresponding author
(25) Offline reinforcement learning with representations for actions, INFORMATION SCIENCES, 2022, corresponding author
(26) Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning, 2022
(27) Learning Macromanagement in Starcraft by Deep Reinforcement Learning, SENSORS, 2021
(28) Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning, AAAI, 2021
(29) Universal adversarial perturbations against object detection, PATTERN RECOGNITION, 2021
(30) Survey on deep learning based cross-modal retrieval (基于深度学习的跨模态检索综述), Journal of Image and Graphics (中国图象图形学报), 2021
(31) Composing Good Shots by Exploiting Mutual Relations, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, corresponding author
(32) Human-Machine Gaming Intelligence: A Review, Science China, 2020
(33) Opponent Strategy Recognition In Real Time Strategy Game Using Deep Feature Fusion Neural Network, 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020
(34) Intelligent technologies of human-computer gaming (人机对抗智能技术), Scientia Sinica Informationis (中国科学:信息科学), 2020
(35) Learning to Learn Cropping Models for Different Aspect Ratio Requirements, IEEE Conference on Computer Vision and Pattern Recognition, 2020, corresponding author
(36) Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping, IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY, 2019
(37) SparseMask: Differentiable Connectivity Learning for Dense Image Prediction, 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019
(38) Mixed supervised object detection with robust objectness transfer, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019
(39) GP-GAN: Towards Realistic High-Resolution Image Blending, PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019
(40) MVP-Net: Multi-view FPN with Position-Aware Attention for Deep Universal Lesion Detection, MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT VI, 2019
(41) Few-Shot Image Recognition with Knowledge Transfer, 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019
(42) Multi-view clustering via joint feature selection and partially constrained cluster label learning, PATTERN RECOGNITION, 2019, corresponding author
(43) Transductive Zero-Shot Learning via Visual Center Adaptation, THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, corresponding author
(44) Transductive Zero-Shot Learning with Visual Structure Constraint, ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019
(45) Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning, THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019
(46) Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019
(47) Fast End-to-End Trainable Guided Filter, 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018
(48) ACM: Learning Dynamic Multi-agent Cooperation via Attentional Communication Model, ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018
(49) Deep Semantic Structural Constraints for Zero-Shot Learning, 2018
(50) Df^2net: Discriminative feature learning and fusion network for RGB-D indoor scene classification, AAAI, 2018, first author
(51) A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping, 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018
(52) DF2Net: A Discriminative Feature Learning and Fusion Network for RGB-D Indoor Scene Classification, 2018
(53) DF(2)Net: A Discriminative Feature Learning and Fusion Network for RGB-D Indoor Scene Classification, THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018
(54) Deep Semantic Structural Constraints for Zero-Shot Learning, THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018
(55) Discriminative learning of latent features for zero-shot recognition, 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018
(56) SEMANTICS-GUIDED MULTI-LEVEL RGB-D FEATURE FUSION FOR INDOOR SEMANTIC SEGMENTATION, 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017
(57) Semantics-guided Multi-level RGB-D Feature Fusion for Indoor Semantic Segmentation, 2017
(58) GRMA: Generalized Range Move Algorithms for the Efficient Optimization of MRFs, INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017
(59) Local structured representation for generic object detection, FRONTIERS OF COMPUTER SCIENCE, 2017, first author, corresponding author
(60) Encyclopedia Enhanced Semantic Embedding for Zero-Shot Learning, 2017
(61) ORGM: Occlusion Relational Graphical Model for Human Pose Estimation, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017
(62) FastLCD: Fast Label Coordinate Descent for the Efficient Optimization of 2D Label MRFs, INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016
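(63) Image Super-Resolution Algorithm Using a Dual-Channel Convolutional Neural Network (利用双通道卷积神经网络的图像超分辨率算法), Journal of Image and Graphics (中国图象图形学报), 2016
(64) ISEE Smart Home (ISH): Smart video analysis for home security, NEUROCOMPUTING, 2015, first author, corresponding author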
(65) Beyond Tree Structure Models: A New Occlusion Aware Graphical Model for Human Pose Estimation, 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015
(66) Mirrored Non-Maximum Suppression for Accurate Object Part Localization, PROCEEDING OF 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION, 2015, first author
(67) Large-Scale Weakly Supervised Object Localization via Latent Category Learning, IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015
(68) Learning Convolutional NonLinear Features for K Nearest Neighbor Image Classification, PROC. INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION 2014, 2014, first author
(69) Robust Object Recognition via Visual Pathway Feedback, ICPR, 2014, first author
(70) Improved Optimization Based on Graph Cuts for Discrete Energy Minimization, INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, 2014
(71) Learning Convolutional NonLinear Features for K Nearest Neighbor Image Classification, 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014
(72) Deformable Object Matching via Deformation Decomposition based 2D Label MRF, IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, 2014, first author
(73) Deformable Object Matching via Deformation Decomposition based 2D Label MRF, 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014
(74) Recent Progress on Object Classification and Detection, PROC. IBEROAMERICAN CONGRESS ON PATTERN RECOGNITION, 2013, first author
(75) An adaptive combination of multiple features for robust tracking in real scene, IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013
(76) Exploring the Power of Kernel in Feature Representation for Object Categorization, INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, 2013, first author
(77) An Adaptive Combination of Multiple Features for Robust Tracking in Real Scene, 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013
(78) Data Decomposition and Spatial Mixture Modeling for Part based Model, SPRINGER BERLIN HEIDELBERG, 2012
(79) Semantic Windows Mining in Sliding Window Based Object Detection, INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2012
(80) Boosted Local Structured HOG-LBP for Object Localization, IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011
(81) Boosted Local Structured HOG-LBP for Object Localization, 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, first author, corresponding author
(82) An Empirical Study of Visual Features for Part Based Model, PATTERN RECOGNITION, 2011

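Research Activities

Research Projects
( 1 ) Talent Program for Excellent Members of the Youth Innovation Promotion Association, Chinese Academy of Sciences, principal investigator, Chinese Academy of Sciences program, 2024-01--2026-12
( 2 ) Key Theories and Methods for Continual Game Learning, principal investigator, Chinese Academy of Sciences program, 2022-01--2023-12
( 3 ) Few-Shot Game Learning and Interpretability Modeling, principal investigator, Chinese Academy of Sciences program, 2020-10--2025-12
( 4 ) AI Training, Learning, and Inference Platform for Intelligent Game Decision-Making, principal investigator, national task, 2020-01--2022-06
( 5 ) Theory and Technology of Game Decision Intelligence, principal investigator, local task, 2019-11--2022-12
( 6 ) Automated Operator Design, principal investigator, domestic commissioned project, 2019-04--2020-07
( 7 ) Key Theories and Technologies in Intelligent Game **, principal investigator, national task, 2019-01--2021-12
( 8 ) Object Detection under Few-Shot Conditions, principal investigator, national task, 2019-01--2022-12
( 9 ) Noisy Few-Shot Learning Techniques for Image Analysis, principal investigator, domestic commissioned project, 2018-09--2021-12
( 10 ) Spatio-Temporal Evolution Analysis of Scene Elements and High-Level Event Detection, principal investigator, national task, 2016-07--2020-12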