基本信息

刘静  研究员  中科院自动化研究所 

国家优秀青年科学基金获得者


电子邮件: jliu@nlpr.ia.ac.cn
通信地址: 北京海淀中关村东路95号智能化大厦1302
邮政编码: 100190

研究领域

视频图像分析与理解;视觉与语言;多媒体分析与检索

招生信息

   
招生专业
081104-模式识别与智能系统
081203-计算机应用技术
招生方向

多模态预训练
具身智能

多媒体分析与理解

图像语义理解

招生要求
专业背景不限,但需对科研怀有强烈的兴趣,具有良好的数学基础、编程能力以及自主学习能力。 
由于招生数量有限,希望提前与我联系。 
另招收少量实习生,要求计算机相关专业的在读本科或研究生,具有较强的编程能力,实习期半年以上者优先。

工作经历

2015-11--至今        中国科学院自动化研究所 模式识别国家重点实验室 研究员

2010-11--2015-10 中国科学院自动化研究所 模式识别国家重点实验室 副研究员

2008-01--2010-10 中国科学院自动化研究所 模式识别国家重点实验室 助理研究员

教授课程

多媒体分析与理解
科学前沿进展名家系列讲座III
多媒体信息处理
本科生毕业设计(计算机科学与技术)

论文发表

    • [1] Sun, Mingzhen, Wang, Weining, Zhu, Xinxin, Liu, Jing. Reparameterizing and dynamically quantizing image features for image generation. PATTERN RECOGNITION[J]. 2024, 146:.
    • [2] Zikang Liu, Sihan Chen, Guo Longteng, Xingjian He, Jing Liu. Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner. ACM International Conference on Multimedia. 2023, 
    • [3] 刘静, 郭龙腾. GPT-4对多模态大模型在多模态理解、生成、交互上的启发. 中国科学基金[J]. 2023, 37(05期): 793-802.
    • [4] Yanyuan Qiao, Zheng Yu, Jing Liu, Qi Wu. March in chat: Interactive prompting for remote embodied referring expression. ICCV 2023, 
    • [5] Zhao, Zijia, Guo, Longteng, He, Xingjian, Shao, Shuai, Yuan, Zehuan, Liu, Jing. MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning. SIGIR 2023, 
    • [6] Mingzhen Sun, Weining Wang, Zihan Qin, Jiahui Sun, Sihan Chen, Jing Liu. GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER. NeurIPS 2023, 
    • [7] Jiawei Liu, Weining Wang, Sihan Chen, Xinxin Zhu, Jing Liu. Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation. IEEE Trans. on Multimedia[J]. 2023, 
    • [8] Zhang, Kun, Wu, Le, Lv, Guangyi, Chen, Enhong, Ruan, Shulan, Liu, Jing, Zhang, Zhiqiang, Zhou, Jun, Wang, Meng. Description-Enhanced Label Embedding Contrastive Learning for Text Classification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2023,
    • [9] Mingzhen Sun, Weining Wang, Xinxin Zhu, Jing Liu. MOSO: Decomposing MOtion, Scene and Object for Video Prediction. CVPR. 2023, 
    • [10] Sihan Chen, Handong Li, Qunbo Wang, Zijia Zhao, Mingzhen Sun, Xinxin Zhu, Jing Liu. VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset. NeurIPS 2023, 
    • [11] Jiawei Liu, Hao Wang, Weining Wang, Jing Liu. WL-MSR: Watch and Listen for Multimodal Subtitle Recognition. ICASSP 2023, 
    • [12] He, Xingjian, Liu, Jing, Wang, Weining, Lu, Hanqing. An Efficient Sampling-Based Attention Network for Semantic Segmentation. IEEE TRANSACTIONS ON IMAGE PROCESSING[J]. 2022, 31: 2850-2863, http://dx.doi.org/10.1109/TIP.2022.3162101.
    • [13] Jie Jiang, Jing Liu, Jun Fu, Xinxin Zhu, Zechao Li, Lu, Hanqing. Global-Guided Selective Context Network for Scene Parsing. IEEE Trans. Neural Networks Learn. Syst[J]. 2022, 33(4): 1752-1764, 
    • [14] Weining Wang, Tianwei Lin, Dongliang He, Fu Li, Shilei Wen, Liang Wang, Jing Liu. Semi-Supervised Temporal Action Proposal Generation via Exploiting 2-D Proposal Map.. IEEE transactions on Multimedia (TMM)[J]. 2022, 24: 3624-3635, 
    • [15] Xu, Bingxiang, Li, Xiaoli, Gao, Xiaomeng, Jia, Yan, Liu, Jing, Li, Feifei, Zhang, Zhihua. DeNOPA: decoding nucleosome positions sensitively with sparse ATAC-seq data. BRIEFINGS IN BIOINFORMATICS[J]. 2022, 23(1): 
    • [16] He Xingjian, Liu Jing, Fu Jun, Zhu Xinxin, Wang, Jinqiao, Lu Hanqing. Consistent-Separable Feature Representation for Semantic Segmentation. AAAI 2021, 
    • [17] Fu, Jun, Liu, Jing, Jiang, Jie, Li, Yong, Bao, Yongjun, Lu, Hanqing. Scene Segmentation With Dual Relation-Aware Attention Network. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2021, 32(6): 2547-2560, http://dx.doi.org/10.1109/TNNLS.2020.3006524.
    • [18] Wang, Hao, Wang, Weining, Liu, Jing. Temporal Memory Attention for Video Semantic Segmentation. ICIP. 2021, http://arxiv.org/abs/2102.08643.
    • [19] Sihan Chen, Xinxin Zhu, Sihan Chen, Wei Liu, Jiawei Liu, Zijia Zhao, Longteng Guo, Liu Jing. MM21Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques. ACMMM 2021.
    • [20] Fei Liu, Jing Liu, Weining Wang, Lu Hanqing. HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering. ICCV 2021, 
    • [21] Liu, Fei, Liu, Jing, Fang, Zhiwei, Hong, Richang, Lu, Hanqing. Visual Question Answering With Dense Inter- and Intra-Modality Interactions. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2021, 23: 3518-3529, 
    • [22] Guo, Longteng, Liu, Jing, Zhu, Xinxin, He, Xingjian, Jiang, Jie, Lu, Hanqing. Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning. IJCAI. 2020, http://arxiv.org/abs/2005.04690.
    • [23] Qiao, Yanyuan, Yu, Zheng, Liu, Jing, IEEE. RANKVQA: ANSWER RE-RANKING FOR VISUAL QUESTION ANSWERING. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME)[J]. 2020, 
    • [24] Jiang, Jie, Liu, Jing, Fu, Jun, Zhu, Xinxin, Lu, Hanqing. POINT SET ATTENTION NETWORK FOR SEMANTIC SEGMENTATION. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP). 2020, 2186-2190, 
    • [25] Guo Longteng, Liu Jing, Zhu Xinxin, Yao Peng, Lu Shichen, Lu Hanqing. Normalized and Geometry-Aware Self-Attention Network for Image Captioning. 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2020, http://arxiv.org/abs/2003.08897.
    • [26] Yao, Peng, Li, Jiangyun, Guo, Longteng, Liu, Jing, IEEE. MODELING LOCAL AND GLOBAL CONTEXTS FOR IMAGE CAPTIONING. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME). 2020,
    • [27] Fei Liu, Jing Liu, Xinxin Zhu, Richang Hong, Hanqing Lu. Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering. ACM MM. 2020, 
    • [28] Fu, Jun, Liu, Jing, Li, Yong, Bao, Yongjun, Yan, Weipeng, Fang, Zhiwei, Lu, Hanqing. Contextual deconvolution network for semantic segmentation. PATTERN RECOGNITION[J]. 2020, 101: 107152-, http://dx.doi.org/10.1016/j.patcog.2019.107152.
    • [29] Guo, Longteng, Liu, Jing, Lu, Shichen, Lu, Hanqing. Show, Tell, and Polish: Ruminant Decoding for Image Captioning. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2020, 22(8): 2149-2162, http://dx.doi.org/10.1109/TMM.2019.2951226.
    • [30] Liu Jing. Visual Question Answering with Dense Inter-and Intra-modality Interactions. IEEE Transactions on Multimedia. 2020, 
    • [31] Qiao, Yanyuan, Yu, Zheng, Liu, Jing, IEEE. VC-VQA: VISUAL CALIBRATION MECHANISM FOR VISUAL QUESTION ANSWERING. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) 2020, 1481-1485, 
    • [32] Fang, Zhiwei, Liu, Jing, Li, Yong, Qiao, Yanyuan, Lu, Hanqing. Improving visual question answering using dropout and enhanced question encoder. PATTERN RECOGNITION[J]. 2019, 90(1): 404-414, http://ir.ia.ac.cn/handle/173211/23483.
    • [33] Fei Liu, Jing Liu, Zhiwei Fang, Richang Hong, Hanqing Lu. Densely Connected Attention Flow for Visual Question Answering. IJCAI[J]. 2019, [37] Liu Jing. Multi-StyleGAN: Multi-Style Image Captioning from Non-Parallel Data using GANs and Back-Translation. CVPR. 2019,
    • [34] Fu, Jun, Liu, Jing, Tian, Haijie, Li, Yong, Bao, Yongjun, Fang, Zhiwei, Lu, Hanqing. Dual Attention Network for Scene Segmentation. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019)null. 2019, 3141-3149, 
    • [35] Fang Zhiwei, Liu Jing, Tang Qu, Li Yong, Lu Hanqing, Jawahar CV, Li H, Mori G, Schindler K. Answer Distillation for Visual Question Answering. COMPUTER VISION - ACCV 2018, 11361: 72-87,
    • [36] Fu, Jun, Liu, Jing, Wang, Yuhang, Li, Yong, Bao, Yongjun, Tang, Jinhui, Lu, Hanqing. Adaptive Context Network for Scene Parsing. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) 2019, 6747-6756, 
    • [37] Liu, Fei, Liu, Jing, Hong, Richang, Lu, Hanqing. Erasing-based Attention Learning for Visual Question Answering. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19)null. 2019, 1175-1183, http://dx.doi.org/10.1145/3343031.3350993.
    • [38] Liu, Fei, Liu, Jing, Fang, Zhiwei, Lu, Hanqing. Language and Visual Relations Encoding for Visual Question Answering. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP)null. 2019, 3307-3311, 

更多详细内容