发表论文
[1] Qi, Zhaobo, Wang, Shuhui, Su, Chi, Su, Li, Huang, Qingming, Tian, Qi. Self-Regulated Learning for Egocentric Video Activity Anticipation. IEEE Transactions on Pattern Analysis and Machine Intelligence[J]. 2023, 第 2 作者 通讯作者 45(6): 6715-6730, http://dx.doi.org/10.1109/TPAMI.2021.3059923.[2] Ying Yu, Xiaojun lin, Shuhui Wang, Weiguo Sheng, 黄庆明, Jun Yu. A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes. IEEE Transactions on Circuits and Systems for Video Technology[J]. 2023, 第 3 作者[3] Guorong Li, Ye, Hanhua, Yuankai Qi, Shuhui Wang, Laiyun Qing, 黄庆明, Ming-Hsuan Yang. Learning Hierarchical Modular Networks for Video Captioning. IEEE Transactions on Pattern Analysis and Machine Intelligence[J]. 2023, 第 4 作者46(2): 1049-1064, https://ieeexplore.ieee.org/document/10296527.[4] 卓君宝, 王树徽, 黄庆明. Uncertainty Modeling for Robust Domain Adaptation Under Noisy Environments. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2023, 第 2 作者[5] Chen, Weidong, Li, Guorong, Zhang, Xinfeng, Wang, Shuhui, Li, Liang, Huang, Qingming. Weakly Supervised Text-based Actor-Action Video Segmentation by Clip-level Multi-instance Learning. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS[J]. 2023, 第 4 作者19(1): http://dx.doi.org/10.1145/3514250.[6] Zhang, Weigang, Qi, Zhaobo, Wang, Shuhui, Su, Chi, Su, Li, Huang, Qingming. Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS[J]. 2023, 第 3 作者19(6): http://dx.doi.org/10.1145/3568312.[7] XiaoDan Li, Chen, Yuefeng, Yao Zhu, 王树徽, Rong Zhang, Hui Xue. ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing. CVPR 2023. 2023, 第 4 作者 通讯作者 [8] Han, Xinzhe, Wang, Shuhui, Su, Chi, Huang, Qingming, Tian, Qi. General Greedy De-bias Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence[J]. 2023, 第 2 作者 通讯作者 http://arxiv.org/abs/2112.10572.[9] 孙隽姝, 王树徽, 杨晨雪, 黄庆明, 郑振刚. 附加特征图增强的图卷积神经网络. 计算机学报[J]. 2023, 第 2 作者46(9): 1900-1918, http://lib.cqvip.com/Qikan/Article/Detail?id=7110463459.[10] Junshu Sun, Shuhui Wang, Xinzhe Han, Zhe Xue, Qingming Huang. All in a Row: Compressed Convolution Networks for Graph. International Conference on Machine Learning. 2023, 第 2 作者 通讯作者 [11] Zhaobo Qi, Shuhui Wang, Chi Su, Li Su, Qingming Huang, Qi Tian. Self-Regulated Learning for Egocentric Video Activity Anticipation. IEEE Transactions on Pattern Analysis and Machine Intelligence[J]. 2023, 第 2 作者 通讯作者 https://ieeexplore.ieee.org/document/9356220.[12] Zhengqi Pei, Shuhui Wang. Dynamics-inspired Neuromorphic Visual Representation Learning. International Conference on Machine Learning. 2023, 第 2 作者 通讯作者 [13] Ding, Guanqi, Han, Xinzhe, Wang, Shuhui, Wu, Shuzhe, Jin, Xin, Tu, Dandan, Huang, Qingming. Attribute Group Editing for Reliable Few-shot Image Generation. IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022, 第 3 作者 通讯作者 [14] Sheng Fang, Shuhui Wang, Junbao Zhuo, Qingming Huang, Bin Ma, Xiaoming Wei, Xiaolin Wei. Concept Propagation via Attentional Knowledge Graph Reasoning for Video-Text Retrieval. ACM International Conference on Multimedia. 2022, 第 2 作者 通讯作者 [15] Weidong Chen, Dexiang Hong, Qi, Yuankai, Zhenjun Han, Shuhui Wang, Laiyun Qing, Qingming Huang, 李国荣. Multi-Attention Network for Compressed Referring Video Object Segmentation. ACM International Conference on Multimedia. 2022, 第 5 作者[16] Ye, Hanhua, Li, Guorong, Qi, Yuankai, Wang, Shuhui, Huang, Qingming, Yang, MingHsuan. Hierarchical Modular Network for Video Captioning. IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022, 第 4 作者[17] Deng, Jincan, Li, Liang, Zhang, Beichen, Wang, Shuhui, Zha, Zhengjun, Huang, Qingming. Syntax-Guided Hierarchical Attention Network for Video Captioning. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY[J]. 2022, 第 4 作者32(2): 880-892, http://dx.doi.org/10.1109/TCSVT.2021.3063423.[18] 邓锦灿, Li Liang, Zhang Beichen, 王树徽, Zheng-Jun Zha, Huang, Qingming. Syntax-Guided Hierarchical Attention Network for Video Captioning. IEEE Transactions on Circuit System and Video Technology[J]. 2022, 第 4 作者32(2): 880-892, [19] Junbao Zhuo, Yan Zhu, Shuhao Cui, Shuhui Wang, Bin Ma, Qingming Huang, Xiaoming Wei, Xiaolin Wei. Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer. ACM International Conference on Multimedia. 2022, 第 4 作者 通讯作者 [20] 黄庆明, 王树徽, 许倩倩, 李亮, 蒋树强. 以图像视频为中心的跨媒体分析与推理. 智能系统学报[J]. 2021, 第 2 作者16(5): 835-848, http://lib.cqvip.com/Qikan/Article/Detail?id=7106020823.[21] Zhang, Jinghao, Zhu, Yanqiao, Liu, Qiang, Wu, Shu, Wang, Shuhui, Wang, Liang. Mining Latent Structures for Multimedia Recommendation. THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA(MM)[J]. 2021, 第 5 作者http://arxiv.org/abs/2104.09036.[22] Yang, Shijie, Li, Liang, Wang, Shuhui, Zhang, Weigang, Huang, Qingming, Tian, Qi. Graph Regularized Encoder-Decoder Networks for Image Representation Learning. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2021, 第 3 作者23: 3124-3136, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000698902000014.[23] Han, Xinzhe, Wang, Shuhui, Su, Chi, Huang, Qingming, Tian, Qi. Greedy Gradient Ensemble for Robust Visual Question Answering. ICCV. 2021, 第 2 作者 通讯作者 http://arxiv.org/abs/2107.12651.[24] 王树徽, 闫旭, 黄庆明. 跨媒体分析与推理技术研究综述. 计算机科学[J]. 2021, 第 1 作者48(3): 79-86, http://lib.cqvip.com/Qikan/Article/Detail?id=7103984849.[25] Mao, Xiaofeng, Chen, Yuefeng, Wang, Shuhui, Su, Hang, He, Yuan, Xue, Hui. Composite Adversarial Attacks. AAAI. 2021, 第 3 作者http://arxiv.org/abs/2012.05434.[26] Yan, Xu, Fei, Zhengcong, Li, Zekang, Wang, Shuhui, Huang, Qingming, Tian, Qi. Semi-Autoregressive Image Captioning. ACM Multimedia. 2021, 第 4 作者 通讯作者 [27] Liu, Xuejing, Li, Liang, Wang, Shuhui, Zha, ZhengJun, Huang, Qingming. Local-binarized very deep residual network for visual categorization. NEUROCOMPUTING[J]. 2021, 第 3 作者430: 82-93, http://dx.doi.org/10.1016/j.neucom.2020.11.041.[28] Qi, Zhaobo, Wang, Shuhui, Su, Chi, Su, Li, Huang, Qingming, Tian, Qi. Self-Regulated Learning for Egocentric Video Activity Anticipation. 2021, 第 2 作者[29] Li, Xiaodan, Li, Jinfeng, Chen, Yuefeng, Ye, Shaokai, He, Yuan, Wang, Shuhui, Su, Hang, Xue, Hui. QAIR: Practical Query-efficient Black-Box Attacks for Image Retrieval. CVPR. 2021, 第 6 作者http://arxiv.org/abs/2103.02927.[30] Chen Weidong, Li Guorong, Zhang Xinfeng, Yu Hongyang, 王树徽, Huang Qingming. Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence. ACM Multimedia. 2021, 第 5 作者[31] Song, Guoli, Wang, Shuhui, Huang, Qingming, Tian, Qi. Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2021, 第 2 作者 通讯作者 23: 1882-1894, http://dx.doi.org/10.1109/TMM.2020.3004963.[32] Liu, Mengyi, Wang, Shuhui, Guo, Yulan, He, Yuan, Xue, Hui. Pano-SfMLearner: Self-Supervised Multi-Task Learning of Depth and Semantics in Panoramic Videos. IEEE SIGNAL PROCESSING LETTERS[J]. 2021, 第 2 作者28: 832-836, http://dx.doi.org/10.1109/LSP.2021.3073627.[33] Wu, Yiling, Wang, Shuhui, Song, Guoli, Huang, Qingming. Augmented Adversarial Training for Cross-Modal Retrieval. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2021, 第 2 作者 通讯作者 23: 559-571, https://www.webofscience.com/wos/woscc/full-record/WOS:000613560200004.[34] Song, Guoli, Wang, Shuhui, Huang, Qingming, Tian, Qi. Harmonized Multimodal Learning with Gaussian Process Latent Variable Models. IEEE Transactions on Pattern Analysis and Machine Intelligence[J]. 2021, 第 2 作者 通讯作者 43(3): 858-872, https://www.webofscience.com/wos/woscc/full-record/WOS:000616309900008.[35] Jingru Gan, Jinchang Luo, Haiwei Wang, Wang Shuhui, Wei He, Huang, Qingming. Multimodal Entity Linking: A New Dataset and A Baseline.. ACMMM(CCF-A, oral). 2021, 第 4 作者 通讯作者 [36] 韩歆哲, Wang Shuhui, Chi Su, Zhang, Weigang, Huang, Qingming, Qi Tian. Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision. ECCV. 2020, 第 2 作者 通讯作者 [37] Cui Shuhao, Wang Shuhui, Zhuo Junbao, Su Chi, Huang Qingming, Tian Qi. Gradually Vanishing Bridge for Adversarial Domain Adaptation. 2020, 第 2 作者http://arxiv.org/abs/2003.13183.[38] Zhang Beichen, Li Liang, Yang Shijie, Wang Shuhui, Zheng-Jun Zha, Huang, Qingming. State-relabling adversarial active learning. CVPR(CCF-A, oral). 2020, 第 4 作者[39] Wang, Shuhui, Hu, Ling, Li, Liang, Zhang, Weigang, Huang, Qingming. Two-stream deep sparse network for accurate and efficient image restoration. COMPUTER VISION AND IMAGE UNDERSTANDING[J]. 2020, 第 1 作者200: http://dx.doi.org/10.1016/j.cviu.2020.103029.[40] Qi, Zhaobo, Wang, Shuhui, Su, Chi, Su, Li, Zhang, Weigang, Huang, Qingming. Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis. ACMMM. 2020, 第 2 作者 通讯作者 [41] Cui, Shuhao, 王树徽, Zhuo, Junbao, Li Liang, Huang Qingming, Tian Qi. Towards discriminability and diversity: batch nuclear-norm maximization on output under label insufficient situations. IEEE CVPR. 2020, 第 2 作者 通讯作者 [42] Song, Guoli, Wang Shuhui, Huang, Qingming, Tian Qi. Learning Feature Representation and Partial Correlation for Multimodal Multi-Labeled Data. IEEE Transactions on Multimedia(TMM)[J]. 2020, 第 2 作者 通讯作者 [43] Wu, Yiling, Wang, Shuhui, Huang, Qingming. Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2020, 第 2 作者 通讯作者 22(5): 1310-1322, http://dx.doi.org/10.1109/TMM.2019.2942494.[44] Cui, Shuhao, Wang, Shuhui, Zhuo, Junbao, Li, Liang, Huang, Qingming, Tian, Qi, IEEE. Towards Discriminability and Diversity: Batch Nuclear-norm Maximization under Label Insufficient Situations. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR). 2020, 第 11 作者3940-3949, [45] Li Liang. A structured latent variable recurrent network with stochastic attention for generating Weibo comments. IJCAI. 2020, [46] Meng Dechao, Li Liang, Liu Xuejing, Li Yadong, Yang Shijie, Zha Zhengjun, Gao Xingyu, Wang Shuhui, Huang Qingming. Parsing-based View-aware Embedding Network for Vehicle Re-Identification. 2020, 第 8 作者http://arxiv.org/abs/2004.05021.[47] 卓君宝, 苏驰, 王树徽, 黄庆明. 最小熵迁移对抗散列方法. 计算机研究与发展[J]. 2020, 第 3 作者57(4): 888-896, https://kns.cnki.net/KCMS/detail/detail.aspx?dbcode=CJFQ&dbname=CJFDLAST2020&filename=JFYZ202004018&v=MjA0MjV4WVM3RGgxVDNxVHJXTTFGckNVUjdxZVp1ZHVGeXJrVkwvT0x5dlNkTEc0SE5ITXE0OUViSVI4ZVgxTHU=.[48] Cui, Shuhao, Jin, Xuan, Wang, Shuhui, He, Yuan, Huang, Qingming. Heuristic Domain Adaptation. 2020, 第 3 作者http://arxiv.org/abs/2011.14540.[49] Wei Jun, Wang Shuhui, Wu Zhe, Su Chi, Huang Qingming, Tian Qi. Label Decoupling Framework for Salient Object Detection. 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2020, 第 2 作者 通讯作者 http://arxiv.org/abs/2008.11048.[50] Zhaobo QI, Shuhui Wang, Chi Su, Su Li, Qingming Huang. Towards More Explainability: Concept Knowledge Mining Network for Event Recognition. ACMMM(CCF-A). 2020, 第 2 作者 通讯作者 [51] Li Xiaodan, Lang Yining, Chen Yuefeng, Mao Xiaofeng, He Yuan, Wang Shuhui, Xue Hui, Lu Quan. Sharp Multiple Instance Learning for DeepFake Video Detection. 2020, 第 6 作者http://arxiv.org/abs/2008.04585.[52] Guo, Dan, Wang, Hui, Wang, Shuhui, Wang, Meng. Textual-Visual Reference-Aware Attention Network for Visual Dialog. IEEE TRANSACTIONS ON IMAGE PROCESSING[J]. 2020, 第 3 作者29: 6655-6666, http://dx.doi.org/10.1109/TIP.2020.2992888.[53] Zhuo, Junbao, Wang, Shuhui, Cui, Shuhao, Huang, Qingming, IEEE Comp Soc. Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019). 2019, 第 11 作者750-759, [54] Li, Liang, Zhu, Xinge, Hao, Yiming, Wang, Shuhui, Gao, Xingyu, Huang, Qingming. A Hierarchical CNN-RNN Approach for Visual Emotion Classification. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS[J]. 2019, 第 4 作者 通讯作者 15(3): https://www.webofscience.com/wos/woscc/full-record/WOS:000535718800013.[55] Yang, Shijie, Li, Liang, Wang, Shuhui, Zhang, Weigang, Huang, Qingming, Tian, Qi. SkeletonNet: A Hybrid Network With a Skeleton-Embedding Process for Multi-View Image Representation Learning. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2019, 第 3 作者21(11): 2916-2929, http://dx.doi.org/10.1109/TMM.2019.2912735.[56] Wei Jun, Wang Shuhui, Huang Qingming. F3Net: Fusion, Feedback and Focus for Salient Object Detection. 2019, 第 2 作者http://arxiv.org/abs/1911.11445.[57] Wu, Yiling, Wang, Shuhui, Huang, Qingming. Multi-modal semantic autoencoder for cross-modal retrieval. NEUROCOMPUTING[J]. 2019, 第 2 作者 通讯作者 331: 165-175, http://dx.doi.org/10.1016/j.neucom.2018.11.042.[58] Liu Xuejing, Li Liang, Wang Shuhui, Zha ZhengJun, Meng Dechao, Huang Qingming. Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding. 2019, 第 3 作者http://arxiv.org/abs/1908.10568.[59] Wu, Yiling, Wang, Shuhui, Song, Guoli, Huang, Qingming. Online Asymmetric Metric Learning With Multi-Layer Similarity Aggregation for Cross-Modal Retrieval. IEEE TRANSACTIONS ON IMAGE PROCESSING[J]. 2019, 第 2 作者 通讯作者 28(9): 4299-4312, http://dx.doi.org/10.1109/TIP.2019.2908774.[60] Xue, Zhe, Li, Guorong, Wang, Shuhui, Huang, Jun, Zhang, Weigang, Huang, Qingming. Beyond global fusion: A group-aware fusion approach for multi-view image clustering. INFORMATION SCIENCES[J]. 2019, 第 3 作者493: 176-191, http://dx.doi.org/10.1016/j.ins.2019.04.034.[61] Liu Xuejing, Li Liang, Wang Shuhui, Zha ZhengJun, Su Li, Huang Qingming. Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding. 2019, 第 3 作者http://arxiv.org/abs/1909.02860.[62] Xin Yongjian, Wang Shuhui, Li Liang, Zhang Weigang, Huang Qingming, Jawahar CV, Li H, Mori G, Schindler K. Reverse Densely Connected Feature Pyramid Network for Object Detection. COMPUTER VISION - ACCV 2018, PT V. 2019, 第 11 作者11365: 530-545, [63] Liu, Xuejing, Li, Liang, Wang, Shuhui, Zha, ZhengJun, Su, Li, Huang, Qingming, ACM. Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19). 2019, 第 3 作者539-547, http://dx.doi.org/10.1145/3343031.3351074.[64] Wu, Yiling, Wang, Shuhui, Song, Guoli, Huang, Qingming, ACM. Learning Fragment Self-Attention Embeddings for Image-Text Matching. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19). 2019, 第 11 作者2088-2096, http://dx.doi.org/10.1145/3343031.3350940.[65] Wang, Shuhui, Li, Liang, Yang, Chenxue, Huang, Qingming. Regularized topic-aware latent influence propagation in dynamic relational networks. GEOINFORMATICA[J]. 2019, 第 1 作者23(3): 329-352, [66] Yang, Shijie, Li, Liang, Wang, Shuhui, Meng, Dechao, Huang, Qingming, Tian, Qi, ACM. Structured Stochastic Recurrent Network for Linguistic Video Prediction. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19). 2019, 第 3 作者21-29, http://dx.doi.org/10.1145/3343031.3350859.[67] Hu Ling, Wang Shuhui, Li Liang, Huang Qingming, Baozong Y, Qiuqi R, Yao Z, Gaoyun AN. How Functions Evolve in Deep Convolutional Neural Network. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP). 2018, 第 2 作者1133-1138, [68] He, Jianfeng, Ma, Bingpeng, Wang, Shuhui, Liu, Yugui, Huang, Qingming. Multi-label double-layer learning for cross-modal retrieval. NEUROCOMPUTING[J]. 2018, 第 3 作者275: 1893-1902, http://dx.doi.org/10.1016/j.neucom.2017.10.032.[69] Chen, Yangyu, Wang, Shuhui, Zhang, Weigang, Huang, Qingming, Ferrari, V, Hebert, M, Sminchisescu, C, Weiss, Y. Less Is More: Picking Informative Frames for Video Captioning. COMPUTER VISION - ECCV 2018, PT XIII. 2018, 第 11 作者11217: 367-384, [70] Li Liang, Wang Shuhui, Jiang Shuqiang, Huang Qingming, ACM. Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18). 2018, 第 11 作者1092-1100, http://dx.doi.org/10.1145/3240508.3240649.[71] Wu Yiling, Wang Shuhui, Huang Qingming, ACM. Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18). 2018, 第 11 作者825-833, http://dx.doi.org/10.1145/3240508.3240521.[72] Xu, Zijun, Su, Li, Wang, Shuhui, Huang, Qingming, Zhang, Yuan, IEEE. S2L: SINGLE-STREAMLINE FOR COMPLEX VIDEO EVENT DETECTION. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018). 2018, 第 3 作者[73] Chen, Yangyu, Zhang, Weigang, Wang, Shuhui, Li, Liang, Huang, Qingming, IEEE. Saliency-Based Spatiotemporal Attention for Video Captioning. 2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM). 2018, 第 3 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000630423400053.[74] Xue, Zhe, Li, Guorong, Wang, Shuhui, Zhang, Weigang, Huang, Qingming. Bilevel Multiview Latent Space Learning. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY[J]. 2018, 第 3 作者28(2): 327-341, http://dx.doi.org/10.1109/TCSVT.2016.2607842.[75] Liu, Siyuan, Qu, Qiang, Wang, Shuhui. Heterogeneous anomaly detection in social diffusion with discriminative feature discovery. INFORMATION SCIENCES[J]. 2018, 第 3 作者439: 1-18, http://ir.siat.ac.cn:8080/handle/172644/13947.[76] Mao, Xiaofeng, Wang, Shuhui, Zheng, Liying, Huang, Qingming. Semantic invariant cross-domain image generation with generative adversarial networks. NEUROCOMPUTING[J]. 2018, 第 2 作者293: 55-63, http://dx.doi.org/10.1016/j.neucom.2018.02.092.[77] Wang Shuhui, Chen Yangyu, Zhuo Junbao, Huang Qingming, Tian Qi, ACM. Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18). 2018, 第 1 作者1398-1406, http://dx.doi.org/10.1145/3240508.3240535.[78] Jianfeng He, Qingming Huang, Weigang Zhang, Qiang Qu, Shuhui Wang. Efficient Cross-modal Retrieval Using Social Tag Information Towards Mobile Applications. 2017, 第 5 作者http://ir.siat.ac.cn:8080/handle/172644/11930.[79] Zhuo, Junbao, Wang, Shuhui, Zhang, Weigang, Huang, Qingming, ACM. Deep Unsupervised Convolutional Domain Adaptation. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17). 2017, 第 11 作者261-269, http://dx.doi.org/10.1145/3123266.3123292.[80] Yang Shijie, Li Liang, Wang Shuhui, Zhang Weigang, Huang Qingming, IEEE. Multi-view Subspace Learning with Diversity Enforced Skeleton Embedding. 2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017). 2017, 第 3 作者121-128, http://dx.doi.org/10.1109/BigMM.2017.33.[81] Song, Guoli, Wang, Shuhui, Huang, Qingming, Tian, Qi. Multimodal Similarity Gaussian Process Latent Variable Model. IEEE TRANSACTIONS ON IMAGE PROCESSING[J]. 2017, 第 2 作者 通讯作者 26(9): 4168-4181, http://dx.doi.org/10.1109/TIP.2017.2713045.[82] Min, Weiqing, Jiang, Shuqiang, Wang, Shuhui, Xu, Ruihan, Cao, Yushan, Herranz, Luis, He, Zhiqiang. A survey on context-aware mobile visual recognition. MULTIMEDIA SYSTEMS[J]. 2017, 第 3 作者23(6): 647-665, http://dx.doi.org/10.1007/s00530-016-0523-8.[83] Song, Guoli, Wang, Shuhui, Huang, Qingming, Tian, Qi, IEEE. Multimodal Gaussian Process Latent Variable Models with Harmonization. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV). 2017, 第 2 作者5039-5047, [84] Wu Yiling, Wang Shuhui, Zhang Weigang, Huang Qingming, IEEE. ONLINE LOW-RANK SIMILARITY FUNCTION LEARNING WITH ADAPTIVE RELATIVE MARGIN FOR CROSS-MODAL RETRIEVAL. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME). 2017, 第 2 作者823-828, [85] Wu, Yiling, Wang, Shuhui, Huang, Qingming, IEEE. Online Asymmetric Similarity Learning for Cross-Modal Retrieval. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017). 2017, 第 2 作者3984-3993, [86] Yang, Shijie, Li, Liang, Wang, Shuhui, Zhang, Weigang, Huang, Qingming, IEEE. A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017). 2017, 第 3 作者7053-7061, [87] Huang Qingming. Bi-Level Multi-View Latent Space Learning. IEEETCSVTCCFB. 2017, [88] Liu, Siyuan, Wang, Shuhui. Trajectory Community Discovery and Recommendation by Multi-Source Diffusion Modeling. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING[J]. 2017, 第 2 作者29(4): 898-911, https://www.webofscience.com/wos/woscc/full-record/WOS:000397581000014.[89] Huang, Jun, Li, Guorong, Wang, Shuhui, Xue, Zhe, Huang, Qingming. Multi-label classification by exploiting local positive and negative pairwise label correlation. NEUROCOMPUTING[J]. 2017, 第 3 作者257: 164-174, http://dx.doi.org/10.1016/j.neucom.2016.12.073.[90] Zhang, Jiaming, Wang, Shuhui, Huang, Qingming. Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY[J]. 2017, 第 2 作者8(3): http://dx.doi.org/10.1145/3001593.[91] Min Weiqing, Jiang Shuqiang, Wang Shuhui, Sang Jitao, Mei Shuhuan, ACM. A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17). 2017, 第 3 作者402-410, http://dx.doi.org/10.1145/3123266.3123272.[92] Q Huang, J Zhang, S Wang, Q Qu. JEREMIE: Joint Semantic Feature Learning via Multi-relational Matrix Completion. 2017, http://ir.siat.ac.cn:8080/handle/172644/11929.[93] Hua, Yan, Wang, Shuhui, Liu, Siyuan, Cai, Anni, Huang, Qingming. Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2016, 第 2 作者 通讯作者 18(6): 1201-1216, https://www.webofscience.com/wos/woscc/full-record/WOS:000376107100021.[94] Hua, Yan, Wang, Shuhui, Liu, Siyuan, Cai, Anni, Huang, Qingming. Cross-Modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation (vol 18, pg 1201, 2016). IEEE TRANSACTIONS ON MULTIMEDIA. 2016, 第 2 作者 通讯作者 18(10): 2127-2127, http://www.corc.org.cn/handle/1471x/2375025.[95] 蒋树强, 闵巍庆, 王树徽. 面向智能交互的图像识别技术综述与展望. 计算机研究与发展[J]. 2016, 第 3 作者53(1): 113-122, http://lib.cqvip.com/Qikan/Article/Detail?id=667688334.[96] 王祯骏, 王树徽, 张维刚, 黄庆明. 基于社交内容的潜在影响力传播模型. 计算机学报[J]. 2016, 第 2 作者39(8): 1528-1540, http://lib.cqvip.com/Qikan/Article/Detail?id=669627939.[97] Chu, Lingyang, Zhang, Yanyan, Li, Guorong, Wang, Shuhui, Zhang, Weigang, Huang, Qingming. Effective Multimodality Fusion Framework for Cross-Media Topic Detection. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY[J]. 2016, 第 4 作者26(3): 556-569, https://www.webofscience.com/wos/woscc/full-record/WOS:000372547400011.[98] He Jianfeng, Ma Bingpeng, Wang Shuhui, Liu Yugui, Huang Qingming, ACM. Cross-modal Retrieval by Real Label Partial Least Squares. MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE. 2016, 第 3 作者227-231, http://dx.doi.org/10.1145/2964284.2967216.[99] Wang Shuhui. Location-Based Parallel Tag Completion for Geo-tagged Social Photo Retrieval. International Conference on Multimedia Retrieval (ICMR). 2015, 第 1 作者[100] Wang Shuhui. Cluster-Sensitive Structured Correlation Analysis for Web Cross Modality Retrieval. Neurocomputing. 2015, 第 1 作者[101] Xue, Zhe, Li, Guorong, Wang, Shuhui, Zhang, Chunjie, Zhang, Weigang, Huang, Qingming, IEEE. GOMES: A GROUP-AWARE MULTI-VIEW FUSION APPROACH TOWARDS REAL-WORLD IMAGE CLUSTERING. 2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME). 2015, 第 3 作者[102] Shen, Li, Sun, Gang, Huang, Qingming, Wang, Shuhui, Lin, Zhouchen, Wu, Enhua. Multi-Level Discriminative Dictionary Learning With Application to Large Scale Image Classification. IEEE TRANSACTIONS ON IMAGE PROCESSING[J]. 2015, 第 4 作者24(10): 3109-3123, http://www.corc.org.cn/handle/1471x/2376455.[103] Liu, Siyuan, Wang, Shuhui, Zhu, Feida. Structured Learning from Heterogeneous Behavior for Social Identity Linkage. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING[J]. 2015, 第 2 作者27(7): 2005-2019, https://www.webofscience.com/wos/woscc/full-record/WOS:000355937800019.[104] Chu, Lingyang, Wang, Shuhui, Liu, Siyuan, Huang, Qingming, Pei, Jian. ALID: Scalable Dominant Cluster Detection. PROCEEDINGS OF THE VLDB ENDOWMENT[J]. 2015, 第 2 作者8(8): 826-837, [105] Wang Shuhui, Wu Yiling, Huang Qingming, IEEE. IMPROVING CROSS-MODAL CORRELATION LEARNING WITH HYPERLINKS. 2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME). 2015, 第 1 作者[106] Zhang Jiaming, Wang Shuhui, Huang Qingming, ACM. Location-Based Parallel Tag Completion for Geo-tagged Social Image Retrieval. ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL. 2015, 第 2 作者355-362, http://dx.doi.org/10.1145/2671188.2749353.[107] Liu, Siyuan, Qu, Qiang, Wang, Shuhui. Rationality Analytics from Trajectories. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA[J]. 2015, 第 3 作者10(1): http://dx.doi.org/10.1145/2735634.[108] Liu, Siyuan, Wang, Shuhui, Liu, Ce, Krishnan, Ramayya. Understanding taxi drivers' routing choices from spatial and social traces. FRONTIERS OF COMPUTER SCIENCE[J]. 2015, 第 2 作者 通讯作者 9(2): 200-209, https://www.webofscience.com/wos/woscc/full-record/WOS:000351519500003.[109] 王树徽, 黄庆明. 异质媒体分析技术研究进展. 集成技术[J]. 2015, 第 1 作者4(2): 7-21, https://jcjs.siat.ac.cn/jcjs/article/abstract/201502002?st=article_issue.[110] Wang, Shuhui, Zhuang, Fuzhen, Jiang, Shuqiang, Huang, Qingming, Tian, Qi. Cluster-sensitive Structured Correlation Analysis for Web cross-modal retrieval. NEUROCOMPUTING[J]. 2015, 第 1 作者 通讯作者 168: 747-760, http://dx.doi.org/10.1016/j.neucom.2015.05.049.[111] Li Guorong. GROUP SENSITIVE CLASSIFIER CHAINS FOR MULTI-LABEL CLASSIFICATION. IEEE International Conference on Multimedia and Expo. 2015, [112] Song, Guoli, Wang, Shuhui, Huang, Qingming, Tian, Qi, IEEE. Similarity Gaussian Process Latent Variable Model for Multi-Modal Data Analysis. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV). 2015, 第 2 作者4050-4058, [113] Song, Xinghang, Jiang, Shuqiang, Wang, Shuhui, Li, Liang, Huang, Qingming. Polysemious visual representation based on feature aggregation for large scale image applications. MULTIMEDIA TOOLS AND APPLICATIONS[J]. 2015, 第 3 作者74(2): 595-611, http://dx.doi.org/10.1007/s11042-014-1975-5.[114] Liu Siyuan, Wang Shuhui, Zhu Feida, Zhang Jinbo, Krishnan Ramayya, ACM SIGMOD. HYDRA: Large-scale Social Identity Linkage via Heterogeneous Behavior Modeling. SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA. 2014, 第 2 作者51-62, http://dx.doi.org/10.1145/2588555.2588559.[115] Huang Jun, Li Guorong, Wang Shuhui, Huang Qingming, Zhou ZH, Wang W, Kumar R, Toivonen H, Pei J, Huang JZ, Wu X. Categorizing Social Multimedia by Neighborhood Decision using Local Pairwise Label Correlation. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW). 2014, 第 3 作者913-920, http://dx.doi.org/10.1109/ICDMW.2014.87.[116] Wang Shuhui, Wang Zhenjun, Jiang Shuqiang, Huang Qingming, IEEE. CROSS MEDIA TOPIC ANALYTICS BASED ON SYNERGETIC CONTENT AND USER BEHAVIOR MODELING. 2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME). 2014, 第 1 作者[117] Hua Yan Tina, Wang Shuhui, Liu Siyuan, Huang Qingming, Cai Anni, Kumar R, Toivonen H, Pei J, Huang JZ, Wu X. TINA: Cross-modal Correlation Learning by Adaptive Hierarchical Semantic Aggregation. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM). 2014, 第 2 作者190-199, [118] Chu, Lingyang, Wang, Shuhui, Zhang, Yanyan, Jiang, Shuqiang, Huang, Qingming, IEEE. GRAPH-DENSITY-BASED VISUAL WORD VOCABULARY FOR IMAGE RETRIEVAL. 2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME). 2014, 第 2 作者[119] Wang Shuhui. WIKI-CMR: A Web Cross Modality database for Studying and Evaluation of Cross Modality Retrival Methods. IEEE International Conference on Multimedia and Expo (ICME). 2013, 第 1 作者[120] Qing He. Xin Jin, Fuzhen Zhuang, Shuhui Wang, Qing He, and Zhongzhi Shi. Shared Structure Learning for Multiple Tasks with Multiple Views, ECML/PKDD13, September 23-27, 2013, Prague, Czech. ECML/PKDD13. 2013, [121] Zhang, Chunjie, Wang, Shuhui, Huang, Qingming, Liu, Jing, Liang, Chao, Tian, Qi. Image classification using spatial pyramid robust sparse coding. PATTERN RECOGNITION LETTERS[J]. 2013, 第 2 作者34(9): 1046-1052, http://dx.doi.org/10.1016/j.patrec.2013.02.013.[122] Zhang Chunjie, Zhang Yifan. Undo the codebook bias by linear transformation for visual applications. ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA. 2013, 533-536, http://ir.ia.ac.cn/handle/173211/4670.[123] Zhang, Yanyan, Li, Guorong, Chu, Lingyang, Wang, Shuhui, Zhang, Weigang, Huang, Qingming, IEEE. CROSS-MEDIA TOPIC DETECTION: A MULTI-MODALITY FUSION FRAMEWORK. 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013). 2013, 第 4 作者[124] Sun, Gang, Wang, Shuhui, Liu, Xuehui, Huang, Qingming, Chen, Yanyun, Wu, Enhua. Accurate and efficient cross-domain visual matching leveraging multiple feature representations. VISUAL COMPUTER[J]. 2013, 第 2 作者29(6-8): 565-575, https://www.webofscience.com/wos/woscc/full-record/WOS:000319478400011.[125] Chu, Lingyang, Jiang, Shuqiang, Wang, Shuhui, Zhang, Yanyan, Huang, Qingming. Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2013, 第 3 作者15(8): 1982-1996, https://www.webofscience.com/wos/woscc/full-record/WOS:000327393900021.[126] Zhang, Chunjie, Wang, Shuhui, Huang, Qingming, Liang, Chao, Liu, Ting, Tian, Qi. Laplacian affine sparse coding with tilt and orientation consistency for image classification. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION[J]. 2013, 第 2 作者24(7): 786-793, http://dx.doi.org/10.1016/j.jvcir.2013.05.004.[127] Wang Shuhui. Cross Concept Local Fisher Discriminant Analysis for Image Classification. Multimedia Modelling (MMM). 2013, 第 1 作者[128] Wang Shuhui. TODMIS: Mining Communities from Trajectories. ACM International Conference on Information and Knowledge Management (CIKM). 2013, 第 1 作者[129] Shen, Li, Wang, Shuhui, Sun, Gang, Jiang, Shuqiang, Huang, Qingming, IEEE. Multi-Level Discriminative Dictionary Learning towards Hierarchical Visual Categorization. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR). 2013, 第 2 作者383-390, [130] Zhang Chunjie, Liu Jing. Beyond bag of words: Image representation in sub-semantic space. ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA. 2013, 497-500, http://ir.ia.ac.cn/handle/173211/4669.[131] Sun Gang, Wang Shuhui, Liu Xuehui, Huang Qingming, Chen Yanyun, Wu Enhua. Accurate and efficient cross-domain visual matching leveraging multiple feature representations. 2013, 第 2 作者565-575, http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000319478400011&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=3a85505900f77cc629623c3f2907beab.[132] Wang, Shuhui, Huang, Qingming, Jiang, Shuqiang, Tian, Qi. (SMKL)-M-3: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications. IEEE TRANSACTIONS ON MULTIMEDIA[J]. 2012, 第 1 作者 通讯作者 14(4): 1259-1274, http://dx.doi.org/10.1109/TMM.2012.2193120.[133] Wang Shuhui, Jiang Shuqiang, Huang Qingming, Tian Qi, IEEE. Multi-feature Metric Learning with Knowledge Transfer among Semantics and Social Tagging. 2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR). 2012, 第 1 作者2240-2247, [134] Wang, Shuhui, Huang, Qingming, Jiang, Shuqiang, Tian, Qi, Qin, Lei. Nearest-neighbor method using multiple neighborhood similarities for social media data mining. NEUROCOMPUTING[J]. 2012, 第 1 作者 通讯作者 95: 105-116, http://dx.doi.org/10.1016/j.neucom.2011.06.039.[135] Shuhui Wang, Qingming Huang, Shuqiang Jiang, Qi Tian, Lei Qin. Nearest-neighbor method using multiple neighborhood similarities for social media data mining. NEUROCOMPUTING[J]. 2012, 第 1 作者95: 105-116, http://dx.doi.org/10.1016/j.neucom.2011.06.039.[136] Wang, Shuhui, Jiang, Shuqiang, Huang, Qingming, Gao, Wen, IEEE. SHOT CLASSIFICATION FOR ACTION MOVIES BASED ON MOTION CHARACTERISTICS. 200815THIEEEINTERNATIONALCONFERENCEONIMAGEPROCESSINGVOLS15. 2008, 第 1 作者2508-2511,