基本信息

陶建华

清华大学 教授 博士生导师


国家杰出青年科学基金获得者

国家万人计划领军人才

国务院政府特殊津贴


邮件:jhtao@tsinghua.edu.cn

研究领域

语音处理、认知推理、人机交互、情感计算、概率图模型、大数据分析

教育背景

   
学历

  • 清华大学计算机系     2001年获博士学位
  • 南京大学电子系        1996年获硕士学位
  • 南京大学信息物理系  1993年获本科学位


主要任职

 
期刊任职

  • IEEE Transactions on Affective Computing,Steering Committee Member
  • Speech Communication,Subject Editor
  • International Journal on Multimodal User Interface, Editorial Board Member
  • International Journal on Synthetic Emotions,Editorial Board Member
  • 计算机研究与发展,编委
  • 虚拟现实与智能硬件,副主编
  • 声学学报,编委
  • 信号处理学报,编委


学会任职

  • Chairperson, Special Interest Group on Chinese Spoken Language Processing (SIG-CSLP), International Speech Communicaiton Association (ISCA)
  • AAAC Association,Executive Committee Member
  • 中国计算机学会,会士、常务理事(兼语音对话与听觉专委副主任)
  • 中国人工智能学会,理事(兼人工心理与人工情感专委副主任)
  • 中国中文信息学会,理事(兼语音信息处理专业委员会副主任)
  • 中国图形图像学会,理事(兼人机交互专业委员会主任)
  • 中国声学学会,理事
  • 中国语言学会语音学分会,副主任
  • 中文语言资源联盟,秘书长

会议主席

  • Technical Program Committee Chair, INTERSPEECH 2020
  • General Chair, NCMMSC 2015, 2017
  • Technical Program Committee Chair, IEEE International Workshop on Marchine Learning for Signal Processing (IEEE MLSP)
  • Technical Program Committee Chair, ACII 2005
  • General Chair, ACII Asia 2018

奖励

(1) 2014年获北京市科技进步二等奖
(2) 2018年获中国电子学会科技进步一等奖
(3) 2021年获中国电子学会技术发明一等奖

(4) 2022年获中国人工智能学会吴文俊技术发明特等奖
(5) 2013年、2014年、2016年、2017年全国人机交互学术会议优秀论文或提名
​(7) 2001 年、2015年、2017年全国人机语音会议优秀论

发表论文

1.     Ya Li,Jianhua Tao,Wei Lai,Xiaoying Xu, "Quantitative intonation modeling of interrogative sentences for Mandarin speech synthesis" Speech Communication,Volume 89, PP:92-102,2017.05,.

2.     Zhengqi Wen,Kehuang Li,Zhen Huang,Chin-Hui Lee,Jianhua Tao, "Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning" Journal of Signal Processing Systems,2017.10.02,DOI 10.1007/s11265-017-1293-z..

3.     Yibin Zheng,Ya Li,Zhengqi Wen,Bin Liu and Jianhua Tao, "Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin" IEEE Journal of Signal Processing System,2017.9.26,DOI 10.1007/s11265-017-1290-2,.

4.     Jiangyan Yi,Zhengqi Wen,Jianhua Tao,Hao Ni,Bin Liu, "CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition" Journal of Signal Processing Systems,2017.9.23,DOI 10.1007/s11265-017-1291-1,.

5.     Bin Liu,Jianhua Tao,Dawei Zhang,Yibin Zheng, "A Novel Pitch Extraction Based On Jointly Trained Deep BLSTM Recurrent Neural Networks With Bottleneck Features" ICASSP 2017,PP:336-340,March.5-9,2017,New Orleans,

6.     Jiangyan Yi,Jianghua Tao,Zhengqi Wen and Ya Li, "Distilling Knowledge from an Ensemble of Models for Punctuation Prediction" 18th Annual Conference of the Speech Communication Associatio(Interspeech 2017), PP:2779-2783,August 20-24, 2017, Sweden,Stockholm,

7.     Jian Huang,Ya Li, Jianhua Tao, Jiangyan YI, "Effect of Dimensional Emotion in Discrete Speech Emotion Classification" 2017 Affective Social Multimedia Computing( ASMMC 2017), Stockholm Sweden.

8.     Jian Huang,Ya Li,Jianhua Tao,Zhen Lian, Zhengqi wen, Minghao Ynag,Jiangyan YI, "Continuous Multimodal Emotion Prediction Based on Long Short Term Memory Recurrent Neural Network" The Audio/Visual Emotion Challenge and Workshop (AVEC 2017), PP:11-18,October 23,2017,Mountain View,CA,USA.

9.     Bocheng Zhao,Minghao Yang,Hang Pan,Qingjie Zhu,Jianghua Tao, "Nonrigid Point Matching of Chinese Characters for Robot Writing" IEEE Ro,PP:762-767, 2017.8.25,Macao,China, 

10.  Xiaoke Qi,JianhuaTao, "A Domain Knowledge-Assisted Nonlinear Model for Head-Related Transfer Functions Based on Bottleneck Deep Neural Network" 18th Annual Conference of the Speech Communication Associatio(Interspeech 2017),PP:3058-3062, Agu 21,2017,Stockholm, Sweden,

11.  Yibin Zheng,Jianhua Tao,Zhengqi Wen,Ya Li and Bin Liu, "Investigating Efficient Feature Representation Method and Training Object Function for BLSTM-based Phone Duration Prediction" 18th Annual Conference of the Speech Communication Associatio(Interspeech 2017), PP:784-788, Agu 21,2017, Stockholm,Sweden,

12.  Xiaoke Qi,JianhuaTao, "Distance-Dependent Modeling of Head-Related Transfer Functions Based on Spherical Fourier-Bessel Transform", NCMMSC, 2017.10.11-13

13.  Jiangyan Yi,Jianhua Tao,Zhengqi Wen,Ya Li and Hao Ni, "Acoustic Model Compression with Knowledge Transfer", NCMMSC, 2017.10. 11-13.

14.  Minghao Yang,Jinlin Jiang,Jianhua Tao,Kaihui Mu,Hao Li, "Emotional head motion predicting from prosodic and linguistic features" Multimedia Tools and Applications,2016,75(9):5125-5146..

15.  Ya Li*, Jianhua Tao, Linlin Chao,Wei Bao, Yazhu Liu, "CHEAVD: a Chinese natural emotional audio–visual database" Journal of Ambient Intelligence and Humanized Computing,DOI: 10.1007/s12652-016-0406-z..

16.  Bin Liu,Jianhua Tao,Zhengqi Wen,Fuyuan Mo, "Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement" Signal Processing Systems,2016, 82(2):141-150,.

17.  Hao Che,Ya Li,Jianhua Tao,Zhengqi Wen, "Investigating Effect of Rich Syntactic Features on Mandarin Prosodic Phrase Boundaries Prediction" Journal of Signal Processing Systems,2016, 82(2):263-271,.

18.  Dawei Zhang, Minghao Yang, Jianhua Tao, Yang Wang, Bin Liu, Danish Bukhari, "Extraction of Tongue Contour in Real-time Magnetic Resonance Imaging Sequences" 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016),PP:937-941,Mar.20-25,2016,Shanghai, China,

19.  Linlin Chao,Jianhua Tao,Minghao Yang,Ya Li,Zhengqi Wen, "Long short term memory recurrent neural network based encoding method for emotion recognition in video" 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2016),PP:2752-2756,Mar.20-25,2016,Shanghai, China,

20.  Bin Liu,Jianhua Tao, "A Novel Research to Artificial Bandwidth Extension Based on Deep BLSTM Recurrent Neural Networks and Exemplar-based Sparse Representation" 17th Annual Conference of the Speech Communication Associatio(Interspeech 2016),PP:3778-3782,Sept.8-12,2016,San Francisco,USA,

21.  Yibin Zheng,Ya Li,Zhengqi Wen, Xingguang Ding, Jianhua Tao, "Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approaches" 17th Annual Conference of the Speech Communication Associatio(Interspeech 2016),PP:3201-3205, Sep 8-12, 2016,San Francisco,USA,

22.  Xiaoke Qi, Jianhua Tao, "A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions" 17th Annual Conference of the Speech Communication Associatio(Interspeech 2016),PP:540-544,Sept.8-12,2016,San Francisco,USA,

23.  Zhengqi Wen, Ya Li and Jianhua Tao, "The Parameterized Phoneme Identity Feature as a Continuous Real-Valued Vector for Neural Network based Speech Synthesis" 17th Annual Conference of the Speech Communication Associatio(Interspeech 2016),PP:2248-2252,Sep 8-12, 2016,San Francisco,USA,

24.  Yibin Zheng,Zhengqi Wen,Bin Liu, Ya Li, Jianhua Tao, "An Initial Research: Towards Accurate Pitch Extraction for Speech Synthesis Based on BLSTM" 13th International Conference on Signal Processing ICSP2016,PP165-170,Nov 6-10,2016,Chengdou,China,

25.  Yibin Zheng,Ya Li,Zhengqi Wen, Bin Liu, Jianhua Tao, "Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin" 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016),Oct 17-20,2016,Tianjin,China,

26.  Yibin Zheng,Ya Li,Zhengqi Wen, Bin Liu, Jianhua Tao, "Text-based sentential stress prediction using continuous lexical embedding for Mandarin speech synthesis" 10th International Symposium on Chinese Spoken Language Processing (ISCSLP2016),Oct 17-20,2016,Tianjin,China,

27.  Jiangyan Yi, Hao Ni, Zhengqi Wen, Bin Liu, Jianhua Tao, "CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition" 10th International Symposium on Chinese Spoken Language ProcessingISCSLP 2016),Oct 17-20,2016,Tianjin,China,

28.  Ye Bai,Jiangyan Yi,Hao Ni,Zhengqi Wen,Bin Liu,Ya Li,Jianhua Tao, "End-to-end Keywords Spotting Based on Connectionist Temporal Classification for Mandarin" 10th International Symposium on Chinese Spoken Language ProcessingISCSLP 2016),Oct 17-20,2016,Tianjin,China,

29.  Hao Ni, Jiangyan Yi, Zhengqi Wen, Bin Liu, Jianhua Tao, "Improving Accented Mandarin Speech Recognition by Using Recurrent Neural Network based Language Model Adaptation" 10th International Symposium on Chinese Spoken Language ProcessingISCSLP 2016),Oct 17-20,2016,Tianjin,China,

30.  Zhengqi Wen,Kehuang Li, Zhen Huang, Jianhua Tao and Chin-Hui Lee, "Learning Auxiliary Categorization for Neural Network Based Speech Synthesis" 10th International Symposium on Chinese Spoken Language ProcessingISCSLP 2016),Oct 17-20,2016,Tianjin,China,

31.  Jiangyan Yi, Hao Ni, Zhengqi Wen, Jianhua Tao, "Improving BLSTM RNN Based Mandarin Speech Recognition Using Accent Dependent Bottleneck Features" 8th Asia-Pacific Signal and Information Processing Association(APSIPA 2016),Dec 12-16,2016,Jeju, Korea,

32.  Zhengqi Wen,Kehuang Li, Jianhua Tao and Chin-Hui Lee, "DEEP NEURAL NETWORK FOR VOICE CONVERSION WITH A SYNTHESIZED PARALLEL CORPUS" 8th Asia-Pacific Signal and Information Processing Association(APSIPA 2016),Dec 12-16,2016,Jeju, Korea,

33.  Renjun Tang,Ke Zhang,Ruoyang Nashen,Minghao Yang,Hui Zhou,Qingjie Zhu,Yongsong Zhan,Jianhua Tao, "Football News Generation from Chinese Live Webcast Script" The Fifth Conference on Natural Language Processing and Chinese Computing & The Twenty Fourth International Conference on Computer Processing of Oriental Languages(NLPCC-ICCPCOL 2016),Dec 2-6,2016,Kunming, China,

34.  Jianhua Tao, Yibin Zheng,Zhengqi Wen, Ya Li, Bin Liu, "A BLSTM Guided Unit Selection Synthesis System for Blizzard Challenge 2016" Blizzard Challenge 2016,Sep 16,2016,San Francisco,USA.

35.  Keikichi HiroseJianhua Tao, "Speech Prosody in Speech SynthesisModeling and generation of prosody for high quality and flexible speech synthesis" Springer,ISBN 978-3-662-45257-8,March 2015.

36.  Ya LiJianhua Tao , "Mandarin Stress Analysis and Prediction for Speech Synthesis " Springer,PP83-95.March 2015.

37.  Nick CampbellYa Li, , "Expressivity in Interactive Speech Synthesis; Some Paralinguistic and Nonlinguistic Issues of Speech Prosody for Conversational Dialogue Systems" Springer,PP96-107,March 2015.

38.  Minghao Yang,Jianhua Tao,Linlin Chao,Hao Li,Dawei Zhang,Hao Che,Tingli Gao,Bin Liu, "User behavior fusion in dialog management with multi-modal history cues" Multimedia Tools and Applications,Volume 74, Issue 22 (2015), Page 10025-10051,.

39.  Ya Li,Jianhua Tao,Hirose,K.,Xiaoying Xu,Wei Lai , "Hierarchical stress modeling and generation in mandarin for expressive Text-to-Speech" Speech Communication,Vol.72,pp.59-73,.

40.  Wang Xiaoyan,Yang Minghao,Xia Ming,Zhan Yongsong,Shi LihuiMa,Chuanyan Tao,Jianhua,Chen Shengyong, "Fast unsupervised texture segmentation using Texel similarity map" Journal Of Modern Optics,2015Volume: 62 Issue: 15 Page: 1211-1222,.

41.  Su-Jing Wang,Wen-Jing Yan,Xiaobai Li,Guoying Zhao,Chun-Guang Zhou,Xiaolan Fu,Minghao Yang,Jianhua Ta, "Micro-Expression Recognition Using Color Spaces" IEEE Transactions on Image Processing,Vol. 24, No. 12, PP:6034-6047,December 2015,.

42.  Bin Liu,Jianhua Tao,Zhengqi Wen,Ya Li,Danish Bukhari, "A Novel Method of Artificial Bandwidth Extension Using Deep Architecture" 16th Annual Conference of the Speech Communication Associatio(Interspeech 2015),PP:2598-2602,Sept.6-10,2015,Dresden,Germany,

43.  Hao Li,Jianhua Tao,Minghao Yang,Bin Liu, "Estimate Articulatory MRI Series From Acoustic Signal Using Deep Architecture " 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 2015),PP:4854-4858,Apr.19-24,2015,Brisbane,Australia,

44.  Hao Li,Jianhua Tao,Yang Wang, "Evaluation of Linear Regression for Speaker Adaptation in HMM-Based Articulatory Movements Estimation" 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 2015),PP:4944-4948,Apr.19-24,2015,Brisbane,Australia;,

45.  Linlin Chao,Jianhua Tao,Minghao Yang,Ya Li, "Multi Task Sequence Learning for Depression Scale Prediction from Video" 6th Affecive Computing and Intelligent Interaction(ACII 2015),PP:525-531,Sep.21-24,2015,Xian,China,

46.  Ya Li,Yazhu Liu,Wei Bao, Linlin Chao,Jianhua Tao, "From Simulated Speech to Natural Speech, What are the Robust Features for Emotion Recognition?" 6th Affecive Computing and Intelligent Interaction(ACII 2015),PP:368-373,Sep.21-24,2015,Xian,China,

47.  Ya Li,Nick Campbell,Jianhua Tao, "VOICE QUALITY: NOT ONLY ABOUT “YOU” BUT ALSO ABOUT “YOUR INTERLOCUTOR”" 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing(ICASSP 2015),PP:4739-4743,Apr.19-24,2015,Brisbane,Australia; ,

48.  Yang Wang,Minghao Yang,Zhengqi Wen,Jianhua Tao, "Combining Extreme Learning Machine and Decision Tree for Duration Prediction in HMM based Speech Synthesis" 16th Annual Conference of the Speech Communication Associatio(Interspeech 2015),PP:2197-2201,Sept.6-10,2015,Dresden,Germany,EI ,.

49.  Linlin Chao,Jianhua Tao,Minghao Yang,Ya Li,Zhengqi Wen, "Long Short Term Memory Recurrent Neural Network based Multimodal Dimensional Emotion Recognition" Audio/Visual Emotion Challenge (AVEC2015),PP:65-72,Oct.26,2015,Brisbane,Australia,

50.  Zhengqi Wen,Jianhua Tao,Shifeng Pan,Yang Wang, "Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis" Journal of Signal Processing SystemsMarch 2014,Volume 74,Issue 3,PP423-435.

51.  Ya Li, Jianhua Tao, Keikichi Hirose,Wei Lai,Xiaoying Xu, "Hierarchical stress generation with Fujisaki model in expressive speech synthesis" SPEECH PROSODY 2014,May 20-23,PP:1032-1036,Dublin Ireland.

52.  Shanfeng Liu, Zhengqi Wen, Jianhua Tao, Ya Li, Yongguo Kang, "A Data Driven Method for Target and Concatenation Cost Calculation with KL-Divergence in Mandarin Hybrid Speech Synthesis" The 12th IEEE International Conference on Signal ProcessingPP:572-576,Oct 19-23,2014HangZhou, China

53.  Hao che, Jianhua Tao, Ya LiZhengqi Wen, "Investigating Effect of Rich Syntactic Features on Mandarin Prosodic Phrase Boundaries Prediction" The 9th International Symposium on Chinese Spoken Language Processing2014 ISCSLP),PP:501-505,Sept 12-14Singapore

54.  Ran Zhang , Zhengqi Wen , Jianhua Tao , Ya Li , Bing Liu, Xiaoyan Lou, "A Hierarchical Viterbi Algorithm for Mandarin Hybrid Speech Synthesis System" 2014 InterspeechPP:795-799Sept 14-18Singapore

55.  Hao Che, Jianhua Tao, Ya Li, "Improving Mandarin Prosodic Boundary Prediction with Rich Syntactic Features" 2014 InterspeechPP:46-50Sept 14-19Singapore

56.  Bin Liu, Jianhua Tao, Fuyuan Mo, Ya Li, Zhengqi Wen, Shanfeng Liu, "Efficient Voice Activity Detection Algorithm based on Sub-band Temporal Envelope and Sub-band Long-term Signal Variability" The 9th International Symposium on Chinese Spoken Language Processing2014 ISCSLP),pp.531-535,Sep 12-14, 2014Singapore

57.  Linlin Chao, Jianhua Tao, Minghao Yang, Ya Li, "Improving Generation Performance of Speech Emotion Recognition by Denoising Autoencoders" The 9th International Symposium on Chinese Spoken Language Processing2014 ISCSLP),PP:341-344,Sept12-14,2014Singapore

58.  Linlin Chao,Jianhua Tao,Minghao Yang,Ya li,Zhengqi Wen, "Multi-scale Temporal Modeling by Neural Networks for Dimensional Emotion Recognition in Video" AVEC ACM Multimedia 2014PP:11-18Nov 2-7NY, USA

59.  Shanfeng Liu, Zhengqi Wen, Ya Li, Jianhua Tao, Bin Liu, "Context Features Based Pre-Selection and Weight Prediction in Concatenation Speech Synthesis System" The 9th International Symposium on Chinese Spoken Language Processing2014 ISCSLP),PP:506-510,Sept12-14,2014Singapore

60.  Bin Liu, Fuyuan Mo, Jianhua Tao, "Speech enhancement based on analysis-synthesis framework with improved pitch estimation and spectral envelope enhancement" 2014 International Conference on Signal ProcessingICSP 2014),pp.461-467,Oct 19-23, Hangzhou, China

61.  Yang Wang, Jianhua Tao, "Evaluation of Parameter Generation Using High Order Dynamic Features and Long Span Windows for HMM based Speech Synthesis" The 9th International Symposium on Chinese Spoken Language Processing2014 ISCSLP),pp.516-520,Sep 12-14, 2014Singapore

62.  Xiaoying Xu,Huimin Wang,Ya Li,Wei Lai,Jianhua Tao, "The Expression Of Emotions by Text and Speech" The 9th International Symposium on Chinese Spoken Language Processing2014 ISCSLP),Sep 12-14, 2014Singapore

63.  Hao Li, Minghao Yang, Jianhua Tao, "TONGUE SHAPE CONVERSION WITH NON-PARALLEL TRAINING DATA" 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (2014 ICASSP)PP: 2569-2572,May 4-9,2015,Florence,Italy

64.  Ran Zhang,Jianhua Tao,Ya Li,Zhengqi Wen, "A NOVEL HYBRID MANDARIN SPEECH SYNTHESIS SYSTEM USING DIFFERENT BASE UNITS FOR MODEL TRAINING AND CONCATENATION" 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (2014 ICASSP),May 4-9,2014,PP: 295-299,Florence,Italy.

65.  Wei Bao,Ya Li, Mingliang Gu, Jianhua Tao, Linlin Chao, Shanfeng Liu, "Combining Prosodic and Spectral Features for Mandarin Intonation Recognition" The 9th International Symposium on Chinese Spoken Language Processing2014 ISCSLP),pp.497-500,September 12 - 14,2014Singpore

66.  Wei Bao, Ya Li, Mingliang Gu, Minghao Yang, Hao Li, Linlin Chao, Jianhua Tao, "Building a Chinese Natural Emotional Audio-visual Database" 2014 International Conference on Signal ProcessingICSP 2014),pp.583-587Oct 19-23HangzhouChina

67.  Wei Lai,Ya Li,Hao Che,Shanfeng Liu,Jianhua Tao,Xiaoying Xu, "Final Lowering Effect in Questions and Statements of Chinese Mandarin Based on a Large-scale Natural Dialogue Corpus Analysis" SPEECH PROSODY 2014,May 20-23,PP:653-657,Dublin Ireland.

68.  Wei Lai, Ya Li, Hao Che, Shanfeng Liu, Jianhua Tao, "PHONOLOGICAL INFLUENCES ON THE REALIZATION OF FINAL LOWERING" The 17th Oriental COCOSDAPP:83-88,Sept 10-12Phuket, Thailand

69.  Minghao Yang, Jianhua Tao, Dawei Zhang, "Extraction of Tongue Contour in X-ray Videos" 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2013) ,May 26 - 31, 2013, Vancouver, Canada, PP.1094-1098.

70.  Hao Li, Minghao Yang, Jianhua Tao, "Speaker-Independent Lips and Tongue Visualization of vowels" 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2013), May.26 - 31, 2013,Vancouver, Canada,PP.8106-8110.

71.  Linlin Chao, Jianhua Tao, Minghao Yang, "Combining Emotional History Through Multimodal Fusion Methods" Asia Pacific Signal and Information Processing Association(APSIPA 2013), Oct.29-Nov.1 2013, Taiwan, China.

72.  Linlin Chao, Jianhua Tao, Minghao Yang, "Bayesian Inference based Temporal Modeling for Naturalistic Affective Expression Classification" The 5th International Conference on Affective Computing and Intelligent Interaction (ACII2013),Sep.2-5, 2013, Geneva, Switzerland.

73.  Yang Wang, Jianhua Tao, Minghao Yang, Ya Li, "Extended Decision Tree with OR Relationship for HMM-based Speech Synthesis" The 2nd Asian Conference on Pattern Recognition (ACPR2013), Nov.5-8, 2013, pp.225-229, Japan.

74.  Ran Zhang, Jianhua Tao, Ya Li, Zhengqi Wen, "A Novel Unit Selection Method for Concatenation Speech System Using Similarity Measure" 16th International Oriental COCOSDA Conference, Oct.25-28, 2013, Gurgaon, India.

75.  Chehao,Jianhua Tao, "Stress Prediction for Mandarin Text-to-Speech System Using Discourse Context Feature" 16th International Oriental COCOSDA Conference, Nov.25-28, 2013, Gurgaon, India.

76.  Xiaoying Xu , Jianhua Tao, Ya Li, "On Constructing a Chinese Task-oriental Subjectivity Lexicon" The 14th Chinese Lexical Semantics Workshop, May 10-13, 2013, Zhengzhou.

77.  Minghao Yang, Jianhua Tao, Kaihui Mu, Ya Li, Jianfeng Che, "A Multimodal Approach of Generating 3D Human-like Talking Agent" Journal on Multimodal User Interfaces, 2012, 5(1), pp: 61-68.

78.  Xiaoying Xu, Ya Li, Jianhua Tao, Xuefei Liu, "Automatic Parsing of the Metaphor Polarity for Opinion" O-COCOSDA 2012, Oral, 2012.12,P13-17, Macau, China.

79.  Ya Li, Xuefei Liu, Xiaoying Xu, Jianhua Tao, "Assign Stress for Interrogative Sentences via Syntax Structure Mapping" Speech Prosody 2012,Oral, 2012.05,P167-170,Shanghai.

80.  Che Hao,Jianhua Tao,Sifeng pan, "Letter-to-Sound Conversion Using Coupled Hidden Markov Models for Lexicon Compression" 2012 The International Committee for the Co-ordination and Standardization of Speech Databases and Assessment, Oral,2012.12,Macao,P141-144.

81.  Zhegnqi Wen, Jianhua Tao and Che Hao, "Statistical Modification based Post-Filtering Technique for HMM-based Speech Synthesis" The 8th International Symposium on Chinese Spoken Language Processing, Poster, 2012.12,HongKong, pp.146-149.

82.  Zhengqi Wen, Jianhua Tao and Horst-Ud0 Hain, "Pitch-Scaled Spectrum based Excitation Model for HMM-based Speech Synthesis" IEEE 11th International Conference on Signal Processing, Oral,2012.10,Beijing,pp.609-612.

83.  Zhengqi Wen, and Jianhua Tao, "Prosody Modification for Vocoder Based on Amplitude Spectrum" Speech Prosody,6th International Conference,Poster,2012.05,Shanghai,pp.11-14.

84.  Zhengqi Wen and Jianhua Tao, "Amplitude Spectrum based Excitation Model for HMM-based Speech Synthesis" 13th Annual Conference of the International Speech Communication Association, Oral,2012.09,USA.

85.  Zhengqi Wen, Hideki Kawahara and Jianhua Tao, "Pitch-Scaled Analysis based Residual Reconstruction for Speech Analysis and Synthesis" 13th Annual Conference of the International Speech Communication Association, Oral,2012.09,USA.

86.  Minghao Yang;Jianhua Tao;Hao Li;Mu Kai Hui, "Multimodal Emotion Estimation and Emotional Synthesize for Interaction Virtual Agent" IEEE CCIS 2012, Oral, 2012, Vol 1, Hangzhou, pp.239-244.

87.  Jianhua Tao, Shifeng Pan, Minghao Yang, Ya Li, Kaihui Mu and Jianfeng Che, "Utterance independent bimodal emotion recognition in spontaneous communication" Tao et al. EURASIP Journal on Advances in Signal Processing 2011, 2011.4.

88.  Xiaoying Xu, Ya Li,Jianhua Tao, Yingchao Lu, "The Stability Analysis of Disyllabic Stress in Mandarin Speech" The 17th International Congress of Phonetic ences, ICPhS2011,2011.8,Hongkong.

89.  Zhengqi WenJianhua Tao, "An Excitation Model Based on Inverse Filtering for Speech Analysis and Synthesis" 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011.9, Beijing.

90.  Zhengqi WenJianhua Tao, "Inverse Filtering Based Harmonic plus Noise Excitation Model for HMM-based Speech Synthesis" 12thAnnual Conference of the International Speech Communication Association, Interspeech 201112thAnnual Conference of the International Speech Communication Association, Interspeech 2011.

91.  Jianhua TaoShifeng Pan, Yoshihiko Nankaku, Keichi Tokuda, "Global Variance Modeling on Frequency Domain Delta LSP for HMM-based Speech Synthesis" The 35th International Conference on Acoustics, Speech, and Signal ProcessingICASSP2011, 2011.5, Prague, Czech, pp. 4716-4719.

92.  Shifeng Pan, Jianhua Tao, Ya Li, "The CASIA Audio Emotion Recognition Method for Audio/Visual Emotion Challenge 2011" The 4rd International Conference on Affective Computing and Intelligent Interaction, ACII2011, 2011.8, Memphis, USA, pp.388-395.

93.  Shifeng Pan, Jianhua Tao, Yang Wang, "A State Duration Generation Algorithm Considering Global Variance for HMM-based Speech Synthesis" Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011, 2011.10, Xi'an.

94.  Ya Li, Jianhua Tao, Xiaoying Xu, "Hierarchical Stress Modeling in Mandarin Text-to-Speech" 12th Annual Conference of the International Speech Communication Association, Interspeech 2011, 2011.8, Florence, Italy, pp. 2013-2016.

95.  Minghao Yang, Jianhua Tao, Lihui Shi,Kaihui Mu, Jianfeng Che, "An Outlier Rejection Scheme For Optical Flow Tracking" 21th IEEE International Workshop on Machine Learning for Signal Processing, Beijing, China.

96.  Kaihui Mu, Jianhua Tao, Minghao Yang, "Animating A Chinese Interactive Virtual Character" 21th IEEE International Workshop on Machine Learning for Signal Processing, Beijing, China.

97.  Jianhua Tao, Meng Zhang, Jani Nurminen, Jilei Tian, Xia Wang, "Supervisory Data Alignment for Text-independent Voice Conversion" IEEE Transactions on Audio, Speech and Language Processing (IEEE Trans. ASLP), Vol. 18, No. 5, July 2010, pp 932-943 .

98.  Shifeng Pan, Meng Zhang, Jianhua Tao, "A Novel Hybrid Approach for Mandarin Speech Synthesis" Interspeech2010, Japan, Sep. 2010, pp 182-185 .

99.  Ya Li, Jianhua Tao, Meng Zhang, Shifeng Pan, Xiaoying Xu, "Text-based Unstressed Syllable Prediction in Mandarin" Interspeech2010, Japan, Sep. 2010, pp 1752-1755 .

100.         Ya Li, Shifeng Pan, Jianhua Tao, "HMM-based Speech Synthesis with a Flexible Mandarin Stress Adaptation Model" International Conference on Signal Processing (ICSP2010), Oct. 2010, pp 625-628 .

101.         Kaihui Mu, Jianhua Tao, Jianfeng Che, Minghao Yang, "Mood Avatar: Automatic Text-Driven Head Motion Synthesis" International Conference on Multimodal Interfaces (ICMI2010), Nov. 2010 .

102.         Kaihui Mu, Jianhua Tao, Jianfeng che, Minghao Yang, "Real-Time Speech-Driven Lip Synchronization" 4th International Universal Communication Symposium (IUCS2010), Oct. 2010, pp 377-381 .

103.         Jianhua Tao, Kaihui Mu, Jianfeng Che, Ya Li, Zhengqi Wen, Shifeng Pan, Lixing Huang, Le Xin, "Audio-Visual Based Emotion Recognition with the Balance of Dominances" International Conference on Artificial Intelligence (ICAI1010), Oct. 2010, pp 100-110 .

104.         Jianfeng Che, Jianhua Tao, Xingang Wang, Kaihui Mu, Hongtao Li, "Feature-based Multi-style Cartoon System" International Conference on Audio,Language and Image Processing (ICALIP2010), Nov. 2010 .

105.         Xiaoying Xu, Jianhua Tao, Ling Zhang,Yingchao Lu, "The Duration Analysis of the Checked Tone in Cantonese Speech" International Symposium on Chinese Spoken Language Processing (ISCSLP2010), Nov. 2010, pp 459-464 .

106.         Peter Khooshabeh, Jonathan Gratch, Lixing Haung, Jianhua Tao, "Does culture affect the perception of emotion in virtual faces?" 7th Symposium on Applied Perception in Graphics and Visualization (APGV '10), July 2010, pp 165 .

107.         Jian Yu, Jianhua Tao, "A Novel Prosody Adaptation Method for Mandarin Concatenation Based Text-to-Speech System"  Journal of Acoustical ence and Technology, Vol. 30, No.1, January 2009, pp.33-41.

108.         Jianhua Tao, Le Xin, Panrong Yin, "Realistic Visual Speech Synthesis based on Hybrid Concatenation Method"  IEEE Transactions on Audio, Speech and Language Processing (IEEE Trans. ASLP), Vol. 17, No. 3, March 2009, pp 469-477.

109.         Meng Zhang, Jianhua Tao, Jani Nurminen, Jilei Tian, Xia Wang, "Phoneme Cluster Based State Mapping for Text-Independent Voice Conversion"  34th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP2009, April 2009, Taipei, China.

110.         Jianhua Tao, Ya Li, Shifeng Pan, "A Multiple Perception Model on Emotional Speech"   The 3rd International Conference on Affective Computing and Intelligent Interaction, ACII09, Sep. 2009, Amsterdam, Netherlands.

111.         Xiaoying Xu, Ya Li, Liping Hu, Jianhua Tao, "Categorizing Terms’ Subjectivity and Polarity Manually for Opinion Mining in Chinese"  The 3rd International Conference on Affective Computing and Intelligent Interaction, ACII09, Sep. 2009, Amsterdam, Netherlands.

112.         Hongjun Sun, Jianhua Tao, Huibin Jia, "Dimension Reducing of LSF parameters Based on Radial Basis Function Neural Network"   INTERSPEECH 2009, Sep. 2009, Brighton, UK.

113.         Huibin Jia, Jianhua Tao, "Prosody Modeling for Mandarin Exclamatory Speech"   2009 IEEE International Conference on Multimedia and Expo ICME 2009, June 2009, New York, US.

114.         Jianhua Tao, Fang Zheng, Aijun Li, Ya Li, "Advances in Chinese Natural Language Processing and language resources"   O-COCOSDA2009, Aug. 2009, Xinjiang, China.

115.         Jianhua Tao, Tieniu Tan (Eds), "Affective Information Processing"  UK: Springer Book,354 pages, Nov. 2008, ISBN:978-1-84800-305-7.

116.         Meng Zhang, Jianhua Tao, Xia Wang, Jilei Tian, "Text-Independent Voice Conversion based on State Mapped Codebook"  33rd IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP2008, March 2008, Las Vegas, US, pp.4605-4608 .

117.         Fangzhou Liu, Jianhua Tao, Qing Shi, "Tree-Guided Transformation-Based Homograph Disambiguation in Mandarin TTS System"  33rd IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP2008, March 2008, Las Vegas, US, pp.2657-4660 .

118.         Huibin Jia Jianhua Tao, Xia Wang, "Prosody Variation: Application to Automatic Prosody Evaluation of Mandarin Speech"   Speech Prosody 2008, May. 2008, Campinas, Brazil.

119.         Jianhua Tao, Jian Yu, Lixing Huang, Fangzhou Liu, Huibin Jia, Meng Zhang, "The WISTON Text to Speech System for Blizzard 2008"  The Blizzard Challenge 2008 workshop, Oct.2008.

120.         Jianhua Tao, Fangzhou Liu, Meng Zhang, Huibin Jia, "Design of Speech Corpus for Mandarin Text to Speech"  The Blizzard Challenge 2008 workshop, Oct.2008.

121.         Meng Zhang, Jianhua Tao, Huibin Jia, Xia Wang, "Improving HMM Based Speech Synthesis by Reducing Over-Smoothing Problems"  The 6th International Symposium on Chinese Spoken Language Processing, ISCSLP2008, Dec, 2008. Kunming, pp17-20.

122.         Yi Zhang, Jianhua Tao, "Prosody Modification on Mixed-Language Speech Synthesis"   The 6th International Symposium on Chinese Spoken Language Processing, ISCSLP2008, Dec, 2008. Kunming, pp253-256 .

123.         Fangzhou Liu, Huibin Jia, Jianhua Tao, "A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling In Mandarin"   The 6th International Symposium on Chinese Spoken Language Processing, ISCSLP2008, Dec, 2008. Kunming, pp257-260 .

124.         Zhe Zhang, Lixing Huang, Jianhua Tao, "Unit Feature Based Pruning of Large-Scale Speech Corpus Using Decision Tree"  International Conference on Signal Processing, ICSP2008, Oct.2008, Beijing, pp719-722 .

125.         Xiaoyin Xu, Jianhua Tao, "Categorizing Emotional Vocabularies in Chinese Natural Language Communication" O-COCOSDA2008, Nov.2008, Nara, Japan .

126.         Mingyu You, Guo-Zheng Li, Luonan Chen, Jianhua Tao, "A Novel Classifier Based on Enhanced Lipschitz Embedding for Speech Emotion Recognition" 4th International Conference on Intelligent Computing, ICIC (1) 2008, Shanghai, pp:482-490.

127.         Jianhua Tao, Panrong Yin, "Speech Driven Face Animation Based on Dynamic Concatenation Model" International Journal of Information and Computational ence, v 4, n 1, March, 2007, p 271-280.

128.         Jian Yu, Meng Zhang, Jianhua Tao, Xia Wang, "A novel hmm-based TTS system using both continuous HMMs and discrete HMMs" 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP2007, Hawaii .

129.         Mingyu You, Chun Chen, Jiajun Bu, Jia Liu, Jianhua Tao, "Manifolds based Emotion Recognition in Speech" International Journal of Computational Linguistics & Chinese Language Processing, Vol. 12, No. 1, March 2007, pp49-64 .

130.         Jian Yu, Lixing Huang, Jianhua Tao, Xia Wang, "Modeling Incompletion Phenomenon in Mandarin Dialog Prosody" Interspeech2007,August 2007, Antrewp.

131.         Huibin Jia Jianhua Tao, "Automatic Prosody Quality Evaluation of Mandarin Speech" O-COCOSDA2007, Dec. 2007, Hanoi, Vietnam.

132.         Xia Wang, Aijun Li, Jianhua Tao, "An Expressive Speech Corpus of Standard Chinese" O-COCOSDA2007, Dec. 2007, Hanoi, Vietnam .

133.         Panrong Yin, Liyue Zhao, Lixing Huang, Jianhua Tao, "Expressive Face Animation Synthesis based on Dynamic Mapping Method" the 2nd International Conference on Affective Computing and Intelligent Interaction, ACII2007, Sep.2007, Lisbon.

134.         Lixing Huang, Le Xin, Liyue Zhao, Mi Zhou, Jianhua Tao, "Combining Audio and Video by Dominance in Bimodal Emotion Recognition"  the 2nd International Conference on Affective Computing and Intelligent Interaction, ACII2007, Sep.2007, Lisbon .

135.         Marc Schr?der, Laurence Devillers, Kostas Karpouzis, Jean-Claude Martin, Catherine Pelachaud, Chris, "What Should a Generic Emotion Markup Language Be Able to Represent?, " the 2nd International Conference on Affective Computing and Intelligent Interaction, ACII2007, Sep.2007, Lisbon.

136.         Liyue Zhao, Jianhua Tao, "Fast Facial Feature Tracking with Multi-Cue Particle Filter" Image and Vision Computing New Zealand, IVCNZ2007, Hamilton, New Zealand .

137.         Le Xin, Jianhua Tao, Tieniu Tan, "Dynamic Audio-Visual Mapping using Fused Hidden Markov Model Inversion Method" IEEE International Conference on Image Processing, ICIP2007, Sep.2007, pp293-296.

138.         Zhiming Wang, Jianhua Tao, "Reconstruction of Partially Occluded Face by Fast Recursive PCA" ,International Conference on Computational Intelligence and Security, Dec. 2007, Harbin .

139.         Zhiming Wang, Jianhua Tao, , "Remove Unknown Face Occlusion by Fuzzy Principal Component Analysis" CCPR2007, Beijing.

140.         Jia Liu, Chun Chen, Jiajun Bu, Mingyu You, Jianhua Tao, "Speech Emotion Recognition Based on a Fusion of All-Class and Pairwise-Class Feature Selection"  International Conference on Computational ence (1) 2007: 168-175.

141.         Jianhua Tao, Yongguo Kang, Aijun Li, "Prosody conversion from neutral speech to emotional speech" IEEE Transactions on Audio, Speech, and Language Processing (IEEE Trans. ASLP), Vol. 14, No. 4, July 2006, pp1145-1154.

142.         Jian Yu, Wanzhi Zhang and Jianhua Tao, "A new pitch generation model based on internal dependence of pitch contour for mandarin TTS system" ICASSP 2006, Toulouse, France .

143.         Yongguo Kang, Jianhua Tao, Bo Xu, "Applying Pitch Target Model to Convert F0 Contour for Expressive Mandarin Speech Synthesis " ICASSP 2006, Toulouse,France .

144.         Mingyu You, ChunChen, JiajunBu, JiaLiu,JianhuaTao , "Emotion recognition from noisy speech"  IEEE International Conference on Multimedia and Expo, ICME2006, Canada.

145.         Mingyu You, ChunChen, JiajunBu, JiaLiu, JianhuaTao, "Emotional Speech Analysis on onlinear Manifold" International Conference on Pattern Recognition, ICPR2006, Hongkong.

146.         Donghui Dong, Jianhua Tao, Bo Xu, "Prosodic Word Prediction using a Maximum Entropy Approach" International Symposium on Chinese Spoken Language Processing, Lecture Notes of Computer ence, ISCSLP2006, Singapore.

147.         Jian Yu, Jianhua Tao, "Pitch Prediction for Mandarin TTS with Mutual Prosodic Constraint" International Symposium on Chinese Spoken Language Processing, ISCSLP2006, Singapore.

148.         Jianhua Tao, Lixing Huang, Yongguo Kang, Jian Yu, "The Friendliness Perception of Dialogue Speech"  2006 International Conference on Speech Prosody, May 2006, Germany.

149.         Min Chu, Honghui Dong, Jianhua Tao, "A Perceptual Study on Variability in Break Allocation within Chinese Sentences" 2006 International Conference on Speech Prosody, May 2006, Germany.

150.         Nick Campbell, Laurence Devillers, Ellen Douglas-Cowie, Veronique Auberge, Anton Batliner, and Jian, "Resources for the Processing of Affect in Interactions" LREC2006, May 2006, Italy.

151.         Yanhong Wu,Jianhua Tao, Jilun Lu, "The Design of Corpus for Interrogative Speech Synthesis" The 2006 Oriental COCOSDA, December 2006, MALAYSIA .

152.         Wang Zhiming, Tao Jianhua, "A fast implementation of adaptive histogram equalization" the 8th International Conference on Signal Processing, December 2006, Guilin, pp1330-1334 .

153.         Ren-Hua Wang, Sin-Horng Chen, Jianhua Tao, Min Chu, "MANDARIN TEXT-TO-SPEECH SYNTHESIS" Chapter 5 of the book Advanced Chinese Spoken Language Processing, published in December 2006 .

154.         Hsiao-Chuan Wang, Thomas Fang Zheng, and Jianhua Tao, "CSLP CORPORA AND LANGUAGE RESOURCES" Chapter 23 of the book Advanced Chinese Spoken Language Processing, published in December 2006.

155.         Jian Yu, Jianhua Tao, "The Pause Duration Prediction for Mandarin Text-to-Speech System"  2005 IEEE International Conference on Natural Language Processing and Knowledge ngineering (IEEE NLP-KE 2005),Wuhan,China,pp.204-208,2005, LSBN:0-7803-9361-9 .

156.         Panrong Yin, Jianhua Tao, "Dynamic mapping method based speech driven face animation system" The First International Conference on Affective Computing & Intelligent Interaction (ACII2005),Beijing,China, pp.755-763,2005,ISSN 0302-9743  .

157.         Yonglin Li, Jianhua Tao, "Personalized Facial Animation Based on 3D Model Fitting from Two Orthogonal Face Images" the first International Conference on Affective Computing and Intelligent Interaction ,Beijing,China,pp.996-1003,2005, ISSN 0302-9743 .

158.         Yongguo Kang, Zhiwei Shuang ,Jianhua Tao, "A hybrid GMM and codebook mapping method for spectral conversion" The First International Conference on Affective Computing & Intelligent Interaction (ACII2005),Beijing,China,pp.303-310,2005, ISSN 0302-9743 .

159.         Le Xin,Qiang Wang, Jianhua Tao, "Automatic 3D Face Modeling from Video" Proc. of the Tenth IEEE International conference on computer vision (ICCV2005)Beijing China ,Vol.2.

160.         Honghui Dong, Jianhua Tao, Bo Xu, "Chinese Prosodic Phrasing with a Constraint-based Approach" INTERSPEECH 2005-EUROSPEECHLisbon, Portugal .

161.         Honghui Dong, Jianhua Tao,Bo Xu, "Prosodic Word Prediction Using the Lexical Information" 2005 IEEE International Conference on Natural Language Processing and Knowledge EngineeringWuhan, China,pp.189-193,2005,LSBN:0-7803-9361-9 .

162.         Honghui Dong, Jianhua Tao, "Length Optimized Chinese Prosodic Phrasing Model" Proceedings of the International Conference on Chinese Computing 2005, pp.48-53,2005, Singapore .

163.         Jianhua Tao,Yongguo Kang, "Features Importance Analysis for Emotional Speech Classification" The First International Conference on Affective Computing & Intelligent Interaction (ACII2005), Beijing,China,pp.449-457,2005 ,ISSN 0302-9743 .

164.         Jianhua Tao,Tieniu Tan, "Affective Computing:A review" The First International Conference on Affective Computing & Intelligent Interaction (ACII2005),Beijing,China,pp.981-995,2005, ISSN 0302-9743 .

165.         Jianhua Tao,jianyu Yongguo Kang, "An Expressive Mandarin Speech Corpus" The International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques, O-COCOSDA2005, Bali Island, Indonesia,2005 .

166.         Jianhua Tao and Tieniu Tan, "Emotional Chinese Talking Head System"  ICMI 200410pages:273-280  .

167.         Jianhua Tao, "Context Based Emotion Detection from Text Input" 8th International Conference on Spoken Language Processing, ICSLP2004, Jeju, 4-8,Oct.2004,pages:1337-1340 .

168.         Jianhua Tao, "Rhymth Correlation of Speech Synthesis System" ISCSLP2004, pages:221-224 .

169.         Jianhua Tao, "Acoustic and Linguistic Information Based Chinese Prosodic Boundary Labelling" , "" , Lecture Notes of Artificial Intelligence, Springer, 2004,9 .

170.         Jianhua Tao and Yongguo Kang,, "Multi-Source Based Acoustic Model for Speech Synthesis"  ICSP2004 Vol.I of III,pagese:621-625  .

171.         Honghui Dong, Jianhua Tao , Bo Xu, "Grapheme-to-Phoneme Conversion in Chinese TTS System" ISCSLP2004,pages:165-168.

172.         Yongguo Kang, Jianhua Tao, Bo Xu, "A New Multicomponent AM-FM Demodulation with Predicting Frequency Boundaries and Its Application to Formant Estimation" Interspeech2004, Oct. 2004, Jeju, pp 1105-1108.

 

发表著作
Jianhua Tao, Tieniu Tan (Eds), Affective Information Processing, UK: Springer Book,354 pages, Nov. 2008, ISBN:978-1-84800-305-7

科研活动

先后负责20余项国家科研项目,包括:

  • 国家重点研发计划:基于云计算的移动办公智能交互技术与系统

  • 国家重点研发计划:面向移动终端的多模态自然交互技术

  • 中科院B类先导项目:类脑计算芯片与智能系统

  • 中科院C类先导项目:大数据分析

  • 国家杰出青年科学基金项目:多通道融合的言语分析与生成理论和方法研究

  • 国家发改委专项:音视频内容分析

  • 863重点项目:多模态自然交互技术

  • 国家自然科学基金重点项目:连续情感状态的语音情感识别技术