张学良,工学博士,内蒙古大学计算机学院教授,博士生导师
English version
联系方式:
E-mail: CSZXL [AT] imu [dot] edu [dot] cn; zhangxueliang [AT] imudges [dot] com.
地址:内蒙古呼和浩特市大学西路235号,计算机学院
邮编:010021
个人经历:
l 1999年-2003年: 内蒙古大学,计算机学院,信息管理与信息系统,获学士学位;
l 2003年-2005年: 哈尔滨工业大学,计算机科学与技术系,人工智能与图像处理,获硕士学位;
l 2006年-2010年: 中国科学院自动化研究所,模式识别国家重点实验室语音信号处理,获博士学位;
l 2010年-2013年;内蒙古大学,计算机学院讲师,硕士生导师;
l 2013年-2017年;内蒙古大学,计算机学院副教授,博士生导师;
l 2018年-至 今;内蒙古大学,计算机学院教授,博士生导师;
l 2015年-2016年: 美国俄亥俄州立大学访问学者;
研究领域:
语音信号处理,计算听觉场景分析,语音合成,语音分离
论文:
Journal:
1. Chenggang Zhang, Jiujiang Liu, Hao Li and Xueliang Zhang, "Neural Multi-Channel and Multi-Microphone Acousticc Echo Cancellation," in IEEE/ACM Transactions on Audio, Speech and Language Processing, 2023. (Top journal, CCF rank B, SCI IF 5.4).
2. Kanghao Zhang, Shulin He, Hao Li and Xueliang Zhang, "A Dual-branch Convolutional Network Architecture Processing on both Frequency and Time Domain for Single-channel Speech Enhancement", APSIPA Transactions on Signal and Information Processing, 2023.
3. Heming Wang, Xueliang Zhang, DeLiang Wang, "Fusing Bone-Conduction and Air-Conduction Sensors for Complex-Domain Speech Enhancement," IEEE/ACM transactions on audio, speech, and language processing, 2022, 30: 3134-3143. (Top journal, CCF rank B, SCI IF 5.4).
4. Hao Li, DeLiang Wang, Xueliang Zhang and Guanglai Gao, "Recurrent Neural Networks and Acoustic Features for Frame-level Signal-to-noise Ratio Estimation", IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021. (Top journal, CCF rank B, SCI IF 5.4).
5. Ke Tan, Xueliang Zhang and DeLiang Wang, "Deep Learning Based Real-Time Speech Enhancement for Dual-Microphone Mobile Phones", IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021. (Top journal, CCF rank B, SCI IF 5.4).
6. Zhihao Du, Xueliang Zhang and Jiqing Han, “A Joint Framework of Denoising Autoencoder and Generative Vocoder for Monaural Speech Enhancement”, in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020. (Top journal, CCF rank B, SCI IF 5.4).
7. Zhong-Qiu Wang, Xueliang Zhang and DeLiang Wang, "Robust Speaker Localization Guided by Deep Learning-Based Time-Frequency Masking," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(1), pp. 178-188, 2019. (Top journal, CCF rank B, SCI IF 5.4).
8. Shuai Nie, Shan Liang, Wenju Liu, Xueliang Zhang and Jianhua Tao, "Deep Learning Based Speech Separation via NMF-Style Reconstructions," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(11), pp. 2043-2055, 2018. (Top journal, CCF rank B, SCI IF 5.4).
9. Xueliang Zhang and DeLiang Wang, “Deep Learning Based Binaural Speech Separation in Reverberant Environments,” IEEE/ACM Trans Audio, Speech and Language Processing 25(5): 1075-1084, 2017. (Top journal, CCF rank B, SCI IF 5.4).
10. 张晖,苏红,张学良,高光来,《基于卷积神经网络的鲁棒性基音检测方法》,自动化学报,42(6): 959-964,2016.
11. 聂帅,刘文举,梁山,张学良,《基于深度学习语音分离技术的研究现状与进展》,自动化学报,42(6): 814-833,2016.
12. Xueliang Zhang, Hui Zhang, Shuai Nie, Guanglai Gao and Wenju Liu. "A Pairwise Algorithm Using Deep Stacking Network for Speech Separation and Pitch Estimation," IEEE/ACM Trans. Audio, Speech and Language Processing 24(6): 1066-1078, 2016 (Top journal, CCF rank B, SCI IF 5.4).
13. Wenju Liu, Xueliang Zhang, Wei Jiang, et al. “Monaural Voiced Speech Segregation Based On Elaborate Harmonic Grouping Strategies”. Science China Information Sciences, 2011, 54(12): 2471-2480.
14. Xueliang Zhang, Wenju Liu and Bo Xu, “Monaural Voiced Speech Segregation Based On Dynamic Harmonic Function”, In EURASIP Journal On Audio, Speech, And Music Processing, Volume 2010, Article ID 252374, 13 Pages, 2010.
15. 张学良,刘文举,李鹏,徐波,《改进谐波组织规则的单通道浊语音分离系统》,声学学报,36(1): 89-96,2011.
Conference:
1. Yulong Wang and Xueliang Zhang, "MFT-CRN: Multi-scale Fourier Transform for Monaural Speech Enhancement." Interspeech 2023, Ireland.(Top conference, CCF rank C)
2. Jinjiang Liu and Xueliang Zhang, "ICCRN: Inplace Cepstral Convolutional Recurrent Neural Network for Monaural Speech Enhancement," ICASSP 2023, Greece.(Top conference, CCF rank B)
3. Jinjiang Liu and Xueliang Zhang, "Inplace Cepstral Speech Enhancement System for the ICASSP 2023 Clarity Challenge," ICASSP 2023, Greece.(Top conference, CCF rank B)
4. Jiahui Pan, Shuai Nie, Hui Zhang, Shulin He, Kanghao Zhang, Shan Liang, Xueliang Zhang and Jianhua Tao, "Speaker Recognition-Assisted Robust Audio Deepfake Detection", Interspeech 2022, Korea. (Top conference, CCF rank C)
5. Peng Zhang, Peng Hu and Xueliang Zhang, "Norm-constrained Score-level Ensemble for Spoofing Aware Speaker Verification", Interspeech 2022, Korea. (Top conference, CCF rank C)
6. Chenggang Zhang, Jinjiang Liu and Xueliang Zhang, "A Lightweight Complex Spectral Mapping Framework for Stereophonic Acoustic Echo Cancellation", Interspeech 2022, Korea. (Top conference, CCF rank C)
7. Zeyuan Wei, Hao Li and Xueliang Zhang, "Model Compression by Iterative Pruning with Knowledge Distillation and Its Application to Speech Enhancement", Interspeech 2022, Korea. (Top conference, CCF rank C)
8. Jinjiang Liu and Xueliang Zhang, "DRC-NET: Densely Connected Recurrent Convolutional Neural Network for Speech Dereverberation", ICASSP 2022, Singapore. (Top conference, CCF rank B)
9. Chenggang Zhang, Jinjiang Liu and Xueliang Zhang, "A Complex Spectral Mapping with Inplace Convolution Recurrent Neural Networks for Acoustic Echo Cancellation", ICASSP 2022, Singapore. (Top conference, CCF rank B)
10. Heming Wang, Xueliang Zhang and Deliang Wang, "Attention-based Fusion for Bone-conducted and Air-conducted Speech Enhancement in the Complex Domain", ICASSP 2022, Singapore. (Top conference, CCF rank B)
11. Yang Yang, Hui Zhang, Xueliang Zhang and Huaiwen Zhang, "Alleviating the Loss-Metric Mismatch in Supervised Single-channel Speech Enhancement", ICASSP 2022, Singapore. (Top conference, CCF rank B)
12. Kanghao Zhang, Shan Liang, Shuai Nie, Shulin He, Jiahui Pan, Xueliang Zhang, Xinhao Ma and Jiangyan Yi, "A Robust Deep Audio Splicing Detection Method Via Singularity Detection Feature", ICASSP 2022, Singapore. (Top conference, CCF rank B)
13. Peng Zhang, Peng Hu and Xueliang Zhang, "Investigation of IMU&Elevoc Submission for the Short-duration Speaker Verification Challenge 2021", Interspeech 2021, Brno, Czechia. (Top conference, CCF rank C).
14. Kanghao Zhang, Shulin He, Hao Li and Xueliang Zhang, "DBNet: A Dual-branch Network Architecture Processing on Spectrum and Waveform for Single-channel Speech Enhancement", Interspeech 2021, Brno, Czechia. (Top conference, CCF rank C).
15. Jinjiang Liu and Xueliang Zhang, "Inplace Gated Convolutional Recurrent Neural Network for Dual-channel Speech Enhancement", Interspeech 2021, Brno, Czechia. (Top conference, CCF rank C).
16. Ke Tan, Xueliang Zhang and Deliang Wang, "Real-Time Speech Enhancement for Mobile Communication Based on Dual-Channel Complex Spectral Mapping", ICASSP 2021, Toronto, Canada. (Top conference, CCF rank B).
17. Yue Gu; Zhihao Du, Hui Zhang and Xueliang Zhang, "An Efficient Joint Training Framework for Robust Small-Footprint Keyword Spotting," ICONIP 2020, Bangkok, Thailand.
18. Hao Li, DeLiang Wang, Xueliang Zhang and Guanglai Gao, "Frame-level Signal-to-Noise Ratio Estimation using Deep Learning," INTERSPEECH 2020, Shanghai, China. (Top conference, CCF rank C)
19. Peng Zhang and Xueliang Zhang "Deep Template Matching for Small-footprint and Configurable Keyword Spotting," INTERSPEECH 2020, Shanghai, China. (Top conference, CCF rank C)
20. Zhihao Du, Jiqing Han and Xueliang Zhang, "Double Adversarial Network based Monaural Speech Enhancement for Robust Speech Recognition," INTERSPEECH 2020, Shanghai, China. (Top conference, CCF rank C)
21. Peng Zhang, Peng Hu and XueLiang Zhang, "Deep Embedding Learning for Text-Dependent Speaker Verification," INTERSPEECH 2020, Shanghai, China. (Top conference, CCF rank C)
22. Chenggang Zhang and Xueliang Zhang, "A Robust and Cascaded Acoustic Echo Cancellation Based on Deep Learning," INTERSPEECH 2020, Shanghai, China. (Top conference, CCF rank C)
23. Tianjiao Xu, Hui Zhang and Xueliang Zhang, "Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection," INTERSPEECH 2020, Shanghai, China. (Top conference, CCF rank C)
24. Hao Li, Xueliang Zhang and Guanglai Gao, "Beamformed Feature for Learning-based Dual-channel Speech Separation," ICASSP 2020, Barcelona, Spain. (Top conference, CCF rank B).
25. Shulin He, Hao Li and Xueliang Zhang, "SpeakerFilter: Deep Learning-Based Target Speaker Extraction Using Anchor Speech," ICASSP 2020, Barcelona, Spain. (Top conference, CCF rank B).
26. Zhihao Du, Xueliang Zhang and Jiqing Han, "Investigation of Monaural Front-End Processing for Robust Speech Recognition without Retraining or Joint-Training," APSIPA ASC 2019, Lanzhou, China.
27. Tianjiao Xu, Hui Zhang and Xueliang Zhang, "Joint Training ResCNN-based Voice Activity Detection with Speech Enhancement," APSIPA ASC 2019, Lanzhou, China.
28. Jingdong Li, Hui Zhang, Xueliang Zhang and Changliang Li, "Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks," APSIPA ASC 2019, Lanzhou, China.
29. Hao Li, Xueliang Zhang and Guanglai Gao, “Dynamic-attention based Encoder-decoder model for Speaker Extraction with Anchor speech,” APSIPA ASC 2019, Lanzhou, China.
30. Tianjiao Xu, Hao Li, Hui Zhang and Xueliang Zhang, “Improve Data Utilization with Two-stage Learning in CNN-LSTM-based Voice Activity Detection,” APSIPA ASC 2019, Lanzhou, China.
31. Yun Liu, Hui Zhang, Xueliang Zhang and Yuhang Cao, “Investigation of Cost Function for Supervised Monaural Speech Separation,” INTERSPEECH 2019, Graz, Austria. (Top conference. CCF rank C)
32. Ke Tan, Xueliang Zhang and DeLiang Wang, "Real-time Speech Enhancement using an Efficient Convolutional Recurrent Network for Dual-Microphone Mobile Phones in Close-talk Scenarios," ICASSP 2019, Brighton, UK. (Top conference, CCF rank B)
33. Fei Zhao, Hao Li and Xueliang Zhang, "A Robust Text-Independent Speaker Verification Method based on Speech Separation and Deep Speaker," ICASSP 2019, Brighton, UK. (Top conference, CCF rank B)
34. Yun Liu, Hui Zhang, Xueliang Zhang and Linju Yang, "Supervised Speech Enhancement with Real Spectrum Approximation," ICASSP 2019, Brighton, UK. (Top conference, CCF rank B)
35. Jingdong Li, Hui Zhang, Rui Liu, Xueliang Zhang and Long Fei, "End-to-End Mongolian Text-to-Speech System," ISCSLP 2018, Taipei, Taiwan.
36. Yun Liu, Hui Zhang, and Xueliang Zhang, "Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation," INTERSPEECH 2018, Hyderabad, India. (Top conference, CCF rank C).
37. Zhong-Qiu Wang, Xueliang Zhang and DeLiang Wang, "Robust TDOA Estimation Based on Time-Frequency Masking and Deep Neural Networks", INTERSPEECH 2018, Hyderabad, India. (Top conference, CCF rank C).
38. Hui Zhang, Xueliang Zhang and Guanglai Gao, "Training Supervised Speech Separation System to Improve STOI and PESQ Directly," ICASSP 2018,Calgary, Alberta, Canada. (Top conference, CCF rank B).
39. Qinglong Li, Xueliang Zhang and Hao Li, "Online Direction of Arrival Estimation Based on Deep Learning," ICASSP 2018, Calgary, Alberta, Canada. (Top conference, CCF rank B).
40. Shasha Xia, Hao Li and Xueliang Zhang, "Using Optimal Ratio Mask as Training Target for Supervised Speech Separation", APSIPA ASC 2017, Kuala Lumpur, Malasiya.
41. Xueliang Zhang and DeLiang Wang, “Binaural Reverberant Speech Separation Based on Deep Neural Networks,” INTERSPEECH 2017, Stockholm, Sweden. (Top conference, CCF rank C).
42. Hui Zhang, Xueliang Zhang and Guanglai Gao, "Multi-target Ensemble Learning for Monaural Speech Separation," INTERSPEECH 2017, Stockholm, Sweden. (Top conference, CCF rank C).
43. Xueliang Zhang, Zhong-Qiu Wang and DeLiang Wang, "A speech enhancement algorithm by iterating single- and multi-microphone processing and its application to robust ASR," ICASSP 2017, New Orleans, USA. (Top conference, CCF rank B).
44. Hui Zhang, Xueliang Zhang and Guanglai Gao, "Multi-channel Speech Enhancement Based on Deep Stacking Network," Speech Processing in Everyday Environments (CHiME 2016).
45. Hao Li, Shuai Nie, Xueliang Zhang and Hui Zhang, "Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation," INTERSPEECH 2016, San Francisco, USA. (Top conference, CCF rank C).
46. Hong Su, Hui Zhang, Xueliang Zhang and Guanglai Gao. "Convolutional Neural Network for Robust Pitch Determination," ICASSP 2016, Shanghai, China. (Top conference, CCF rank B).
47. Shuai Nie, Shan Liang, Hao Li, Xueliang Zhang, et al. "Exploiting Spectro-Temporal Structures Using NMF For DNN-Based Supervised Speech Separation," ICASSP 2016, Shanghai, China. (Top conference, CCF rank B).
48. Shuai Nie, Shan Liang, Wei Xue, Xueliang Zhang, et al. "Two-Stage Multi-Target Joint Learning for Monaural Speech Separation," INTERSPEECH 2015, Dresden, Germany. (Top conference, CCF rank C).
49. Shuai Nie, Wei Xue, Shan Liang, Xueliang Zhang, et al. "Joint Optimization of Recurrent Networks Exploiting Source Auto-Regression for Source Separation", INTERSPEECH 2015, Dresden, Germany. (Top conference, CCF rank C).
50. Hui Zhang, Xueliang Zhang and Guanglai Gao, "Document Summarization Based on Semantic Representations," IALP 2015, Guangzhou, China.
51. Hui Zhang, Xueliang Zhang, Shuai Nie, et al. "A Pariwise Algorithm for Pitch Estimation and Speech Separation Using Deep Stacking Network", ICASSP 2015, Brisbane, Australia. (Top conference, CCF rank B).
52. Shuai Nie, Hui Zhang, Xueliang Zhang and Wenju Liu, "Deep Stacking Networks with Time Series for Speech Separation", ICASSP 2014, Florence, Italy. (Top conference, CCF rank B).
53. Xueliang Zhang, Hui Zhang and Guanglai Gao, "Missing Feature Reconstruction Methods For Robust Speaker Identification," EUSIPCO 2014, Lisbon, Portugal.
54. Xueliang Zhang, Wenju Liu and Bo Xu, "Multi-Pitch Determination Algorithm Based On Mixture Laplacian Distribution," ICALIP 2010, Shanghai, China.
55. Xueliang Zhang and Wenju Liu, "Monaural Voiced Speech Segregation Based on Pitch and Comb Filter," INTERSPEECH 2011, Florence, Italy. (Top conference, CCF rank C).
56. Xueliang Zhang, Wenju Liu, Peng Li and Bo Xu, "Monaural Voiced Speech Segregation Based On Elaborate Harmonic Grouping Strategy," ICASSP 2009, Taipei, Taiwan. (Top conference, CCF rank B).
57. Xueliang Zhang, Wenju Liu, Peng Li and Bo Xu, "Multipitch Detection Based On Weighted Summary Correlogram," ISCSLP 2008, Kunming, China.
主持或参与项目:
l 主持,知识引导的深度学习语音降噪研究,国家自然科学基金,2019.1-2022.12
l 主持,计算听觉场景分析中基于统计模型的听觉片段切分研究,国家自然科学基金,2014.1-2017.12
l 主持,蒙古语音合成系统,内蒙古大学高层次人才引进科研启动项目,2010.12-2013.12
l 主持,基于训练模型的蒙古语语音合成研究,自治区自然科学基金,2011.10-2014.10
(欢迎有志从事语音信号处理的同学报考硕士或者博士研究生)
在读学生:
l 刘晋江(博士生)(2019-),多通道语音分离
l 潘佳慧(博士生)(2021-),语音增强
l 何树林(博士生)(2021-),语音增强
l 沈鹏杰(博士生)(2022-),语音增强
l 刘鑫(博士生)(2022-),语音识别
l 赵飞(2021-)
l 范海鹏(2021-)
l 武天赐(2021-)
l 王玉龙(2021-)
l 赵丽艳(2021-)
l 白景霖(2022-)
l 方圆(2022-)
l 李璟宸(2022-)
l 赵昌江(2022-)
l 刘原武(2022-)
L 郭振龙(2023-)
L 齐宇轩(2023-)
L 包洪涛(2023-)
L 张文政(2023-)
L 薛紫旋(2023-)
毕业学生:
l 丁祺(2011-2014), 基于隐马尔可夫的石油管道检测 (IBM)
l 李婷慧(2012-2014),蒙古文韵律预测算法研究
l 康征(2013-2015),基于深度神经网络语音识别
l 苏红(2013-2016),噪声环境下基音提取 (滴滴出行)
l 彭晓腾(2013-2016),听觉感知评估方法
l 石博天(2014-2016),深度神经网络,语音分离 (商汤科技)
l 李欢(2014-2016),说话人识别
l 郭亚娜(2014-2016),深度神经网络在语音识别中的应用
l 夏莎莎(2014-2017),语音分离
l 杨冰晴(2015-2017),语音分离(嘉和美康)
l 张子慧(2015-2018),说话人识别
l 李庆龙(2015-2018),声源定位 (小米科技)
l 王思蒙(2015-2018),语音端点检测(工商银行山东省分行)
l 李光鹏(2016-2018),语音分离算法DSP实现 (招商银行济南分行)
l 毛振苏(2016-2018),单通道语音分离
l 刘允(2016-2019),单通道语音分离(搜狗)
l 赵飞(2016-2019),鲁棒性说话人识别(思必驰)
l 谷悦(2017-2019),语音关键词检测(好未来)
l 刘乐(2017-2019),单通道语音分离
l 李劲东(2017-2020),单通道语音分离(搜狗)
l 许天骄(2017-2020),鲁棒性VAD检测(中国人寿研发中心)
l 张鹏(2018-2021),声纹识别(大象声科)
l 王心恬(2018-2021),语音恢复(好未来)
l 王志杰(2018-2021),频带拓展
l 李号(博士)(2017-2021),麦克风阵列技术(南方科技大学,博士后)
l 马英(2019-2022),单通道语音增强(内蒙古移动)
l 李苗(2019-2022),单通道语音增强(小米)
l 白馨(2019-2022),语音质量评估(字节跳动)
l 何树林(2019-2021),语音分离、目标语音抽取(内蒙古大学,博士在读)
l 杨洋(2019-2021),语音增强(内蒙古大学,博士在读)
l 张成刚(博士)(2018-2022),内蒙古民族大学
l 张康豪(2020-2023),理想汽车
l 张泰龙(2020-2023),自主创业
l 魏泽渊(2020-2023),
l 张雨竹(2020-2023),内蒙古银行
学术任职:
INTERSPEECH 2020 session chair, APSIPA 2019 session chair;
INTERSPEECH, ICASSP, ACM/IEEE Transaction on Audio, Speech and Language Processing,Journal of Selected Topics in Signal Processing, Speech Communication, IEEE Signal Processing Letters, EURASIP Journal on Audio, Speech, and Music Processing, Computational Intelligence and Neuroscience审稿人;
中国计算机学会(CCF)语音对话与听觉专业组委员。