He was a Visiting
Scientist/Professor at
the Computer Science and Artificial Intelligence Laboratory (CSAIL),
Massachusetts Institute of Technology (MIT),
Cambridge, USA, an Associate Professor in the Department of Electronic
Engineering at Shanghai Jiao
Tong University (SJTU), China,
and a postdoctoral
fellow at AI Spoken Language Lab in the
Department of Computer Science at KAIST,
Korea. He is now a
Professor in the Department of Electronic Systems and a co-head of the
Centre for Acoustic Signal Processing Research at Aalborg
University, Denmark. He received the
Ph.D. degree in Electronic Engineering from Shanghai Jiao
Tong University in 1999, the B.S. and M.S. degrees in Electrical
Engineering from Hunan University in 1990 and 1996,
respectively.
His research interests include machine
learning, deep learning, pattern recognition, speech recognition, speaker recognition, noise-robust
speech processing (speech enhancement and separation, robust features,
voice activity detection), multimodal (audio-visual) signal processing,
social robotics (built a multimodal interactive social robot called iSocioBot), and recommender systems, which are the topics he has spent the last two decades on. He has about 200 publications in IEEE/ACM-TASLP, IEEE-TNNLS, IEEE-TKDE, IEEE-TMM, IEEE-TAC, IEEE-TCE, IEEE-TSG, IEEE-J-STSP, IEEE-SPL IEEE INTELL SYST, Neurocomputing, CSL, SpeechComm, ICASSP, INTERSPEECH, and other venues. He edited the book Automatic Speech Recognition on Mobile Devices and over Communication Networks (Springer-Verlag, 2008).
He is elected as Vice Chair of the IEEE Signal Processing Society Machine Learning for Signal
Processing Technical Committee (MLSP TC) for 2020 and Chair for the term of 2021-2022. He was the General Chair for 2018 IEEE 28th International Workshop on Machine Learning for
Signal Processing (MLSP2018), Technical Co-chair for the IEEE Spoken Language Technology Workshop (SLT 2016), Track Chair in MLSP for ICASSP, Area Chair in Speech and Language Processing and Area Chair in Multimedia Signal Processing for the annual European
Signal Processing Conferences (EUSIPCO).
He
has served as an Editorial Board Member/Associate Editor for IEEE/ACM Transactions on
Audio, Speech and Language Processing, Computer
Speech and Language, Digital Signal Processing, and Computers and
Electrical Engineering. He was a Lead Guest Editor of the IEEE Journal
of Selected Topics in Signal Processing and a Guest Editor of
Neurocomputing. He is a Senior Member of the
IEEE, and
a Member of ISCA.
He
has received major grants from European Commission Horizon 2020, Danish
Council for Independent Research, Innovation Fund Denmark, Danish
Strategic Research Council and various industrial grants in the areas
of machine learning/deep learning, speech and multimodal signal
processing with applications to intelligent and interactive machines.
.Vice Chair of the IEEE Signal Processing Society Machine Learning for Signal Processing Technical Committee (MLSP TC).
.Associate Editor for IEEE/ACM Transactions on
Audio, Speech and Language Processing.
.Member of the IEEE Signal Processing Society Machine Learning for Signal Processing Technical Committee (MLSP TC).
.Invited talk at Tencent AI Lab, “Deep Representation Learning for Speech and Multimodal Signal Processing”, Bellevue, USA, December 2018.
.General
Chair of 2018 IEEE 28th International Workshop on Machine Learning for
Signal Processing (MLSP2018), September 17-20, 2018, Aalborg, Denmark. Welcome to Aalborg!
. Area
Chair in Speech Processing and Human Language Technology, The 26th
European Signal Processing Conference (EUSIPCO 2018), September 3-7,
2018, Rome, Italy.
.Invited talk at MIT, “Deep Learning for Speech and Multimodal Signal Processing”, Cambridge, USA, 2017.
.Co-chair of
Oral Session “Far-field Speech Recognition” at Interspeech 2017,
Stockholm, Sweden, 20-24 August 2017.
.Chair
of the International Workshop on Sensing, Processing and Learning for
Intelligent Machines (SPLINE2016), July 6-8, 2016,
Aalborg, Denmark. SPLINE2016
IEEE Xplore Proceedings.
.Invited
talks at Tsinghua University, Beijing University of Posts and
Telecommunications, Beijing Institute of Technology, and Technical
University of Denmark, 2016.
.Area
chair in Speech and Language Processing, The 23rd European Signal
Processing Conference (EUSIPCO 2015), Nice, France.
.Chair
of Oral Sessions “Face Recognition I” and "Face Recognition II" at IEEE
ICIP 2015, 27-30 September 2015, Quebec City, Canada.
.Editorial Board Member/Associate
Editor, Computer Speech and Language (CSL) (since 2009),Digital Signal Processing (DSP), Computers
and
Electrical Engineering (CAEE), International
Journal of
Data Mining, Modelling and Management (IJDMMM).
. Guest
Editor, Machine Learning for Big Data Processing in Mobile Internet,
Special Issue of Springer Wireless Personal Communications.
.Lead guest
editor, Speech Processing for Natural Interaction with Intelligent
Environments, Special Issue of IEEE Journal of Selected Topics in
Signal Processing (J-STSP, Impact Factor: 3.629). Call-for-Papers.
. Guest
editor, New Trends in Signal Processing and Biomedical Engineering,
Special Issue of Elsevier Computers and Electrical Engineering(CAEE).
.Chair
of the 3rd AAU Workshop on Robotics (AAUROB2014),
Aalborg, Denmark.
.Best Paper Award (with Jesper
Jensen) at the 4th IEEE International Conference on Network
Infrastructure and
Digital Content (IEEE IC-NIDC2014),
Beijing, China.
.Area Chair
in Multimedia Signal Processing, The 21st European
Signal Processing Conference (EUSIPCO 2013), Marrakech, Morocco.
.Invited talks at Hunan University and Hunan Normal
University, “Multimodal Sensing and Machine Intelligence”, Changsha, China, 2013.
.Invited talk at BBN
Technologies,
"Speech Denoising and Voice Activity Detection", Cambridge, USA, 2012.
.Invited talk at MIT, "Variable Frame Rate Analysis and
Denoising for Speech Recognition", Cambridge, USA, 2012.
.Invited talk at BUPTNational 111 Base, “Multimodal Sensing for
Identification and Interaction in the IoT”, Beijing, China, 2012. News page.
.Invited talk at University of Eastern Finland, Speech and
Audio Processing Seminar, Joensuu, Finland, 2011.
.Program
Co-Chair of The 3rd International Congress on Image and Signal
Processing (CISP 2010), Yantai, China, 16-18 October 2010.
.Organising
Committee Member and Area Chair in Multimedia Signal Processing, The 18th European Signal Processing Conference (EUSIPCO 2010), Aalborg, Denmark, Aug. 23 - 28, 2010.
.Co-organiser,
Special Session "Person Tracking for Assistive Working and Living
Environment" at EUSIPCO
2010, Denmark.
.Tutorial,
"Internet of Things:
Opportunities and Challenges", at the 13th
International Symposium on Wireless Personal Multimedia Communications
(WPMC 2010), 11-14 October, 2010, Recife, Brazil.
.Chair, Special Session on "Speech recognition in
ubiquitous networking and context-aware computing(Webpage, PDF) at Interspeech 2005, Lisbon, Portugal, Sept. 2005.
.Automated
audiovisual inference of the intention of multiple users in the home.
The Innovation Fund Denmark and Bang & Olufsen A/S. 2016-2019. Featured under AAU Digital Hub Denmark - Artificial Intelligence.
.Speech
Enhancement for Hearing Aid Applications using Machine Learning
Techniques. Project funded by Oticon Foundation. 2015-2018. News.
.Durable
Interaction with Socially Intelligent Robots (iSocioBot, or SocioBot). Project
funded by The
Danish Council for Independent Research, Technology and Production
Sciences. 2013-2017. News
in Ingeniøren, NordJyske,
Nibeavis.Article 1 and article
2
in BiTE (part of BT). Our robots being @ the People’s Meeting
(Folkemødet) in Bornholm, June 2015, the official opening of the Day of
Research 2014 in Denmark, ‘Safe 7′ in Nibe 2014, and the Culture Night
2014 in Copenhagen at the Ministry of Higher Education and Science.
.CoSound – A Cognitive Systems Approach to Enriched and
Actionable Information from Audio Streams. Project
funded by Danish Strategic Research Council. 2012-2016.
.A Robust
Audio-based Hybrid Recommendations Framework for Interactive TV. Project
funded by Bang & Olufsen A/S and The
Danish Council for Technology and Innovation. 2012-2015.
.Machine Learning Spring 2020, Spring 2019, Spring 2018, Spring 2017, Spring 2016, Spring 2015, Spring 2013, Spring
2011, Fall 2009, Fall 2007. PhD moodle page.
.Deep LearningSpring 2020, Spring 2019, Spring 2018, Spring 2017, Fall
2015 (with Dong Yu,
Microsoft Research). PhD moodle page.
.Introduction to Reinforcement Learning and Dynamic Programming (reinforcement learning part) Fall 2019.
.Signal
Processing for Hearing Assistive Devices (PhD course and Winter
School) Fall 2017.
.Advanced
Technologies for Green Wireless Communication Networks (data fusion) Spring 2015.
.Energy Efficient Technologies for Green Wireless Sensor
Networks (energy-efficient data fusion)
Spring 2014.
.
Research in Vision, Graphics and Interactive Systems, 2019, 2018, 2017.
.B&O Innovation Camp,
annual 3-week summer event since 2012 (2019, 2013 and 2012 at Shanghai Jiao Tong University, China and other years at Bang & Olufsen, Denmark).
. About 100 Postdoc, PhD, Master
and Bachelor
student projects, among which 15 are PhD projects. (Info about student project and exams. Curricula,
studieordninger in Danish)
Copyright
Notice: The copyright of each paper belongs to the respective
publisher. Electronic copy is provided for personal research and
reference only. Here is a list of journals and conferences with submission
deadlines.
Journal
papers:
Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao and Jun Guo, “Deep InterBoost Networks for Small-sample Image Classification,” accepted by Neurocomputing, 2020.
Xiaoxu Li, Dongliang Chang, Zhanyu Ma, Zheng-Hua Tan, Jing-Hao Xue, Jie Cao, Jingyi Yu, Jun Guo, “OSLNet: Deep Small-Sample Classification with an Orthogonal Softmax Layer,” accepted by IEEE Transactions on Image Processing, 2020.
Iván López-Espejo, Zheng-Hua Tan and Jesper Jensen, “Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices,” IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 28, pp. 1233-1247, December 2020.
Zhanyu Ma, Xiaoou Lu, Jiyang Xie, Zhen Yang, Jing-Hao Xue, Zheng-Hua Tan, Bo Xiao, Jun Guo, "On the Comparisons of Decorrelation Approaches for non-Gaussian Neutral Vector Variables," accepted by IEEE Transactions on Neural Networks and Learning Systems, 2020.
Morten Kolbæk, Zheng-Hua Tan, Søren Holdt Jensen and Jesper Jensen, “On Loss Functions for Supervised Monaural Time-Domain Speech Enhancement,” IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 28, pp. 825-838, January 2020.
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson and Jesper Jensen, “Deep-Learning-Based Audio-Visual Speech Enhancement in Presence of Lombard Effect,” Speech Communication, vol. 115, pp. 38-50, December 2019.
Miklas S. Kristoffersen, Sven E. Shepstone, and Zheng-Hua Tan, “The Importance of Context When Recommending TV Content: Dataset and Algorithms,” accepted by IEEE Transactions on Multimedia, 2019.
Yonggang Qi and Zheng-Hua Tan, "SketchSegNet+: An End-to-end Learning of RNN for Multi-Class Sketch Semantic Segmentation," IEEE Access, vol. 7, pp. 102717-102726, July 2019.
Zheng-Hua Tan, Achintya kr. Sarkar and Najim Dehak,
“rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection
Method,” Computer Speech and Language, vol. 59, pp. 1-21, January 2020.. Source code in Matlab and Python.
Achintya kr. Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon
and James Glass, "Time-Contrastive Learning Based Deep Bottleneck
Features for Text-Dependent Speaker Verification," IEEE/ACM Transactions on
Audio, Speech and Language Processing, vol. 27, no. 8, pp.1267-1279,
August 2019. PDF
from IEEEXplore.
Morten Kolbæk, Zheng-Hua Tan and Jesper Jensen, "On the
Relationship between Short-Time Objective Intelligibility and
Short-Time Spectral-Amplitude Mean-Square Error for Speech
Enhancement," IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 27, no. 2, pp. 283-295, February 2019.
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan
and
Jesper Jensen, "Non-Intrusive Speech Intelligibility Prediction using
Convolutional Neural Networks," IEEE/ACM Transactions on Audio, Speech and
Language Processing, vol. 26, no. 10, pp. 1925-1939, October 2018.
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan and
Jesper Jensen, “Refinement and Validation of the Binaural Short Time
Objective Intelligibility Measure for Spatially Diverse Conditions,” Speech Communication, vol. 102, pp. 1-13, September 2018.
Xiaodong Duan and Zheng-Hua Tan, “A Spatial Self-Similarity
Based Feature Learning Method for Face Recognition under Varying
Poses,” Pattern Recognition Letters, vol. 111, pp. 109-116, August 2018.
Hengwei
Lin, Kai Sun, Zheng-Hua Tan, Chengxi Liu, Josep M. Guerrero and Juan C.
Vasquez, “Adaptive Protection Combined with Machine Learning for Microgrids,” IET Generation, Transmission
& Distribution, vol. 13, no. 6, pp. 770-779, March 2019.
Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Rainer Martin, and Jun
Guo, "Spoofing Detection in Automatic Speaker Verification Systems
Using DNN Classifiers and Dynamic Acoustic Features," IEEE
Transactions on Neural Networks and Learning Systems, vol. 29, no. 10, pp 4633-4644, October 2018.
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan,
and Jesper Jensen, “Bias-compensated Informed Sound Source Localization
Using Relative Transfer Functions," IEEE/ACM Transactions on Audio,
Speech and Language Processing, vol. 26, no. 7, pp. 1275 – 1289, July
2018.
Chengshi Zheng, Zheng-Hua Tan, Renhua Peng, and Xiaodong Li, “Guided spectrogram filtering for speech dereverberation,” Applied Acoustics, vol. 134, pp. 154-159, May 2018.
Renhua Peng, Zheng-Hua Tan, Xiaodong Li, and Chengshi
Zheng. "A Perceptually Motivated LP Residual Estimator in Noisy and
Reverberant Environments," Speech Communication, vol. 96, pp. 129-141, February 2018.
(Elsevier).
Md Sahidullah, Dennis Alexander Lehmann
Thomsen, Rosa Gonzalez Hautamaki, Tomi Kinnunen, Zheng-Hua Tan, Robert
Parts, Martti Pitkanen,
“Robust Voice Liveness Detection and Speaker Verification Using Throat
Microphones,” IEEE/ACM Transactions on Audio, Speech and
Language Processing, vol. 26, no. 1, pp. 44 – 56, January 2018.
Zheng-Hua Tan, Nicolai Bæk Thomsen, Xiaodong Duan,
Evgenios Vlachos, Sven Ewan Shepstone, Morten H. Rasmussen and Jesper
Lisby Højvang, "iSocioBot - A Multimodal Interactive Social Robot," International Journal of
Social Robotics, vol. 10, no. 1, pp. 5–19, January 2018. (Springer).
Jen-Tzung Chien, Chao-Hsi Lee and Zheng-Hua Tan, "Latent
Dirichlet Mixture Model," Neurocomputing, vol. 278, pp. 12-22, February 2018.
Sven Ewan Shepstone, Zheng-Hua Tan and Miklas Strøm
Kristoffersen, "Using Closed-set Speaker Identification Score
Confidence to Enhance Audio-based Collaborative Filtering for Multiple
Users," IEEE Transactions on Consumer Electronics, vol. 64, no. 1, pp. 1-8, February 2018.
Sven Ewan Shepstone, Zheng-Hua Tan and Søren Holdt
Jensen,
“Audio-based Granularity-adapted Emotion Classification,” IEEE
Transactions on Affective Computing, vol. 9, no. 2, pp. 176-190,
April-June 2018. PDF
from IEEEXplore.
Morten Kolbæk, Dong Yu, Zheng-Hua Tan and Jesper
Jensen, "Multi-talker Speech Separation with Utterance-level
Permutation Invariant Training of Deep Recurrent Neural Networks”,
IEEE/ACM
Transactions on Audio, Speech and Language Processing, vol. 25, no. 10, pp. 1901-1913,
October 2017. PDF
from IEEEXplore.
Achintya Sarkar and Zheng-Hua Tan, "Incorporating
Pass-Phrase Dependent Background Models for Text-Dependent Speaker
Verification,” Computer Speech & Language, vol. 47, pp. 259-271,
January 2018. PDF
from Elsevier.
Stefanos Astaras, Aristodemos Pnevmatikakis and Zheng-Hua
Tan, "Visual Detection of Events of Interest from Urban Activity,"
Wireless Personal Communications, vol. 97, no. 2, November 2017, pp. 1877–1888.
Zhanyu Ma, Jing-Hao Xue, Arne Leijon, Zheng-Hua Tan, Zhen Yang,
and Jun Guo, "Decorrelation of Neutral Vector: Theory and Applications,”
IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 1, pp. 129 – 143, January 2018. PDF
from IEEEXplore.
Swati Prasad, Zheng-Hua Tan and Ramjee Prasad,
"Frame Selection for
Robust Speaker Identification: A Hybrid Approach,” Wireless Personal
Communications, vol. 97, no. 1, pp. 933–950, November 2017. (Springer). PDF from
Springer.
Hong
Yu, Zheng-Hua Tan, Yiming Zhang, Zhanyu Ma, and Jun Guo, “DNN Filter
Bank Cepstral Coefficients for Spoofing Detection,” IEEE Access, vol. 5, pp. 4779-4787, March 2017. PDF
from IEEEXplore.
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan,
and Jesper Jensen, “Informed Sound Source Localization using Relative
Transfer Functions for Hearing Aid Applications,” IEEE/ACM
Transactions on Audio, Speech and Language Processing, vol. 25, no. 3,
pp. 611-623, March 2017. PDF
from IEEEXplore.
Morten Kolbæk, Zheng-Hua Tan and Jesper Jensen, "Speech
Intelligibility Potential of General and Specialized Deep Neural
Network based Speech Enhancement Systems," IEEE/ACM
Transactions on Audio, Speech and
Language Processing, vol. 25, no. 1, pp. 153-167, January 2017. PDF
from IEEEXplore.
Zhanyu Ma, Hong Yu, Zheng-Hua Tan, and Jun Guo,
“Text-Independent Speaker Identification Using the Histogram Transform
Model”, IEEE ACCESS, vol. 4, pp. 9733 - 9739, January 2017. PDF
from IEEEXplore.
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan
and Jesper Jensen, “Predicting the Intelligibility of Noisy and
Non-Linearly Processed Binaural Speech," IEEE/ACM Transactions on
Audio, Speech and
Language Processing, vol. 24, no. 11, November 2016. PDF
from IEEEXplore.
Elizabeth
Ann Jochum, Evgenios Vlachos, Sally Grindsted Nielsen,
Anja Christoffersen, Ibrahim Hameed and Zheng-Hua Tan, "Using Theatre
to Study Interaction with Care Robots," International Journal of Social
Robotics, vol. 8, no. 4, pp. 457-470, August 2016. (Springer). PDF
from Springer.
Swati Prasad, Zheng-Hua Tan and Ramjee Prasad, “Multiple Frame
Rates for Feature Extraction and Reliable Frame Selection at the
Decision for Speaker Identification Under Voice Disguise,” Conasense,
vol. 1, no. 1, pp. 29-44, January 2016.
Sven Shepstone, Kong Aik Lee, Haizhou Li, Zheng-Hua Tan and
Søren Holdt Jensen, “Total Variability Modeling using Source-specific
Priors,” IEEE/ACM Transactions on Audio, Speech and
Language Processing, vol. 24, no. 3, pp. 504-517, March 2016. PDF
from IEEEXplore.
Zhanyu Ma, Zheng-Hua Tan, and Jun Guo,
“Feature Selection for Neutral Vector in EEG Signal Classification”,
Neurocomputing, vol. 174, pp. 937-945, January 2016. PDF from Elsevier.
Nikolaos Katsarakis, Aristodemos Pnevmatikakis,
Zheng-Hua Tan and Ramjee Prasad, “Improved Gaussian Mixture Models for
Adaptive Foreground Segmentation,” Wireless Personal Communications,
vol. 87, no. 3, pp. 629-643, April 2016. (Springer). PDF from
Springer.
Yonggang Qi, Jun Guo, Yi-Zhe Song, Tao Xiang,
Honggang Zhang and Zheng-Hua Tan, “Im2Sketch: Sketch Generation by
Unconflicted Perceptual Grouping,” Neurocomputing, vol. 165, pp
338-349, 2015. PDF from Elsevier.
Konstantinos Kouzelis, Zheng-Hua Tan, Birgitte
Bak-Jensen, Jayakrishnan R. Pillai and Ewen Ritchie, “Estimation of
Residential Heat Pump Consumption for Flexibility Market Applications,”
IEEE Transactions on Smart Grid, vol. 6, no. 4, pp. 1852-1864, July
2015. PDF
from IEEEXplore.
Nikolaos
Katsarakis, Aristodemos Pnevmatikakis, Zheng-Hua Tan and Ramjee Prasad,
"Combination of Multiple Measurement Cues for Visual Face Tracking,"
Wireless Personal Communications, vol. 78, no.3, pp. 1789-1810, July
2014. PDF
from Springer.
Emanouil Amolochitis, Ioannis
T. Christou and Zheng-Hua Tan, "Implementing a
Commercial-Strength Parallel Hybrid Movie Recommendation Engine," IEEE
Intelligent Systems (AI Innovation and Industry track), vol. 29, no. 2,
pp. 92-96, Mar-Apr 2014. PDF
from IEEEXplore.
Zhanyu
Ma, Arne Leijon, Zheng-Hua Tan and Sheng Gao, “Predictive
Distribution of the Dirichlet Mixture Model by the Local Variational
Inference Method," The Journal of Signal Processing Systems, 3/2014
74(3), pp: 359-374. PDF from
Springer.
Emanouil Amolochitis, Ioannis T. Christou, Zheng-Hua
Tan
and Ramjee
Prasad, “A Heuristic Hierarchical Scheme for Academic Search and
Retrieval,” Information Processing and Management, vol. 49, no. 6, pp.
1326–1343, November 2013. PDF from Elsevier.
Theodoros
Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan
and Ramjee Prasad, "Exploring super-gaussianity towards
robust information-theoretical time delay estimation," The Journal of
the Acoustical Society of America (JASA), vol.
133, no. 3, pp. 1515-1524,
2013. PDF from JASA.
Pejman
Mowlaee, Rahim Saeidi, Mads Græsbøll Christensen, Zheng-Hua Tan,
Tomi Kinnunen, Pasi Franti, and Søren Holdt Jensen, “A Joint Approach
for Single-Channel Speaker Identification and Speech Separation,” IEEE
Transactions on Audio, Speech and Language Processing, vol.20, no.9,
pp.2586-2601, Nov. 2012.PDF from IEEEXplore.
Hongbing Cheng, Chunming
Rong, Zhenghua Tan and Qingkai
Zeng, "Identity based Encryption and Biometric Authentication Scheme for Secure Data Access in Cloud Computing," Chinese
Journal of Electronics (English edition), vol. 21, no. 2, pp.
254-259, April 2012. PDF from CJE.
Theodoros
Petsatodis, Christos Boukis, Fotios Talantzis, Zheng-Hua Tan
and Ramjee Prasad, “Convex Combination of Multiple Statistical Models with Application to
VAD,” IEEE Transactions on Audio, Speech and Language Processing,
vol. 19, no. 8, pp. 2314 - 2327, November 2011.PDF from IEEEXplore.
Hristijan
Petreski, Sofia Tsekeridou, Eri Giannaka, Neeli Prasad,
Ramjee Prasad and Zheng-Hua Tan,
"Technology-enabled social learning: a review," International
Journal of Knowledge and Learning, vol. 7, nos. 3/4, pp. 253-270,
2011.
Haitian Xu, Zheng-Hua Tan, Paul
Dalsgaard and Børge Lindberg, “Robust Speech Recognition by Non-Local
Means De-Noising Processing,” IEEE Signal Processing
Letters, 2008.PDF from IEEEXplore.
Haitian Xu, Paul Dalsgaard,
Zheng-Hua Tan and Børge Lindberg, “Noise Condition-Dependent Training
Based on Noise Classification and SNR Estimation,” IEEE Transactions on Audio, Speech and Language
Processing, vol. 15, no. 8, pp. 2431 – 2443, Nov. 2007. PDF from IEEEXplore.
Nelly Pustelnik, Zhanyu Ma, Zheng-Hua Tan
and Jan Larsen (eds.), Proceedings of 2018 IEEE International Workshop
on Machine Learning for Signal Processing (MLSP 2018), IEEE Press,
Aalborg, Denmark, September 17-20, 2018. (ISBN:
978-1-5386-5477-4).
Zhanyu
Ma, Jen-Tzung Chien, Zheng-Hua Tan, Yi-Zhe Song, Jalil Taghia and Ming
Xiao, "Recent Advances in Machine Learning for Non-Gaussian Data
Processing,” Neurocomputing, vol. 278, pp. 1-152, February 2018.
Jun Guo, Zheng-Hua Tan, Sung Ho Cho, and Guoqiang Zhang, “Machine
Learning for Big Data Processing in Mobile Internet,” Wireless Personal
Communications, vol. 102, no. 3, pp. 2093-2387, October 2018.
Weichuan
Yu, Zheng-Hua Tan and Yi Wang, “Guest Editors’ Introduction to
the Special Issue on New Trends in Signal Processing and Biomedical
Engineering,” Elsevier Computers and Electrical Engineering,
vol. 38, no. 1, pp. 1-81, January 2012.
Zheng-Hua
Tan, Reinhold Haeb-Umbach, Sadaoki Furui, James R. Glass and Maurizio
Omologo, “Introduction to the Issue on Speech Processing for Natural
Interaction with Intelligent Environments,” IEEE Journal of
Selected Topics in Signal Processing, vol. 4,
no. 5, pp. 769 – 910, October 2010.PDF from IEEEXplore.
Zheng-Hua Tan, Najim Dehak, Jan Larsen and Zhanyu Ma (eds.),
Proceedings of the First International Workshop on Sensing, Processing
and Learning for Intelligent Machines (SPLINE 2016), IEEE Press, 2016.
Zheng-Hua Tan, Shaoping Bai, Thomas Bak,
Matthias Rehm and Elizabeth Ann Jochum (eds.), Proceedings of the 3rd
AAU Workshop on Robotics, AAU Press, 2015.
Mohamed
Abou-Zleikha , Zheng-Hua Tan, Mads Græsbøll Christensen and Søren Holdt
Jensen, "Utilising Tree-Based Ensemble Learning For Speaker
Segmentation,” Full paper published in Springer LNCS: Proceedings of
the 10th International Conference on Artificial Intelligence
Applications and Innovations (AIAI 2014), Island of Rhodes, Greece,
September 19-21, 2014.
Nicolai B. Thomsen, Zheng-Hua Tan, Børge
Lindberg and Søren Holdt Jensen, “Improving Robustness against
Environmental Sounds for Directing Attention of Social Robots,”
Springer LNAI, vol. 8757: Proceedings of the
2nd
Workshop on Multimodal Analyses Enabling Artificial Agents in
Human-Machine Interaction, September 14, 2014, Singapore. PDF
Zheng-Hua
Tan and Børge Lindberg, "Speech Recognition on Mobile Devices," X.
Jiang, M. Ma and C. Chen (eds.), Mobile
Multimedia Processing: Fundamentals, Methods, and Applications,
Springer LNCS, vol. 5960, 2010.
Zheng-Hua Tan, Yi Wan, Tao Xiang,
and Yibin Song (eds.), Proceedings of the 3rd International
Congress on
Image and Signal Processing (CISP 2010), IEEE Press, Yantai,
China, October 2010. (ISBN: 978-1-4244-6515-6)
Zheng-Hua Tan and Imre Varga
“Networked, distributed and embedded speech recognition: an overview”,
Z.-H. Tan, and B. Lindberg (eds.), Automatic
speech recognition on mobile devices and over communication networks,
Springer-Verlag, London, Feb. 2008, pp. 1-23.
Haitian
Xu, Zheng-Hua Tan, Paul
Dalsgaard, Ralf Mattethat and Børge Lindberg, “A Configurable
Distributed Speech Recognition System”, H.
Abut, J.H.L. Hansen, K. Takeda (Editors), Digital Signal
Processing for In-Vehicle and Mobile Systems 2, Springer
Science, New York, NY, 2006.
Paul Dalsgaard, Borge
Lindberg, Henrik Benner and Zheng-Hua Tan, Book Abstract of Eurospeech'01
Proceedings, Kommunik
Grafiske Løsninger, September 2001.
Guangrui Hu, Changqing Xu,
Zheng-Hua Tan and Xinbao Gong, Problem Handbook of Signals and Systems, Science Press
of China, 1999.
Conference
papers:
Daniel Michelsanti, Olga Slizovskaia, Gloria Haro, Emilia Gómez, Zheng-Hua Tan and Jesper Jensen, “Vocoder-Based Speech Synthesis from Silent Videos,” Interspeech 2020, Shanghai, China, October 25-29, 2020.
Iván López-Espejo, Zheng-Hua Tan and Jesper Jensen, “Exploring Filterbank Learning for Keyword Spotting,” The 28th European Signal Processing Conference (EUSIPCO 2020), Amsterdam, The Netherlands, January 18-22, 2021.
Saeid Samizade, Zheng-Hua Tan, Chao Shen,
Xiaohong Guan, "Adversarial Example Detection by Classification for
Deep Speech Recognition,” The 45th International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, May
4-8, 2020.
Poul Hoang, Zheng-Hua Tan, Thomas Lunner, Jan Mark de Haan,
Jesper Jensen, “Maximum Likelihood Estimation of the
Interference-Plus-Noise Cross Power Spectral Density Matrix for Own
Voice Retrieval,” The 45th International Conference on Acoustics,
Speech and Signal Processing (ICASSP 2020), Barcelona, May 4-8, 2020.
Poul Hoang, Zheng-Hua Tan, Jan Mark de Haan, Thomas Lunner and
Jesper Jensen, “Robust Bayesian and Maximum a Posteriori Beamforming
for Hearing Assistive Devices,” The 7th IEEE Global Conference on
Signal and Information Processing (GlobalSIP 2019), Nov. 11-14, 2019,
Shaw Centre, Ottawa, Canada.
Jiyang Xie, Zhanyu Ma, Guoqiang Zhang, Jing-Hao Xue, Zheng-Hua Tan
and Jun Guo, “Soft Dropout and Its Variational Bayes Approximation,”
2019 IEEE International Workshop on Machine Learning for Signal
Processing (MLSP 2019), Oct. 13–16, 2019, Pittsburgh, PA, USA.
Iván López-Espejo, Zheng-Hua Tan, and Jesper Jensen, "Keyword
Spotting for Hearing Assistive Devices Robust to External Speakers,"
Interspeech 2019, September 15-19, 2019, Graz, Austria.
Miklas
S. Kristoffersen, Jacob L. Wieland, Sven E. Shepstone, Zheng-Hua Tan
and Vinoba Vinayagamoorthy, “Deep Joint Embeddings of Context and
Content for Recommendation,” CARS 2.0 – Workshop on Context-Aware
Recommender Systems, in conjunction with RecSys’ 2019, 20 September
2019, Copenhagen, Denmark.
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur
Sigurdsson and Jesper Jensen, “Effects of Lombard Reflex on the
Performance of Deep-Learning-Based Audio-Visual Speech Enhancement
Systems,” 2019 IEEE International Conference on Acoustics, Speech, and
Signal Processing (ICASSP 2019), Brighton, UK, May 12-17, 2019.
Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson and Jesper
Jensen, “On Training Targets and Objective Functions for
Deep-Learning-Based Audio-Visual Speech Enhancement,” 2019 IEEE
International Conference on Acoustics, Speech, and Signal Processing
(ICASSP 2019), Brighton, UK, May 12-17, 2019.
Andrea Coifman, Peter Rohoska, Miklas S.
Kristoffersen, Sven E. Shepstone, and Zheng-Hua Tan, "Subjective
Annotations for Vision-Based Attention Level Estimation," The 14th
International Conference on Computer Vision Theory and Applications
(VISAPP 2019), Prague, Czech Republic, 25-27 February 2019.
Evgenios Vlachos and Zheng-Hua Tan, "Public
Perception of Android Robots: Indications from an Analysis of YouTube
Comments," the 2018 IEEE/RSJ International Conference on Intelligent
Robots and Systems (IROS 2018), Madrid, Spain, 1-5 October 2018.
Hong Yu, Tianrui Hu, Zhanyu Ma, Zheng-Hua Tan and Jun Guo,
"Multi-Task Adversarial Network Bottleneck Features for Noise-Robust
Speaker Verification," the IEEE International Conference on Network
Infrastructure and Digital Content (IC-NIDC 2018), Guiyang, China, August 22 - 24, 2018.
Peter Sibbern Frederiksen, Jesus Villalba, Shinji Watanabe,
Zheng-Hua Tan and Najim Dehak, "Effectiveness of Single-Channel BLSTM
Enhancement for Language Identification," Interspeech 2018, Hyderabad,
India, September 2-6, 2018.
Gabriele
Trovato, Renato Paredes, Javier Balvin, Francisco Cuellar, Nicolai Bæk
Thomsen, Søren Bech, and Zheng-Hua Tan, “The Sound or Silence:
investigating the influence of robot noise on proxemics,” the 27th IEEE
International Conference on Robot and Human Interactive Communication,
RO-MAN 2018, Nanjing and Tai’an, China, 27-31 August 2018.
Morten Kolbæk, Zheng-Hua Tan and Jesper Jensen,
“Monaural Speech Enhancement Using Deep Neural Networks by Maximizing a
Short-Time Objective Intelligibility Measure,” The 43th IEEE
International Conference on Acoustics, Speech and Signal Processing
(ICASSP 2018), 15-20 April 2018, Calgary, Alberta, Canada.
Achintya Kr. Sarkar and Zheng-Hua Tan, “Time-Contrastive
Learning Based DNN Bottleneck Features for Text-Dependent Speaker
Verification,” NIPS 2017 Time Series Workshop, Long
Beach, CA, USA, Dec. 8, 2017.
Xiaodong Duan, Nicolai B. Thomsen, Zheng-Hua Tan, Børge
Lindberg and Søren H. Jensen, “Weighted Score Based Fast Converging
CO-training with Application to Audio-Visual Person Identification,”
The 29th IEEE International Conference on Tools with Artificial
Intelligence (ICTAI2017), Boston, Massachusetts, USA, Nov. 6-8, 2017.
Morten Kolbæk, Dong Yu, Zheng-Hua Tan and Jensen, Jesper,
"Joint Separation and Denoising of Noisy Multi-Talker Speech Using
Recurrent Neural Networks and Permutation Invariant Training,” the IEEE
27th International Workshop on Machine Learning for Signal Processing
(MLSP), Tokyo, Japan, 25-28 September 2017. PDF. Best student paper award. AAU
News
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan and Jesper Jensen, "On the use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure,” Interspeech 2017, Stockholm, Sweden, 20-24 August 2017.
Hong Yu, Zheng-Hua Tan, Zhanyu Ma and Jun Guo, "Adversarial
Network Bottleneck Features for Noise Robust Speaker Verification,”
Interspeech 2017, Stockholm, Sweden, 20-24 August 2017.
Daniel Michelsanti and Zheng-Hua Tan, "Conditional Generative
Adversarial Networks for Speech Enhancement and Noise-Robust Speaker
Verification,” Interspeech 2017, Stockholm, Sweden, 20-24 August 2017. PDF
Achintya Sarkar, Md Sahidullah, Zheng-Hua Tan and Tomi Kinnunen,
"Improving Speaker Verification Performance in Presence of Spoofing
Attacks Using Out-of-Domain Spoofed Data,” Interspeech 2017, Stockholm,
Sweden, 20-24 August 2017.
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan and
Jesper
Jensen, "On the use of Band Importance Weighting in the Short-Time
Objective Intelligibility Measure,” Interspeech 2017, Stockholm,
Sweden, 20-24 August 2017.
K. A. Lee, et al. , "The I4U Mega Fusion and Collaboration for
NIST Speaker Recognition Evaluation 2016,” Interspeech 2017, Stockholm,
Sweden, 20-24 August 2017.
K. A. Lee, et al., "The I4U Submission to the 2016 NIST
Speaker Recognition Evaluation." NIST SRE 2016 Workshop, San Diego,
California, USA, 2016.
Dong Yu, Morten Kolbæk, Zheng-Hua Tan, and Jesper Jensen,
“Permutation Invariant Training of Deep Models for Speaker-independent
Multi-talker Speech Separation,” The 42th IEEE International Conference
on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans,
USA, 5-9 March 2017.
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, and
Jesper Jensen, 'A Non-intrusive Short-time Objective Intelligibility
Measure,” The 42th IEEE International Conference on Acoustics, Speech
and Signal Processing (ICASSP 2017), New Orleans, USA, 5-9 March 2017.
Tomi Kinnunen, Md Sahidullah, Mauro Falcone, Luca Costantini,
Rosa Gonzalez Hautamäki, Dennis Thomsen, Achintya Sarkar, Zheng-Hua
Tan, Hector Delgado, Massimiliano Todisco, Nicholas Evans, Ville
Hautamäki, and Kong Aik Lee, "RedDots Replayed: A New Replay Spoofing
Attack Corpus for Text-dependent Speaker Verification Research,” The
42th IEEE International Conference on Acoustics, Speech and Signal
Processing (ICASSP 2017), New Orleans, USA, 5-9 March 2017.
Morten Kolbæk, Zheng-Hua Tan and Jesper Jensen, "Speech
Enhancement Using Long Short-Term Memory Based Recurrent Neural
Networks for Noise Robust Speaker Verification,” 2016 IEEE Workshop on
Spoken Language Technology (SLT 2016), San Diego, California, USA,
December 13-16, 2016.
Héctor Delgado, Massimiliano Todisco, Md Sahidullah,
Achintya K Sarkar, Nicholas Evans, Tomi Kinnunen and Zheng-Hua Tan,
"Further Optimisations of Constant Q Cepstral Processing for Integrated
Utterance and Text-Dependent Speaker Verification," 2016 IEEE Workshop
on Spoken Language Technology (SLT 2016), San Diego, California, USA,
December 13-16, 2016.
Jen-Tzung Chien, Chao-Hsi Lee and Zheng-Hua Tan, “Dirichlet
Mixture Allocation”, the 26th IEEE International Workshop on Machine
Learning for Signal Processing (MLSP), Salerno-Italy, 13-16 September
2016.
Nicolai Thomsen, Dennis Alexander Lehmann Thomsen,
Zheng-Hua Tan, Børge Lindberg and Søren Holdt Jensen,
"Speaker-dependent Dictionary-based Speech Enhancement for
Text-Dependent Speaker Verification," Interspeech 2016, San Francisco,
USA, 8 - 12 September 2016.
Achintya Kumar Sarkar and
Zheng-Hua Tan, "Text Dependent Speaker Verification Using Unsupervised
HMM-UBM and Temporal GMM-UBM," Interspeech 2016, San Francisco, USA, 8
- 12 September 2016.
Tomi Kinnunen, Md Sahidullah, Ivan
Kukanov, Héctor Delgado,
Massimiliano Todisco, Achintya sarkar, Nicolai Thomsen, Ville
Hautamaki, Nicholas Evans and Zheng-Hua Tan, "Utterance Verification
for Text-Dependent Speaker Recognition: a Comparative Assessment Using
the RedDots Corpus," Interspeech 2016, San Francisco, USA, 8 - 12
September 2016.
Md Sahidullah, Rosa González Hautamäki, Dennis Alexander Lehmann
Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamaki, Robert Parts
and Martti Pitkanen, "Robust Speaker Recognition with Combined Use of
Acoustic and Throat Microphone Speech,"Interspeech 2016, San Francisco,
USA, 8 - 12 September 2016.
Md Sahidullah, Héctor Delgado, Massimiliano Todisco, Hong Yu,
Tomi Kinnunen, Nicholas Evans and Zheng-Hua Tan,"Integrated Spoofing
Countermeasures and Automatic Speaker Verification: an Evaluation on
ASVspoof 2015,"Interspeech 2016, San Francisco, USA, 8 - 12 September
2016.
Tomi Kinnunen, Alexey Sholokhov, Elie Khoury, Dennis Thomsen, Md
Sahidullah and Zheng-Hua Tan, "HAPPY Team Entry to NIST OpenSAD
Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector
Based Speech Activity Detectors," Interspeech 2016, San Francisco, USA,
8 - 12 September 2016. PDF
Hengwei
Lin, Josep M. Guerrero, Juan C. Vásquez, Zheng-hua Tan, Chengxi Liu,
andChenxi Jia, "Adaptive Overcurrent Protection for Microgrids in
Extensive Distribution Systems," the 42nd IEEE Industrial Electronics
Conference (IEEE IECON2016), Florence, Italy, October 24-27, 2016.
Nicolai B. Thomsen, Xiaodong Duan, Zheng-Hua Tan, Børge
Lindberg, and Søren Holdt Jensen, “Improving the Convergence of
CO-training for Audio-Visual Person Identification,” The International
Workshop on Sensing, Processing and Learning for Intelligent Machines
(SPLINE2016), July 6-8, 2016, Aalborg, Denmark.
Mohamed Abou-Zleikha, Mads Græsbøll Christensen, Zheng-Hua Tan,
and Søren Holdt Jensen, “Projecting Emotional Speech into
Arousal-valence Space Using Pairwise Preference Learning,” The
International Workshop on Sensing, Processing and Learning for
Intelligent Machines (SPLINE2016), July 6-8, 2016, Aalborg, Denmark.
Hong Yu, Achintya Sarkar, Dennis Alexander Lehmann Thomsen,
Zheng-Hua Tan, Zhan-Yu Ma, and Jun Guo, “Investigating the Effect of
Multi-conditional Training and Speech Enhancement Methods on Spoofing
Detection,” The International Workshop on Sensing, Processing and
Learning for Intelligent Machines (SPLINE2016), July 6-8, 2016,
Aalborg, Denmark.
Stefanos Astaras, Aristodemos Pnevmatikakis and Zheng-Hua Tan,
“Background Subtraction for Patterns of Activities in Cities,” The
International Workshop on Sensing, Processing and Learning for
Intelligent Machines (SPLINE2016), July 6-8, 2016, Aalborg, Denmark.
Mojtaba Farmani, Richard Heusdens, Michael Syskind Pedersen,
Zheng-Hua Tan and Jesper Jensen, “Concurrent Localization of Sound
Sources and Dual-Microphone Sub-Arrays Using TOFs,” The 19th
International Conference on Information Fusion (FUSION 2016),
Heidelberg, July 5-8, 20016.
Zongji
Sun, Li Meng, Aladdin Ariyaeeinia, Xiaodong Duan, and Zheng-Hua Tan,
“Privacy Protection Performance of De-identified Face Images with and
without Background,” The 39th International ICT Convention MIPRO 2016,
May 30 - June 03, 2016, Opatija, Croatia.
Ibrahim A. Hameed, Zheng-Hua Tan, Nicolai B. Thomsen and Xiaodong
Duan, “User Acceptance of Social Robots,” The 9th International
Conference on Advances in Computer-Human Interactions (ACHI 2016),
Venice, Italy, April 24-28, 2016. Best
Paper Award.
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan
and Jesper Jensen, “A Method for Predicting the Intelligibility of Nisy
and Non-linearly Enhanced Binaural Speech,” The 41th IEEE International
Conference on Acoustics, Speech and Signal Processing (ICASSP 2016),
Shanghai, China, 20-25 March 2016.
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan and
Jesper Jensen, “Informed Direction of Arrival Estimation Using a
Spherical-head Model for Hearing Aid Applications,” The 41th IEEE
International Conference on Acoustics, Speech and Signal Processing
(ICASSP 2016), Shanghai, China, 20-25 March 2016.
Xiaodong Duan and Zheng-Hua Tan, "Neighbors Based
Discriminative Feature Difference Learning for Kinship Verification,”
The 11th International Symposium on Visual Computing, December 14-16,
2015 , Las Vegas, Nevada, USA.
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan
and Jesper Jensen, “Informed TDoA-based Direction of Arrival Estimation
for Hearing Aid Applications,” The 3rd IEEE Global Conference on Signal
and Information Processing (GlobalSIP 2015), Orlando, Florida, USA,
December 14-16, 2015.
Sally Grindsted Nielsen, Anja Christoffersen, Elizabeth Jochum
and Zheng-Hua Tan, "Robot Future: Using Theatre to Influence Acceptance
of Care Robots," The New Friend 2015 Conference, Almere, The
Netherlands, October 22-23, 2015. Best
Paper Award Runner-up.
Nicolai Thomsen, Zheng-Hua Tan, Børge Lindberg and Søren
Holdt Jensen, “A Heuristic Approach for a Social Robot to Navigate to a
Person Based on Audio and Range Information,” 2015 IEEE/RSJ
International Conference on Intelligent Robots and Systems (iROS),
Hamburg, Germany, September 28 - October 02, 2015.
Ivan Kraljevski, Zheng-Hua Tan and Maria Paola Bissiri,
“Comparison of Forced-Alignment Speech Recognition and Humans for
Generating Reference VAD,” Interspeech 2015, Dresden, Germany,
September 6-10, 2015.
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan and
Jesper Jensen, “A Binaural Short Time Objective IntelligibilityMeasure
for Noisy and Enhanced Speech,” Interspeech 2015, Dresden, Germany,
September 6-10, 2015.
Mohamed Abou-Zleikha, Zheng-Hua Tan, Mads Græsbøll
Christensen and Søren Holdt Jensen, “Discriminative Approach for Voice
Selection in Speaker De-identification Systtem,” The 23rd European
Signal Processing Conference (EUSIPCO 2015), Nice, France, August 31 –
September 4, 2015.
Xiaodong Duan and Zheng-Hua Tan, "Local Feature Learning for Face
Recognition under Varying Poses," IEEE International Conference on
Image Processing (ICIP 2015), 27-30 September 2015, Quebec City,
Canada.
Xiaodong Duan and Zheng-Hua Tan, "A Feature Subtraction Method
for Image Based Kinship Verification under Uncontrolled Environments,"
IEEE International Conference on Image Processing (ICIP 2015), 27-30
September 2015, Quebec City, Canada.
Clara Schaarup, Gunnar Hartvigsen, Lars Bo Larsen, Zheng-Hua Tan,
Eirik Årsand, and Ole Hejlesen, “Assessing the potential use of
eye-tracking triangulation for evaluating the usability of an online
diabetes exercise system,” The 15th World Congress on Health and
Biomedical Informatics (MEDINFO 2015: eHealth-enabled Health), pp.
84-88, 1August 9-23, 2015, Sao Paulo, Brazil.
Rasmus Lyngby Kristensen, Zheng-Hua Tan, Zhanyu Ma and Jun
Guo, "Binary Pattern Flavored Feature Extractors for Facial Expression
Recognition: An Overview," CIS-MIPRO 2015, 25-29 May 2015,
Opatija, Croatia. PDF
Sven Shepstone, Kong Aik Lee, Haizhou Li, Zheng-Hua Tan and
Søren Holdt Jensen, “Source-Specific Informative Prior for I-Vector
Extraction,” The 40th IEEE International Conference on Acoustics,
Speech and Signal Processing (ICASSP 2015), April 19 – 24, 2015,
Brisbane, Australia. The Ganesh N.
Ramaswamy Memorial Student Grant and Award.
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan and
Jesper Jensen, “Maximum Likelihood Approach to "Informed" Sound Source
Localization for Hearing Aid Applications,” The 40th IEEE International
Conference on Acoustics, Speech and Signal Processing (ICASSP 2015),
April 19 – 24, 2015, Brisbane, Australia.
Mojtaba Farmani, Michael Syskind Pedersen, Zheng-Hua Tan and
Jesper Jensen, “On the Influence of Microphone Array Geometry on
HRTF-Based Sound Source Localization,” The 40th IEEE International
Conference on Acoustics, Speech and Signal Processing (ICASSP 2015),
April 19 – 24, 2015, Brisbane, Australia.
Zheng-Hua
Tan, Nicolai Bæk Thomsen and Xiaodong Duan, "Designing and Implementing
an Interactive Social Robot from Off-the-shelf Components,"The 3rd
IFToMM Symposium on Mechanism Design for Robotics (MEDER2015), June
2-4, 2015, Aalborg, Denmark. PDF
Jesper Jensen and Zheng-Hua Tan, “A Theoretically Consistent
Method
for Minimum Mean-Square Error Estimation of Mel-Frequency Cepstral
Features,” The 4th IEEE International Conference on Network
Infrastructure and Digital Content (IEEE IC-NIDC2014), Beijing, China,
September 19-21, 2014. Best Paper
Award.
Yonggang Qi, Honggang Zhang, Yi-Zhe Song
and Zheng-Hua Tan, "A Patch-based Sparse Representation for Sketch
Recognition," The 4th IEEE International Conference on Network
Infrastructure and Digital Content (IEEE IC-NIDC2014), Beijing, China,
September 19-21, 2014.
Ivan Kraljevski and Zheng-Hua Tan, “Variable Frame Rate and
Length
Analysis for Data Compression in Distributed Speech Recognition,” The
4th IEEE International Conference on Network Infrastructure and Digital
Content (IEEE IC-NIDC2014), Beijing, China, September 19-21, 2014.
Nicolai
Bæk Thomsen, Zheng-Hua Tan, Børge Lindberg and Søren Holdt Jensen,
“Learning Direction of Attention for a Social Robot in Noisy
Environments,” The 3rd AAU Workshop on Robotics (AAUROB2014), Aalborg,
Denmark.
Mohamed Abou-Zleikha, Zheng-Hua Tan,
Søren Holdt Jensen and Mads Græsbøll Christensen, “Cluster-Based
Adaptation Using Density Forest for HMM Phone Recognition,” EUSIPCO 2014 - the 2nd European Signal
Processing Conference, September 1-5, 2014, Lisbon, Portugal.
Hristijan Petreski, Sofia Tsekeridou, Neeli R. Prasad and
Zhen-Hua Tan, “Methodology for Dynamic Learning Resources Discovery and
Retrieval from Social Media,” full paper at EDULEARN 2014 – the 6th
annual International Conference on Education and New Learning
Technologies, Barcelona, July 7-9, 2014.
Mohamed Abou-Zleikha, Zheng-Hua Tan,
Søren Holdt Jensen and Mads Græsbøll Christensen, “Non-linguistic Vocal
Events Detection and Localistion Using Online Random Forest,” MIPRO
2014 – the 37th International Convention, Special Session on BiForD –
Biometrics & Forensics & De-identification and Privacy
Protection, 26-30 May 2014, Opatija, Croatia.
Yonggang Qi, Jun Guo, Yi Li, Honggang Zhang, Tao Xiang, Yi-Zhe
Song and
Zheng-Hua Tan, “Perceptual Grouping via Untangling Gestalt Principles,”
The 2013 IEEE Visual
Communications and Image Processing conference (VCIP), Kuching,
Sarawak, Malaysia, November 17-20, 2013.
Swati Prasad, Zheng-Hua Tan and Ramjee Prasad, “Multistyle
Training
and Fusion for Speaker Identification of Disguised Voice,” The First
International Conference on Communications, Connectivity, Convergence,
Content and Cooperation (IC5), Mumbai, Maharashtra, India, December
16-19, 2013.
Hristijan Petreski, Sofia Tsekeridou, Neeli R.
Prasad and
Zhen-Hua
Tan, "Multi-dimensional technology-enabled social learning approach,” The 7th International Conference on Open
and Distance Learning (ICODL 2013), Athens, Greece, November
8-10, 2013.
Sven Ewan Shepstone, Zheng-Hua Tan and
Søren Holdt Jensen,
“Demographic Recommendation by means of Group Profile Elicitation Using
Speaker Age and Gender Recognition,” Interspeech
2013, Lyon, France, August 25-29, 2013. PDF
Morten Højfeldt Rasmussen and
Zheng-Hua Tan, “Fusing Eye-gaze and Speech Recognition for Tracking in
an Automatic Reading Tutor – A Step in the Right Direction?” Speech and
Language Technology for Education (SLaTE 2013), Grenoble, France -
August 30-31 & September 1st, 2013. PDF
O. Plchot, S. Matsoukas, P. Matejka, N. Dehak, J. Ma, S. Cumani, O. Glembek, H. Hermansky, S.H.
Mallidi, N. Mesgarani, R.
Schwartz, M. Soufifar, Z.-H. Tan, S. Thomas, B. Zhang and X. Zhou,
“Developing a Speaker Identification System for the DARPA RATS
project,” ICASSP 2013 - the 38th International Conference on
Acoustics, Speech, and Signal Processing, Vancouver, Canada, May
26 - 31, 2013. PDF
Swati Prasad, Zheng-Hua Tan and
Ramjee Prasad, “Multi-Frame Rate Based Multiple-Model Training for
Robust Speaker Identification of Disguised Voice,” The 16th
International Symposium on Wireless Personal Multimedia Communications
(WPMC 2013), Atlantic City, New Jersey, USA, June 24-27, 2013.
Zhanyu Ma, Zheng-Hua Tanand SwatiPrasad, "EEG Signal
Classification With Super-Dirichlet Mixture Model," IEEE
Statistical Signal Processing Workshop, Ann Arbor, USA,
Aug 5-8, 2012.
Emanouil Amolochitis, Ioannis T.
Christou and Zheng-Hua Tan, "PUBSEARCH: A
Hierarchical Heuristic Scheme for Ranking Academic Search Results," International
Conference on Pattern Recognition Applications and Methods (ICPRAM 2012),
Vilamoura, Algarve, Portugal, 6-8 Feburary, 2012.
Menelaos Bakopoulos, Sofia
Tsekeridou, Eri Giannaka, Zheng-Hua Tan and Ramjee Prasad , “Mobile
Video Annotation For Enhanced Rich Media Communication During Emergency
Handling,” The 4th International Symposium on Applied Sciences in
Biomedical and Communication Technologies (ISABEL 2011), 26-29October, 2011, Barcelona, Spain.
Swati Prasad, Zheng-Hua Tan,
Ramjee Prasad, Alvaro Fuentes Cabrera, Ying Gu and Kim Dremstrup,
“Feature Selection Strategy for Classification of Single-Trial EEG
Elicited by Motor Imagery,” The 14th International Symposium on
Wireless Personal Multimedia Communications (WPMC 2011), Brest,
France, 3-7 October 2011.
P. Mowlaee , R. Saeidi , Z. -H.
Tan , M. G. Christensen , T. Kinnunen, P.
Fränti, and S. H. Jensen,
"Sinusoidal Approach for the Single-Channel Speech Separation and
Recognition Challenge," Interspeech 2011, Florence, Italy,
27-31 August 2011.
Theodoros
Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan and Ramjee
Prasad, "Multi-Sensor Voice Activity Detection based on Multiple
Observation Hypothesis Testing," Interspeech 2011, Florence,
Italy, 27-31 August 2011.
Morten Højfeldt
Rasmussen, Jack Mostow, Zheng-Hua Tan, Børge Lindberg and Yuanpeng Li,
"Evaluating Tracking Accuracy of an Automatic Reading Tutor," Speech
and Language Technology for Education
(SLaTE 2011), Venice, Italy, 24 - 26
August 2011. PDF
Morten Højfeldt
Rasmussen, Børge Lindberg and Zheng-Hua Tan, "Combining Acoustic and
Language Model Miscue Detection Methods for Dyslexic Read Speech," Speech
and Language Technology for Education
(SLaTE 2011), Venice, Italy, 24 - 26
August 2011. PDF
MenelaosBakopoulos,
SofiaTsekeridou, Eri Giannaka, Zheng-HuaTan, and RamjeePrasad, "Command & Control: Information Merging, Selective
Visualization and Decision Support for Emergency Handling," The
8th International Conference on Information Systems for Crisis Response
and Management, Lisbon, Portugal, May 8-11, 2011.
Pejman Mowlaee, Mads Græsbøll
Christensen, Zheng-Hua Tan, Søren Holdt Jensen, "A MAP Criterion for
Detecting the Number of Speakers at Frame Level in Model-based
Single-Channel Speech Separation," The 44th Annual Asilomar
Conference on Signals, Systems, and Computers, Pacific Grove,
California, USA, November 2010.
Zheng-Hua
Tan, "Machine Perception for Identification and Interaction in the
Internet of Things," invited paper at The 13th International Symposium
on Wireless Personal Multimedia Communications (WPMC 2010), October,
2010, Recife, Brazil. PDF
R.
Saeidi, P. Mowlaee, T. Kinnunen, Z. -H. Tan, M. G. Christensen, S. H.
Jensen, and P. Fränti, “Improving Monaural Speaker Identification by
Double-Talk Detection,” Interspeech 2010, Makuhari, Japan, 26-30 Sep.
2010.
Rahim
Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll
Christensen, Søren Holdt Jensen, Pasi Fränti, "Signal-to-Signal Ratio
Independent Speaker Identification Co-Channel Speech Signals," The 20th International Conference on Pattern
Recognition (ICPR), Istanbul, Turkey, August 2010.
M.
Andersen, R. S. Andersen, N. Katsarakis, A. Pnevmatikakis and Z.-H.
Tan, "Three-Dimensional Adaptive Sensing of People in a Multi-Camera
Setup," invited paper at EUSIPCO 2010 – the 18th European Signal
Processing Conference, Aalborg, Denmark, August 2010. Video demo.
Francesco
Santoro, Sergio Pedro, Zheng-Hua Tan and Thomas B. Moeslund, " Crowd
Analysis by Using Optical Flow and Density Based Clustering," EUSIPCO
2010 – the 18th European Signal Processing Conference, Aalborg,
Denmark, August 2010. Video demo.
Pejman
Mowlaee, Rahim Saiedi, Zheng-Hua Tan, Mads Græsbøll Christensen, Pasi
Franti, Søren Holdt Jensen, "Joint Single-Channel Speech Separation and
Speaker Identification,” ICASSP 2010 - the 35th International Conference on
Acoustics, Speech, and Signal Processing,Dallas, Texas, USA,
March 2010.
Zheng-Hua Tan and Børge
Lindberg, “High-Accuracy, Low-Complexity Voice Activity Detection Based
on A Posteriori SNR Weighted Energy,” Interspeech 2009, Brighton, U.K.,
September 2009.
Morten Højfeldt Rasmussen,
Zheng-Hua Tan, Børge Lindberg and Søren Holdt Jensen, "A System for
Detecting Miscues in Dyslexic Read Speech,” Interspeech 2009, Brighton,
U.K., September 2009.
Zheng-Hua Tan and Borge
Lindberg, “A Posteriori SNR Weighted Energy
Based Variable Frame Rate Analysis for Speech Recognition,”
Interspeech 2008, Brisbane, Australia, September 2008. PDF
Zheng-Hua Tan and Borge
Lindberg, “An Efficient Frame Selection Approach
to Variable Frame Rate Analysis for Noise Robust Speech Recognition,”
Acoustics 2008 (the 155th ASA
meeting), Paris, France, June 2008.
Zheng-Hua
Tan and Borge Lindberg, "A Variable Frame Rate Method for Distributed
Speech Recognition over Wireless Networks,” The 10th International
Symposium on Wireless Personal Multimedia Communications, Jaipur,
India, December 2007.
Zheng-Hua
Tan, “Variable Frame Rate Analysis for Automatic Speech Recognition,” SPIE Multimedia Systems and Applications X,
Boston, MA, USA, September 2007.
Zheng-Hua
Tan and Borge Lindberg, "A Variable Frame Rate Method for Distributed
Speech Recognition over Wireless Networks,” The
10th International Symposium on Wireless Personal Multimedia
Communications, Jaipur, India, December 2007.
Zheng-Hua Tan, Paul Dalsgaard and Borge
Lindberg, "Robust
Speech Recognition over Mobile
Networks Using Combined Weighted Viterbi Decoding and Subvector Based
Error Concealment," Interspeech
2006, Pittsburgh PA, USA, September 2006.PDF
Tom Brøndsted, Lars Bo Larsen, Børge
Lindberg, Morten Rasmussen, Zheng-Hua Tan, Haitian Xu, “Distributed
Speech Recognition for Information Retrieval on Mobile Devices,”
Workshop on Speech in Mobile and Pervasive Environments, Espoo,
Finland, September 2006.
Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, "Adaptive Multi-Frame-Rate Scheme for
Distributed Speech Recognition Based on a Half Frame-Rate Front-End”, IEEE MMSP 2005 – the 7th international workshop on multimedia
signal processing, Shanghai, China, November 2005.PDF
Haitian Xu, Zheng-Hua Tan, Paul Dalsgaard
and Børge Lindberg, “Combined Spectral Subtraction and Cepstral
Normalisation for Robust Speech Recognition”,
ASIDE 2005 - COST278 and
ISCA Tutorial and Research Workshop (ITRW) on Applied Spoken Language
Interaction in Distributed Environments, Aalborg, Denmark,
November 2005.
Tom Brøndsted,
Henrik L. Larsen, Lars B. Larsen, Børge Lindberg, Daniel Ortiz-Arroyo, Zheng-Hua Tan, Haitian Xu, “Mobile Information
Access with Spoken Query Answering”, ASIDE 2005 - COST278 and ISCA Tutorial and Research Workshop (ITRW)
on Applied Spoken Language Interaction in Distributed Environments,
Aalborg, Denmark, November 2005. PDF
Zheng-Hua Tan, Paul Dalsgaard, Borge Lindberg and Haitian Xu, “Robust Speech Recognition in Ubiquitous
Networking and Context-Aware Computing”, Interspeech 2005,
Lisbon, Portugal,
September 2005.PDF
Haitian Xu,
Zheng-Hua Tan, Paul Dalsgaard and Børge Lindberg, “Robust Speech
Recognition Based on Noise and SNR Classification - a Multiple-Model
Framework”, Interspeech 2005, Lisbon, Portugal, September 2005.PDF
Haitian Xu, Zheng-Hua Tan, Paul
Dalsgaard, Ralf Mattethat and Børge Lindberg, “A Configurable
Distributed Speech Recognition System”, Biennial
on DSP for in-Vehicle and Mobile Systems, Sesimbra , Portugal , September 2005. PDF
Zheng-Hua Tan, Paul Dalsgaard and Borge Lindberg, “On the Integration of Speech Recognition into Personal
Networks,” invited paper at ICSLP 2004 - the 8th
International Conference on Spoken Language Processing, Jeju
Island, Korea, October 2004. PDF
Haitian Xu, Zheng-Hua Tan,
PaulDalsgaard and Børge Lindberg, “Spectral
Subtraction with Full-Wave Rectification and Likelihood Controlled
Instantaneous Noise Estimation for Robust Speech Recognition,” ICSLP 2004 - the 8th International Conference on Spoken
Language Processing, Jeju Island , Korea , October 2004. PDF
Zheng-Hua Tan, Borge Lindberg
and Paul Dalsgaard, "A Comparative Study of Feature-Domain Error
Concealment Techniques for Distributed Speech Recognition", Robust 2004 -COST278 and ISCA Tutorial and Research Workshop (ITRW)
on Robustness Issues in Conversational Interaction, Norwich, UK, August 2004.PDF
Zheng-Hua Tan, Paul Dalsgaard
and Borge Lindberg, "A
Subvector-Based Error Concealment Algorithm for Speech Recognition over
Mobile Networks," ProceedingsICASSP 2004 - the 29th International Conference on
Acoustics, Speech, and Signal Processing, Montreal , Quebec
, Canada
, May 2004.PDF
Zheng-Hua Tan, Paul Dalsgaard
and Borge Lindberg, "OOV-Detection And Channel
Error Protection For Distributed Speech Recognition Over Wireless
Networks," ProceedingsICASSP
2003 - the 28th International Conference on Acoustics, Speech, and
Signal Processing, pp. I-336-339, Hong Kong, P R China
April, 2003.PDF
Zheng-Hua
Tanand Paul
Dalsgaard."Channel
Error Protection Scheme for Distributed Speech
Recognition," ProceedingsICSLP 2002 - the 7th International Conference on
Spoken Language Processing, pp. 2225-2228, DenverUSA , September 2002. PDF
Zheng-Hua Tan, Borge Lindberg and Paul
Dalsgaard, "Experiments on A Channel Error
Protection Scheme for Distributed Speech Recognition," Proceedings NORSIG 2002 – the 5th
Nordic Signal Processing Symposium, Norway, October, 2002. PDF
Xiaolin Ren, Guangrui Hu and Zhenghua Tan, "Controlling Chaos in a
Chaotic Neuron," IEEE IECON'99, pp.652-655, San Jose, California, USA,
1999.
Journal papers in Chinese: :
Zheng-Hua
Tan et al., "Modified Miller-Matrix Encoding Method and Its
Application in Evolutionary Artificial Neural Networks," Journal of
Shanghai Jiao Tong University , 2001.
Zheng-Hua
Tan et al., "Study on An Evolutionary Artificial Neural
Network," Nature Magazine, 2000.
Zheng-Hua
Tan et al., "Designing Artificial Neural Networks Through
Evolutionary Programming," Computer Engineering and Applications, 1999,
Vol. 35, No. 10.
Zheng-Hua
Tan et al., "Fuzzy Metagraphs and Its Feature Analysis,"
Computer Research & Development, 2000, Vol. 37, No. 3, pp. 272-277.
Zheng-Hua
Tan et al., "Fuzzy Metagraphs: A New Method of Constructing
Fuzzy Knowledge Base," Control and Decision, 2000, Vol. 15, No. 4, pp.
406-410.
Zheng-Hua
Tan et al., "Fuzzy Metagraph and Its Applications in
Aerocraft Fault Diagnosis," Journal of Shanghai Jiao Tong University,
1999, 33(9), pp.1103-1106.
Zheng-Hua
Tan et al., "Uncertain Knowledge Management in Expert Systems
Using Fuzzy Metagraphs," Journal of Shanghai Jiao Tong University
(English edition), 2000, Vol. 5, No. 2, pp. 6-9.
Zheng-Hua
Tan et al., "The Application of Computational Intelligence in
Fault Diagnosis Expert Systems," Computer Engineering and Applications,
1999, Vol. 35, No. 6, pp.7-10.
Chen
Wei, Hu Guangrui and Zheng-Hua Tan, "Knowledge Association in
Expert System for Fault Diagnosis of Certain Spacecraft," Journal of
Shanghai Jiao Tong University, 2000, Vol. 34, No. 2, pp.241-243.
Zheng-Hua
Tan et al., "Portable Electrometer Based on PIC Series
Singlechip," Chinese Journal of Scientific Instrument, 2000, Vol. 21,
No. 1, pp.78-79, 82.
Zheng-Hua
Tan et al., "The Development of Intelligent Inverter Supply
and the Generation of SPWM Wave by Software," Journal of Shanghai Jiao
Tong University, 2000, Vol. 34, No. 2, pp.273-275.
Zheng-Hua
Tan et al., "Static Measure and Its Programming," Aerospace
Measure Technology, 2000, No.1.
Ren
Xiaolin, Hu Guangrui and Zheng-Hua Tan, "Controlling Chaos in a
Chaotic Neuron," Nature Magazine, 1999, No. 5, pp.308-309.
Ren
Xiaolin, Hu Guangrui and Zheng-Hua Tan, "Controlling Chaos in
Chaotic Neuron by Constant Pulses Method," Journal of Shanghai Jiao
Tong University, 2000, Vol. 34, No. 2, pp.269-272.
Ren
Xiaolin, Hu Guangrui and Zheng-Hua Tan, "Synchronization of chaotic
neural networks and applications in secure communications," Journal of
Shanghai Jiao Tong University, 2000, Vol. 34, No. 6, pp. 744-747.
Zheng-Hua
Tan et al., "Investigation on Some Problems of GAL
Programming Using ABEL Software," Journal of Electrical Engineering
Education, 1995, 17 (Supplemental Issue), 83-85.
Zheng-Hua
Tan et al., "Research on GAL On-line Programming," Journal of
Hunan University, 1995, Vol. 22, Special No. 5, 8-12.
Patent:
Electronicelectrical
energy meter based on IC card (No. ZL 95 2 36331.3).