Speech intelligibility and quality: a comparative study of speech enhancement algorithms.
MetadataShow full item record
Mobile devices are widely used today for speech communication. The environments in which these devices are used are widely varied and often the level of background noise in the speaker's environment can be significant. The purpose of speech enhancement is to reduce the level of background noise, ideally to such a level that it is not noticed by the listener. While speech enhancement algorithms can significantly reduce the noise level in a speech signal, improving speech quality, it is widely recognized that enhancement algorithms can have a negative impact on speech intelligibility. This paper compares the effect of three different speech enhancement algorithms on the intelligibility and the quality of speech. This work is the initial phase of an investigation into mitigating the impact of speech enhancement algorithms on speech intelligibility. The speech enhancement algorithms evaluated each use different approaches for noise reduction, namely, a statistical model-based algorithm, a noise estimation algorithm and a wavelet packet decomposition-based algorithm. Two objective speech intelligibility measurements and three objective speech quality measurements are used to assess the performance of the enhancement algorithms. The results of the experiments show that all the speech enhancement algorithms in this study have a negative impact on speech intelligibility to varying degrees.
The following license files are associated with this item:
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Ireland
Showing items related by title, author, creator and subject.
Costello, Gabriel J.; Donnellan, Brian (2007)The growth and diffusion of self-service technology (SST) over the last decade has resulted in an increasing number of business and government transactions being completed without human assistance. One innovation in this ...
Costello, Gabriel J. (2006)Speech enabled business applications are characterized by complex implementations that bring together language processing technologies, applications development, and end-user psychology. Resilience is critical to maintaining ...
Comparing user QoE via physiological and interaction measurements of immersive AR and VR speech and language therapy applications Keighrey, Conor; Flynn, Ronan; Brennan, Sean; Murray, Siobhan; Murray, Niall (Athlone Institute of Technology, 2018-04-24)Virtual reality (VR) and augmented reality (AR) applications are gaining significant attention in industry and academia as potential avenues to support truly immersive and interactive multimedia experiences. Understanding ...