Comparison of Speech Enhancement Algorithms

Downloads

Downloads per month over past year

Siddala Vihari, . and Sreenivasa Murthy, A. and Priyanka Soni, . and Naik, D.C (2016) Comparison of Speech Enhancement Algorithms. Procedia Computer Science, 89. pp. 666-676. ISSN 1877-0509

[img]
Preview
Text
v.pdf - Published Version

Download (769kB) | Preview
Official URL: https://doi.org/10.1016/j.procs.2016.06.032

Abstract

The simplest and very familiar method to take out stationary background noise is spectral subtraction. In this algorithm, a spectral noise bias is calculated from segments of speech inactivity and is subtracted from noisy speech spectral amplitude, retaining the phase as it is. Secondary procedures follow spectral subtraction to reduce the unpleasant auditory effects due to spectral error. The drawback of spectral subtraction is that it is applicable to speech corrupted by stationary noise. The research in this topic aims at studying the spectral subtraction & Wiener filter technique when the speech is degraded by non-stationary noise. We have studied both algorithms assuming stationary noise scenario. In this we want to study these two algorithms in the context of non-stationary noise. Next, decision directed (DD) approach, is used to estimate the time varying noise spectrum which resulted in better performance in terms of intelligibility and reduced musical noise. However, the a priori SNR estimator of the current frame relies on the estimated speech spectrum from the earlier frame. The undesirable consequence is that the gain function doesn’t match the current frame, resulting in a bias which causes annoying echoing effect. A method called Two-step noise reduction (TSNR) algorithm was used to solve the problem which tracks instantaneously the non-stationarity of the signal but, not by losing the advantage of the DD approach. The a priori SNR estimation was modified and made better by an additional step for removing the bias, thus eliminating reverberation effect. The output obtained even with TSNR still suffers from harmonic distortions which are inherent to all short time noise suppression techniques, the main reason being the inaccuracy in estimating PSD in single channel systems. To outdo this problem, a concept called, Harmonic Regeneration Noise Reduction (HRNR) is used wherein a non-linearity is made use of for regenerating the distorted/missing harmonics. All the above discussed algorithms have been implemented and their performance evaluated using both subjective and objective criteria. The performance is significantly improved by using HRNR combined with TSNR, as compared to TSNR, DD alone, as HRNR ensures restoration of harmonics. The spectral subtraction performance stands much below the above discussed methods for obvious reasons.

Item Type: Article
Uncontrolled Keywords: Decision Directed Approach; Harmonic Regeneration; Speech Enhancement; Two-step Noise Reduction; Wiener
Subjects: Faculty of Engineering > Computer Science & Information Science Engineering
Divisions: University Visvesvarayya College of Engineering > Department of Computer Science and Information Science Engineering
Depositing User: Mr. Vasu K
Date Deposited: 17 Oct 2016 10:44
Last Modified: 17 Oct 2016 10:44
URI: http://eprints-bangaloreuniversity.in/id/eprint/6490

Actions (login required)

View Item View Item