Hauptmenü
  • Autor
    • Leitner, Christina
    • Pernkopf, Franz
  • TitelOn pre-image iterations for speech enhancement
  • Volltext
  • DOI10.1186/s40064-015-0983-x
  • Persistent Identifier
  • Erschienen inSpringerPlus
  • Band4
  • Erscheinungsjahr2015
  • Heft1
  • LicenceCC-BY
  • ISSN2193-1801
  • ZugriffsrechteCC-BY
  • Download Statistik122
  • Peer ReviewNein
  • AbstractIn this paper, we apply kernel PCA for speech enhancement and derive pre-image iterations for speech enhancement. Both methods make use of a Gaussian kernel. The kernel variance serves as tuning parameter that has to be adapted according to the SNR and the desired degree of de-noising. We develop a method to derive a suitable value for the kernel variance from a noise estimate to adapt pre-image iterations to arbitrary SNRs. In experiments, we compare the performance of kernel PCA and pre-image iterations in terms of objective speech quality measures and automatic speech recognition. The speech data is corrupted by white and colored noise at 0, 5, 10, and 15 dB SNR. As a benchmark, we provide results of the generalized subspace method, of spectral subtraction, and of the minimum mean-square error log-spectral amplitude estimator. In terms of the scores of the PEASS (Perceptual Evaluation Methods for Audio Source Separation) toolbox, the proposed methods achieve a similar performance as the reference methods. The speech recognition experiments show that the utterances processed by pre-image iterations achieve a consistently better word recognition accuracy than the unprocessed noisy utterances and than the utterances processed by the generalized subspace method.