Approximate Bayesian inference for robust speech processing

Ciira wa Maina

doi:10.17918/etd-3643

Back

Approximate Bayesian inference for robust speech processing

Dissertation

Open access

Approximate Bayesian inference for robust speech processing

Ciira wa Maina

Doctor of Philosophy (Ph.D.), Drexel University

Jun 2011

DOI:

https://doi.org/10.17918/etd-3643

Files and links (1)

pdf

Maina_Ciira_20111.72 MBDownload View

PDFOpen Access (License Unspecified), Open Access

Abstract

Electrical engineering

Bayesian statistical decision theory

Speech processing systems

Speech processing applications such as speech enhancement and speaker identification rely on the estimation of relevant parameters from the speech signal. Theseparameters must often be estimated from noisy observations since speech signals are rarely obtained in 'clean' acoustic environments in the real world. As a result, the parameter estimation algorithms we employ must be robust to environmental factors such as additive noise and reverberation. In this work we derive and evaluate approximate Bayesian algorithms for the following speech processing tasks: 1) speech enhancement 2) speaker identification 3) speaker verification and 4) voice activity detection. Building on previous work in the field of statistical model based speech enhancement, we derive speech enhancement algorithms that rely on speaker dependent priors over linear prediction parameters. These speaker dependent priors allow us to handle speech enhancement and speaker identification in a joint framework. Furthermore, we show how these priors allow voice activity detection to be performed in a robust manner. We also develop algorithms in the log spectral domain with applications in robust speaker verification. The use of speaker dependent priors in the log spectral domain is shown to improve equal error rates in noisy environments and to compensate for mismatch between training and testing conditions.

Metrics

42 File views/ downloads

22 Record Views

Details

Title: Approximate Bayesian inference for robust speech processing
Creators: Ciira wa Maina - DU
Contributors: John MacLaren Walsh (Advisor) - Drexel University (1970-)
Awarding Institution: Drexel University
Degree Awarded: Doctor of Philosophy (Ph.D.)
Publisher: Drexel University; Philadelphia, Pennsylvania
Resource Type: Dissertation
Language: English
Academic Unit: College of Engineering (1970-2026); Electrical (and Computer) Engineering [Historical]; Drexel University
Other Identifier: 3643; 991014632618504721

Approximate Bayesian inference for robust speech processing

Files and links (1)

Abstract

Metrics

Details

Drexel University Social media