Speech enhancement in the Karhunen-Loève expansion domain /

Bibliographic Details
Author / Creator:Benesty, Jacob.
Imprint:San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool, c2011.
Description:1 electronic text (ix, 102 p.) : ill., digital file.
Language:English
Series:Synthesis lectures on speech and audio processing, 1932-1678 ; # 7
Synthesis digital library of engineering and computer science.
Synthesis lectures on speech and audio processing, # 7.
Format: E-Resource Book
URL for this record:http://pi.lib.uchicago.edu/1001/cat/bib/10510974
Hidden Bibliographic Details
Other authors / contributors:Chen, J. (Jingdong)
Huang, Yiteng, 1972-
ISBN:9781608456055 (electronic bk.)
9781608456048 (pbk.)
Notes:Part of: Synthesis digital library of engineering and computer science.
Series from website.
Includes bibliographical references (p. 91-95) and index.
Abstract freely available; full-text restricted to subscribers or individual document purchasers.
Compendex
INSPEC
Google scholar
Google book search
Also available in print.
Mode of access: World Wide Web.
System requirements: Adobe Acrobat Reader.
Summary:This book studies the problem of speech enhancement, whose objective is to recover a signal of interest (i.e., speech) from noisy observations. Typically, the recovery is accomplished by passing the noisy observations through a linear filter (or a linear transformation). Since the desired speech and the undesired noise are filtered at the same time, the most critical issue in speech enhancement is how to design an optimal filter that takes full advantage of the difference between the speech and noise statistics, mitigating the noise as much as possible while leaving the perceived speech unchanged from its original form. The optimal filters can be designed either in the time domain or in a transform space. As the title indicates, this book focuses on developing and analyzing optimal filters in the Karhunen-Loève expansion (KLE) domain. We begin by describing the basic problem of speech enhancement and the fundamental principles for solving it in the time domain. We then explain how the problem can be equivalently formulated in the KLE domain. Next, we divide the general problem in the KLE domain into four groups, depending on whether interframe and interband information is taken into account, which leads to four linear models for speech enhancement in the KLE domain. For each model, we introduce signal processing measures to quantify the performance of speech enhancement, discuss the formation of different cost functions, and address the optimization of these cost functions to derive different optimal filters. Theoretical analysis and experiments are provided to study the performance of these filters, and the links between the KLE-domain and time-domain optimal filters are examined.
Standard no.:10.2200/S00326ED1V01Y201101SAP007