Header

Shop : Details

Shop
Details
45,80 €
ISBN 978-3-8440-7881-7
Softcover
150 pages
39 figures
221 g
29,7 x 21 cm
English
Thesis
March 2021
Johannes Abel
DNN-Based Artificial Bandwidth Extension – Enhancement and Instrumental Assessment of Speech Quality
Speech quality in conventional telephony is degraded, since only a fraction of the original acoustic speech bandwidth is transmitted. Artificial speech bandwidth extension (ABE) is a means to recover missing frequency components to increase speech quality and intelligibility. Whenever larger acoustic speech bandwidths are not available, ABE can serve as fallback solution, since it can be used independently of the communication system.

In this work, ABE approaches have been developed to extend the acoustic bandwidth of speech signals towards higher and lower frequencies in order to increase the perceived speech quality. For the extension of higher frequencies, deep neural networks (DNNs) are employed to establish a link between the available bandwidth and missing high-frequency regions, whereas the extension towards lower frequencies is based on a robust signal model, considering the properties of low-frequency components in speech signals. In subjective listening tests, all of the developed ABE solutions for an extension towards higher and lower frequencies were found to improve the speech quality. Additionally, speech intelligibility and quality could be increased for persons compensating their profound deafness by a cochlear implant using a DNN-based ABE approach.

Furthermore, an instrumental measure for predicting the speech quality of ABE-processed speech signals has been developed, since existing measures are not well suited for this task. Good generalization capabilities of the developed instrumental measure to accurately predict the speech quality of ABE-processed speech were proven in scenarios of unknown speech material, unknown languages, and, most importantly, unknown ABE solutions.
Keywords: Speech Enhancement; Machine Learning; Bandwidth Extension; Speech Quality
Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig
Edited by Prof. Dr.-Ing. U. Reimers, Prof. Dr.-Ing. T. Kürner, Prof. Dr.-Ing. T. Fingscheidt and Prof. Dr.-Ing. Eduard A. Jorswieck, Braunschweig
Volume 66
Export of bibliographic data
Shaker Verlag GmbH
Am Langen Graben 15a
52353 Düren
Germany
  +49 2421 99011 9
Mon. - Thurs. 8:00 a.m. to 4:00 p.m.
Fri. 8:00 a.m. to 3:00 p.m.
Contact us. We will be happy to help you.
Captcha
Social Media