Simon ReceveurContributions to Turbo Automatic Speech Recognition | |||||||
| |||||||
ISBN: | 978-3-8440-7756-8 | ||||||
Series: | Mitteilungen aus dem Institut für Nachrichtentechnik der Technischen Universität Braunschweig Herausgeber: Prof. Dr.-Ing. U. Reimers, Prof. Dr.-Ing. T. Kürner and Prof. Dr.-Ing. T. Fingscheidt Braunschweig | ||||||
Volume: | 63 | ||||||
Keywords: | Speech Recognition; Decoding; Digital communication; Hidden Markov models; Iterative decoding; Convolutional codes; Speech; Acoustics | ||||||
Type of publication: | Thesis | ||||||
Language: | English | ||||||
Pages: | 272 pages | ||||||
Figures: | 29 figures | ||||||
Weight: | 405 g | ||||||
Format: | 21 x 14,8 cm | ||||||
Bindung: | Paperback | ||||||
Price: | 49,80 € / 62,30 SFr | ||||||
Published: | December 2020 | ||||||
Buy: | |||||||
Recommendation: | You want to recommend this title? | ||||||
Review copy: | Here you can order a review copy. | ||||||
Link: | You want to link this page? Click here. | ||||||
Export citations: |
|
||||||
Abstract: | Be it Siri or Amazon Echo - automatic speech recognition is making its way into our lives and despite astonishing improvements in recognition in general, it is still far from being as good as human speech comprehension. In order to open up possible paths for more robust and possibly distributed speech recognition systems, the PhD thesis "Contributions to Turbo Automatic Speech Recognition" deals with a novel method for iterative optimal information fusion. A fusion is always necessary and profitable when different information sources are to be combined in a statistically optimal way. This can be the combination of audio (speech recognition) and video (lip reading), but also the combination of two similar sensors (two microphones, or for humans the right and left ear). The chosen approach represents the consequent application of the turbo code principle known from communications to questions of automatic speech recognition with multiple data streams. As a major innovation, the PhD thesis presents a so-called modified Viterbi algorithm, which provides a novel information representation for iterative feedback. Two individual recognizers repeatedly evaluate their respective input signal of the underlying speech utterance and exchange information from iteration to iteration, thus moving step by step towards a jointly improved recognition result. |