48,80 €
ISBN 978-3-8440-6845-0
Softcover
158 pages
36 figures
219 g
21 x 14,8 cm
English
Thesis
August 2019
Pablo Gómez
Deep Learning Methods for Processing Endoscopic High-Speed Video and Laryngeal Parameter Estimation
Deep learning methods have had a tremendous impact on computer vision, image processing, and all areas related to these fields. This dissertation explores the application of these methods to the enhancement and processing of endoscopic high-speed video (HSV).

HSV is one of the main techniques used in voice research, as the small-scale, rapid oscillation of the vocal folds requires sophisticated recording techniques. Because voice disorders have been shown to have a tremendous negative impact on the quality of life of those affected and on society in general, a new generation of more objective diagnostic techniques is required. This dissertation features several contributions towards this goal:

  • An innovative method to enhance low-light HSV using an improved U-Net convolutional neural network
  • A robust and fast deep-learning-based automatic method for the segmentation of the glottis in HSV data
  • Development of an improved two-mass model of the vocal folds
  • Proof of concept of estimating ex-vivo subglottal pressure validated on experimental data
  • Proof of concept of estimating subglottal pressure with a recurrent neural network trained on a numerical model
After a thorough introduction to the field of voice research and deep learning, the dissertation describes the developed methods and results in detail. It reports significant improvements in low-light image enhancement, automatic glottis segmentation, and physical voice parameter inference.
Keywords: Deep Learning; High-Speed Videoendoscopy; Neural Networks; Phoniatrics; Image Processing; Automatic Segmentation; Vocal Fold Models; Voice Parameter Estimation; Recurrent Neural Network
Kommunikationsstörungen - Berichte aus Phoniatrie und Pädaudiologie
Edited by Prof. Dr.-Ing. Michael Döllinger; series founded in 1996 by Prof. Dr. Dr. Ulrich Eysholdt, Erlangen-Nürnberg
Volume 27
Available online documents for this title
DOI 10.2370/9783844068450
You need Adobe Reader to view these files.
Please note that the online documents cannot be printed or edited.
 
Document: Document
Type: PDF
Costs: 36,60 €

Document: Table of contents
Type: PDF
Costs: free
Shaker Verlag GmbH
Am Langen Graben 15a
52353 Düren
Germany