Audiovisual Speech Processing
eBook - PDF

Audiovisual Speech Processing

  1. English
  2. PDF
  3. Available on iOS & Android
eBook - PDF

About this book

When we speak, we configure the vocal tract which shapes the visible motions of the face and the patterning of the audible speech acoustics. Similarly, we use these visible and audible behaviors to perceive speech. This book showcases a broad range of research investigating how these two types of signals are used in spoken communication, how they interact, and how they can be used to enhance the realistic synthesis and recognition of audible and visible speech. The volume begins by addressing two important questions about human audiovisual performance: how auditory and visual signals combine to access the mental lexicon and where in the brain this and related processes take place. It then turns to the production and perception of multimodal speech and how structures are coordinated within and across the two modalities. Finally, the book presents overviews and recent developments in machine-based speech recognition and synthesis of AV speech.

Frequently asked questions

Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription.
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
Perlego offers two plans: Essential and Complete
  • Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
  • Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
Both plans are available with monthly, semester, or annual billing cycles.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes! You can use the Perlego app on both iOS or Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Yes, you can access Audiovisual Speech Processing by Gérard Bailly,Pascal Perrier,Eric Vatikiotis-Bateson in PDF and/or ePUB format, as well as other popular books in Languages & Linguistics & Phonetics & Phonology. We have over one million books available in our catalogue for you to explore.

Table of contents

  1. Cover
  2. Audiovisual Speech Processing
  3. Title
  4. Copyright
  5. Dedication
  6. Contents
  7. Figures
  8. Tables
  9. Contributors
  10. Preface
  11. Acknowledgments
  12. Introduction
  13. 1: Three puzzles of multimodal speech perception
  14. 2: Visual speech perception
  15. 3: Dynamic information for face perception
  16. 4: Investigating auditory-visual speech perception development
  17. 5: Brain bases for seeing speech: fMRI studies of speechreading
  18. 6: Temporal organization of Cued Speech production
  19. 7: Bimodal perception within the natural time-course of speech production
  20. 8 Visual and audiovisual synthesis and recognition of speech by computers
  21. 9: Audiovisual automatic speech recognition
  22. 10: Image-based facial synthesis
  23. 11: A trainable videorealistic speech animation system
  24. 12: Animated speech: research progress and applications
  25. 13: Empirical perceptual-motor linkage of multimodal speech
  26. 14: Sensorimotor characteristics of speech production
  27. Notes
  28. References
  29. Index