Parametric Time-Frequency Domain Spatial Audio

About this book

A comprehensive guide that addresses the theory and practice of spatial audio

This book provides readers with the principles and best practices in spatial audio signal processing. It describes how sound fields and their perceptual attributes are captured and analyzed within the time-frequency domain, how essential representation parameters are coded, and how such signals are efficiently reproduced for practical applications. The book is split into four parts starting with an overview of the fundamentals. It then goes on to explain the reproduction of spatial sound before offering an examination of signal-dependent spatial filtering. The book finishes with coverage of both current and future applications and the direction that spatial audio research is heading in.

Parametric Time-Frequency Domain Spatial Audio focuses on applications in entertainment audio, including music, home cinema, and gaming, and covers the capture and reproduction of spatial sound as well as its generation, transduction, representation, transmission, and perception. The book teaches readers the tools needed for such processing and provides an overview of existing research. It also presents recent projects and commercial applications built on these systems.

  • Provides an in-depth presentation of the principles, past developments, state-of-the-art methods, and future research directions of spatial audio technologies
  • Includes contributions from leading researchers in the field
  • Offers MATLAB codes with selected chapters

An advanced book aimed at readers who are comfortable with the mathematics of digital signal processing and sound field analysis, Parametric Time-Frequency Domain Spatial Audio is best suited for researchers in academia and in the audio industry.

Part I
Analysis and Synthesis of Spatial Sound

1
Time–Frequency Processing: Methods and Tools

Juha Vilkamo¹ and Tom Bäckström²
¹Nokia Technologies, Finland
²Department of Signal Processing and Acoustics, Aalto University, Finland

1.1 Introduction

In most audio applications the purpose is to reproduce sound for human listening, so it is essential to design and optimize systems for perceptual quality. To achieve the best quality with given resources, signal processing often borrows principles from the processes involved in hearing. Broadly speaking, human hearing processes the sound entering the ears in frequency bands (Moore, 1995). Hearing is thus sensitive to the spectral content of the ear canal signals, which changes quickly with time in a complex way. As a result of this frequency-band processing, the ear is not particularly sensitive to small differences in a weaker sound when a stronger masking sound is present near it in frequency and time (Fastl and Zwicker, 2007). A representation of audio signals that gives access to both time and frequency information is therefore a well-motivated choice.
A prerequisite for efficient audio processing methods is a representation of the signal that presents the features relevant to hearing in an accessible form and also allows high-quality playback of signals. Useful properties of such a representation are, for example, that its coefficients have physically or perceptually relevant interpretations, and that the coefficients can be processed independently of each other. The time–frequency domain is such a domain, and it is commonly used in audio processing (Smith, 2011). Spectral coefficients in this domain describe the signal content in terms of frequency components as a function of time, which is an intuitive and unambiguous physical interpretation. Moreover, time–frequency components are approximately uncorrelated, so they can be processed independently and the effect on the output is deterministic. These properties make the spectrum a popular domain for audio processing, and all the techniques discussed in this book use it. The first part of this chapter gives an overview of the theory and practice of the tools typically needed in time–frequency processing of audio channels.
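As a rough illustration of such a representation, the following Python sketch computes a short-time Fourier transform, reconstructs the signal from it, and applies an independent gain to each frequency bin. It is not the book's accompanying code: the function names (`sqrt_hann`, `stft`, `istft`), the 512-sample frame, the square-root Hann window, and the 50% overlap are all assumptions chosen to keep the example short.

```python
import numpy as np

def sqrt_hann(frame_len):
    """Square-root periodic Hann window (assumed choice). Used at both
    analysis and synthesis, its squares sum to one at 50 % overlap."""
    n = np.arange(frame_len)
    return np.sqrt(0.5 - 0.5 * np.cos(2.0 * np.pi * n / frame_len))

def stft(x, frame_len=512):
    """Return complex time-frequency coefficients, shape (frames, bins).
    frame_len is assumed to be even."""
    hop = frame_len // 2
    # Pad so that every input sample is covered by two overlapping frames.
    padded = np.concatenate([np.zeros(hop), x, np.zeros(frame_len)])
    n_frames = (len(padded) - frame_len) // hop + 1
    frames = np.stack([padded[i * hop : i * hop + frame_len]
                       for i in range(n_frames)])
    return np.fft.rfft(frames * sqrt_hann(frame_len), axis=1)

def istft(X, n_samples, frame_len=512):
    """Overlap-add synthesis matching stft() above."""
    hop = frame_len // 2
    frames = np.fft.irfft(X, n=frame_len, axis=1) * sqrt_hann(frame_len)
    out = np.zeros(hop * (len(frames) - 1) + frame_len)
    for i, frame in enumerate(frames):
        out[i * hop : i * hop + frame_len] += frame
    # Drop the padding that stft() added in front of the signal.
    return out[hop : hop + n_samples]

# Test signal: white noise (arbitrary choice for the demonstration).
rng = np.random.default_rng(0)
x = rng.standard_normal(4000)

X = stft(x)
y = istft(X, len(x))
reconstruction_error = np.max(np.abs(y - x))

# Process the coefficients independently: halve the upper half of the bins.
gains = np.ones(X.shape[1])
gains[X.shape[1] // 2 :] = 0.5
z = istft(X * gains, len(x))
```

With unmodified coefficients the output matches the input to within floating-point rounding, which corresponds to the high-quality playback requirement; the per-bin gains show one simple way in which coefficients can be processed independently.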
The time–frequency domain is also useful when processing the spatial characteristics of sound, for example in microphone array processing. Differences in directions of arrival of wavefronts are visible as differences in time of arrival and amplitude between microphone signals. When the microphone signals are transformed to the time–frequency domain, the differences directly correspond to differences in phase and magnitude in a similar fashion to the way spatial cues used by a human listener are encoded in the ear canal signals (Blauert, 1997). The tim...
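The relation between inter-microphone differences and phase and magnitude differences can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not a method taken from the book: it uses a single DFT frame rather than a frame-based transform, models the delay as a circular shift so that the relation holds exactly, and the sample rate, frame length, delay, and gain values are hypothetical.

```python
import numpy as np

fs = 48000   # sample rate in Hz (assumed)
n = 1024     # frame length in samples (assumed)
delay = 5    # inter-microphone delay in samples (assumed)
gain = 0.7   # amplitude ratio between the microphones (assumed)

rng = np.random.default_rng(1)
mic1 = rng.standard_normal(n)
# A circular shift keeps the delay/phase relation exact within one frame.
mic2 = gain * np.roll(mic1, delay)

X1 = np.fft.rfft(mic1)
X2 = np.fft.rfft(mic2)

# Skip DC and Nyquist, where the spectrum is real-valued.
bins = np.arange(1, n // 2)
cross = X2[bins] * np.conj(X1[bins])

# A delay of d samples appears as the phase -2*pi*k*d/n in bin k,
# so the slope of the unwrapped phase over the bins gives the delay.
phase = np.unwrap(np.angle(cross))
slope = np.polyfit(bins, phase, 1)[0]
delay_est = -slope * n / (2.0 * np.pi)
delay_est_seconds = delay_est / fs

# The amplitude difference appears as a magnitude ratio in every bin.
level_ratio = np.abs(X2[bins]) / np.abs(X1[bins])
```

Here `delay_est` recovers the delay in samples from the phase slope, and `level_ratio` equals the assumed gain in every bin. With real recordings, noise and reverberation would make both quantities vary across time and frequency.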

Table of contents

  1. Cover
  2. Title page
  3. Copyright
  4. List of Contributors
  5. Preface
  6. About the Companion Website
  7. Part I Analysis and Synthesis of Spatial Sound
  8. Part II Reproduction of Spatial Sound
  9. Part III Signal-Dependent Spatial Filtering
  10. Part IV Applications
  11. Index
  12. EULA

Parametric Time-Frequency Domain Spatial Audio, by Ville Pulkki, Symeon Delikaris-Manias, and Archontis Politis, is available in PDF and/or ePUB format in the Technology & Engineering and Electrical Engineering & Telecommunications categories.