116,99 €
Digital Audio Signal Processing
The fully revised new edition of the popular textbook, featuring additional MATLAB exercises and new algorithms for processing digital audio signals
Digital Audio Signal Processing (DASP) techniques are used in a variety of applications, ranging from audio streaming and computer-generated music to real-time signal processing and virtual sound processing.
Digital Audio Signal Processing provides clear and accessible coverage of the fundamental principles and practical applications of digital audio processing and coding. Throughout the book, the authors explain a wide range of basic audio processing techniques and highlight new directions for automatic tuning of different algorithms and discuss state- of-the-art DASP approaches. Now in its third edition, this popular guide is fully updated with the latest signal processing algorithms for audio processing. Entirely new chapters cover nonlinear processing, Machine Learning (ML) for audio applications, distortion, soft/hard clipping, overdrive, equalizers and delay effects, sampling and reconstruction, and more.
Balancing essential concepts and technological topics, Digital Audio Signal Processing, Third Edition remains the ideal textbook for advanced music technology and engineering students in audio signal processing courses. It is also an invaluable reference for audio engineers, hardware and software developers, and researchers in both academia and industry.
Sie lesen das E-Book in den Legimi-Apps auf:
Seitenzahl: 463
Veröffentlichungsjahr: 2022
Udo Zölzer
Helmut Schmidt University Hamburg, Germany
with
Martin Holters, Etienne Gerat, Patrick Nowak, Purbaditya Bhattacharya, Lasse Köper, and Daniel Ahlers
This third edition first published 2022
© 2022 John Wiley & Sons Ltd
Edition History: 1e (1997); 2e (2008)
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by law. Advice on how to obtain permission to reuse material from this title is available at http://www.wiley.com/go/permissions.
The right of Udo Zölzer to be identified as the author of this work has been asserted in accordance with law.
Registered Offices
John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, USA
John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK
Editorial Office
The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK
For details of our global editorial offices, customer services, and more information about Wiley products visit us at www.wiley.com.
Wiley also publishes its books in a variety of electronic formats and by print‐on‐demand. Some content that appears in standard print versions of this book may not be available in other formats.
Limit of Liability/Disclaimer of Warranty
In view of ongoing research, equipment modifications, changes in governmental regulations, and the constant flow of information relating to the use of experimental reagents, equipment, and devices, the reader is urged to review and evaluate the information provided in the package insert or instructions for each chemical, piece of equipment, reagent, or device for, among other things, any changes in the instructions or indication of usage and for added warnings and precautions. While the publisher and author have used their best efforts in preparing this work, they make no representations or warranties with respect to the accuracy or completeness of the contents of this work and specifically disclaim all warranties, including without limitation any implied warranties of merchantability or fitness fora particular purpose. No warranty may be created or extended by sales representatives, written sales materials or promotional statements for this work. The fact that an organization, website, or product is referred to in this work as a citation and/or potential source of further information does not mean that the publisher and author endorse the information or services the organization, website, or product may provide or recommendations it may make. This work is sold with the understanding that the publisher is not engaged in rendering professional services. The advice and strategies contained herein may not be suitable for your situation. You should consult with a specialist where appropriate. Further, readers should be aware that websites listed in this work may have changed or disappeared between when this work was written and when it is read. Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.
Library of Congress Cataloging‐in‐Publication data
Name: Zölzer, Udo, author.
Title: Digital audio signal processing / Udo Zölzer, Helmut Schmidt
University Hamburg, Germany with Martin Holters, Etienne Gerat, Patrick
Nowak, Purbaditya Bhattacharya, Lasse Köper, and Daniel Ahlers.
Other titles: Digitale Audiosignalverarbeitung. English
Description: Third edition. | Hoboken : Wiley, 2022. | Includes
bibliographical references and index.
Identifiers: LCCN 2021059110 (print) | LCCN 2021059111 (ebook) | ISBN
9781119832676 (cloth) | ISBN 9781119832683 (adobe pdf) | ISBN
9781119832690 (epub)
Subjects: LCSH: Sound–Recording and reproducing–Digital techniques. |
Signal processing–Digital techniques.
Classification: LCC TK7881.4 .Z6513 2022 (print) | LCC TK7881.4 (ebook) |
DDC 621.389/3–dc23/eng/20220103
LC record available at https://lccn.loc.gov/2021059110
LC ebook record available at https://lccn.loc.gov/2021059111
Cover Design: Wiley
Cover Image: © whiteMocca/Shutterstock
This 3rd Edition is a revised and extended version of the book. It offers three new chapters (1, 10, and 11) and extensions on a further four chapters (5, 7, 8, and 9). The content of this book is the basis of a course on Digital Audio Signal Processing at the Hamburg University of Technology (TUHH) and a course on Multimedia Signal Processing at the Helmut Schmidt University Hamburg. For further study, you can find lecture slides, exercises, Matlab examples, and interactive audio demonstrations on the website.
https://mydasp.com/dasp.
My thanks go to Dr. Martin Holters for the interactive audio demonstrations and maintaining the website http://www.dafx.de . Thanks also to him and Lasse Köper for their contribution to the new Chapter 10 on “Nonlinear Processing”. I would like to thank Daniel Ahlers for his contributions and updates to Chapter 5 on “Audio Processing Systems”. Purbaditya Bhattacharya and Patrick Nowak made contributions to Chapter 7 on “Room Simulation” and are mainly responsible for the new Chapter 11 on “Machine Learning for Audio”. Purbaditya Bhattacharya also made contributions to Chapter 9 on “Audio Coding”. Thanks to both of them. Finally, I would like to thank Etienne Gerat for his refinements and extensions to Chapter 8 on “Dynamic Range Control”. Last but not least, I would like to thank all participants of the lectures over all these years who have been my driving force:‐).
Udo Zölzer
Hamburg, January 2022
This 2nd Edition represents a revised and extended version, and offers an improved description as well as new issues and extended references. The content of this book is the basis of a course on Digital Audio Signal Processing at the Hamburg University of Technology (TU Hamburg‐Harburg) and a course on Multimedia Signal Processing at the Helmut Schmidt University Hamburg. For further study, you can find interactive audio demonstrations, exercises, and Matlab examples on the website.
http://ant.hsu‐hh.de/dasp/.
In addition to the basics of digital audio signal processing introduced in this 2nd edition, further advanced algorithms for digital audio effects can be found in the book DAFX – Digital Audio Effects (Ed. U. Zölzer) with a related website.
http://www.dafx.de.
My thanks go to Prof. Dieter Leckschat, Dr. Gerald Schuller, Udo Ahlvers, Mijail Guillemard, Christian Helmrich, Martin Holters, Dr. Florian Keiler, Stephan Möller, Francois‐Xavier Nsabimana, Christian Ruwwe, Harald Schorr, Dr. Oomke Weikert, Catja Wilkens, and Christian Zimmermann.
Udo Zölzer
Hamburg, December 2007
Digital audio signal processing is employed in recording and storing music and speech signals, for sound mixing and production of digital programs, in digital transmission to broadcast receivers as well as in consumer products like CDs, DATs, and PCs. In the latter case, the audio signal is in a digital form all the way from the microphone up to the loudspeakers, enabling real‐time processing with fast digital signal processors.
This book provides the basis of an advanced course in Digital Audio Signal Processing which I have been giving since 1992 at the Technical University Hamburg‐Harburg. It is directed at students studying engineering, computer science, and physics but is also for professionals who are looking for solutions to problems in audio signal processing in the fields of studio engineering, consumer electronics, and multimedia. The mathematical and theoretical fundamentals of digital audio signal processing systems will be presented and typical applications with an emphasis on realization aspects will be discussed. Prior knowledge of systems theory, digital signal processing, and multirate signal processing are assumed as a prerequisite.
The book is divided into two parts. The first part (Chapters 1–4) presents a basis for hardware systems used in digital audio signal processing. The second part (Chapters 5–9) discusses algorithms for processing digital audio signals. Chapter 1 describes the course taken by an audio signal from its recording in a studio up to its reproduction at home. Chapter 2 contains a representation of signal quantization, dither techniques, and spectral shaping of quantization errors used for reducing the nonlinear effects of quantization. In the end, a comparison is made between the fixed‐point and floating‐point number representations as well as their associated effects on format conversion and algorithms. Chapter 3 describes methods for AD/DA conversion of signals, starting with Nyquist sampling, methods for oversampling techniques, and delta‐sigma modulation. The chapter closes with a presentation of some circuit design of AD/DA converters. After an introduction to digital signal processors and digital audio interfaces, Chapter 4 describes simple hardware systems based on single‐ and multiprocessor solutions. The algorithms introduced in Chapters 5–9 are, to a great extent, implemented in real‐time on the hardware platforms presented in Chapter 4. Chapter 5 describes digital audio equalizers. Apart from the implementation aspects of recursive audio filters, non‐recursive linear phase filters based on fast convolution and filter banks are introduced. Filter designs, parametric filter structures, and precautions for reducing quantization errors in recursive filters are dealt with in detail. Chapter 6 deals with room simulation. Methods for simulation of artificial room impulse response and methods for approximation of measured impulse responses are discussed. In Chapter 7, the dynamic range control of audio signals is described. These methods are applied at several positions in the audio chain from the microphone to the loudspeakers to adapt to the dynamics of the recording, transmission, and listening environment. Chapter 8 contains a presentation of methods for synchronous and asynchronous sampling rate conversion. Efficient algorithms are described which are suitable for real‐time processing as well as off‐line processing. Both lossless and lossy audio coding are discussed in Chapter 9. Lossless audio coding is applied for the storage of higher word lengths. Lossy audio coding, however, plays a significant role in communication systems.
I would like to thank Prof. Fliege (University of Mannheim), Prof. Kammeyer (University of Bremen), and Prof. Heute (University of Kiel) for comments and support. I am also grateful to my colleagues at the TUHH and especially Dr. Alfred Mertins, Dr. Thomas Boltze, Dr. Bernd Redmer, Dr. Martin Schönle, Dr. Manfred Schusdziarra, Dr. Tanja Karp, Georg Dickmann, Werner Eckel, Thomas Scholz, Rüdiger Wolf, Jens Wohlers, Horst Zölzer, Bärbel Erdmann, Ursula Seifert, and Dieter Gödecke. Additionally, I would like to say a word of gratitude to all those students who helped me in carrying out this work successfully.
Special thanks go to Saeed Khawaja for his help during translation and to Dr. Anthony Macgrath for proof‐reading the text. I also would like to thank Jenny Smith, Colin McKerracher, Ian Stoneham, and Christian Rauscher (Wiley).
My special thanks are directed to my wife Elke and my daughter Franziska.
Udo Zölzer
Hamburg, July 1997
U. Zölzer
In this first chapter, we will introduce the basics of signals and systems, and describe the transmission of signals through these systems [Opp14]. These fundamental concepts and the describing algorithms lay the foundation for digital audio signal processing. We will start with analog signals and analog systems, then we will sample the analog signals and perform digital signal processing, and finally reconstruct an analog output signal from the digital output signal. Figure 1.1 shows a typical audio application of capturing a vocalist and transmission to a loudspeaker via an amplifier for reproduction in another room for a listener or listening audience. The microphone delivers an electrical input signal and the output signal is the signal that will be received by the listener's ear. Both signals are continuous‐time input and output signals. The entire chain of operations from microphone, amplifier, loudspeaker, and sound transmission through the listening room to the listener can be modeled by a system with a continuous‐time impulse response . Such an impulse response can be acquired by an impulse response measurement approach. The entire continuous‐time approach description can also be represented by a discrete‐time approach through sampling the microphone signal , using the discrete‐time impulse response , and then delivering the output signal . Both continuous‐time and discrete‐time signal‐processing techniques [Opp10, Opp14] will be introduced in the following sections.
Continuous‐time signals, as shown in Fig. 1.2, can be used as test signals to analyze the behavior of the response of a physical system to an excitation signal. We need a few simple test signals that will allow for the derivation of all important relations to obtain the input/output description of an input signal transformed to an output signal (handclap acoustical transmission through room received by human ear). The rectangular (rect) function is defined by
Figure 1.1 Audio capturing and reproduction for a listener, and representations of the operations by a signal and system model with input and output signals and by a system represented by an impulse response.
Figure 1.2 Continuous‐time signals , , , , , and .
The Dirac impulse is defined by
The step function is defined by
A general signal can be written using the sampling property of the Dirac impulse as
Continuous‐time systems transform the input to the output . A time‐domain description can be given by the following signal flow graph: . The system parameter inside the box is called the impulse response of the system. It describes the output of the system when the input is the Dirac . Using Eq. (1.4), we can easily derive that the input/output relation of a system with impulse response is given by the integral (sliding the folded impulse response along the input and performing weighting and integration)
which is called continuous‐time convolution. The convolution integral describes a filter operation and is written as . Causality of a system implies for and stability of a system is achieved if the integral of impulse response . A simple example for continuous‐time convolution is demonstrated in Fig. 1.3.
Figure 1.3 Continuous‐time convolution showing the folded version of the impulse response and shifted versions for .
Using the complex exponential as input with , the output is given by the convolution integral as
This shows that for a exponential input , the output is again an exponential signal where the input signal is weighted by the complex number , which is the Fourier transform (integral) of the impulse response , and is also called the frequency response of a continuous‐time system given by
From , we can compute the magnitude response
and the phase response
of a continuous‐time system. For a given signal , we can give its continuous‐time Fourier transform as
The Fourier integral describes a spectral transform from time domain to frequency domain , which is called the Fourier spectrum or Fourier transform of . The inverse continuous‐time Fourier transform is given by
which takes the Fourier spectrum and reconstructs the input . In the following, useful Fourier transform pairs are listed in Eqs. (1.13)–(1.23). An important relation between the time domain , using and giving , and frequency domain , using and giving , shows that convolution in the time domain can be described by multiplication in the frequency domain.
Figure 1.4 shows the Fourier transforms of even and causal rect signals and Fig. 1.5 shows two even sinc signals and their Fourier transforms. The ripple in the passband is based on the truncated length of the sinc signal.
Figure 1.4 Fourier transforms of an even and a causal rect signal. The small imaginary part of the lower left plot arises from a small asymmetry of the rect signal in the upper left plot.
Figure 1.5 Fourier transforms of two even sinc signals.
The extension of the continuous‐time Fourier transform to the Laplace transform allows for the transform of signals and impulse responses where the Fourier transform does not converge but the Laplace transform converges for a given convergence region. This extension of the continuous‐time Fourier transform
is achieved by introducing a real part to the imaginary part according to a new complex variable , which then gives
and thus the Laplace transform
The Laplace transform of signals often leads to a rational function with a numerator polynomial and a denominator polynomial in the variable . The zeros of the numerator are called the zeros of and the zeros of are called the poles of . The rational function can be in given in polynomial, pole/zero, and partial expansion forms.
For digital signal processing, the sampling of with a sampling rate and a sampling interval is performed, which leads to a sequence of numbers with time index . According to the sampling theorem, the input signal must be band limited to . The sampling and the reconstruction of from the number sequence is achieved by the following sequence of operations: . Both operations are performed by an analog‐to‐digital converter (ADC) and a digital‐to‐analog converter (DAC). The converters can be considered as mixed continuous‐time and discrete‐time systems.
Sampling and quantization (analog‐to‐digital conversion) can be described by
where the input is sampled by multiplying it with a series of Dirac impulses giving the ideal sampled and then quantization of the samples to the sequence of numbers with a finite number representation. Figure 1.6 shows in the left column the time‐domain signals involved.
Figure 1.6 Sampling and reconstruction – Time‐domain signals (left column) and corresponding Fourier spectra (right column).
Reconstruction (digital‐to‐analog conversion) of the continuous‐time from the sampled sequence
