We may earn an affiliate commission when you visit our partners.
Course image
Course image
Coursera logo

Audio Signal Processing for Music Applications

Xavier Serra and Prof Julius O Smith, III

In this course you will learn about audio signal processing methodologies that are specific for music and of use in real applications. We focus on the spectral processing techniques of relevance for the description and transformation of sounds, developing the basic theoretical and practical knowledge with which to analyze, synthesize, transform and describe audio signals in the context of music applications.

Read more

In this course you will learn about audio signal processing methodologies that are specific for music and of use in real applications. We focus on the spectral processing techniques of relevance for the description and transformation of sounds, developing the basic theoretical and practical knowledge with which to analyze, synthesize, transform and describe audio signals in the context of music applications.

The course is based on open software and content. The demonstrations and programming exercises are done using Python under Ubuntu, and the references and materials for the course come from open online repositories. We are also distributing with open licenses the software and materials developed for the course.

Enroll now

What's inside

Syllabus

Introduction
Introduction to the course, to the field of Audio Signal Processing, and to the basic mathematics needed to start the course. Introductory demonstrations to some of the software applications and tools to be used. Introduction to Python and to the sms-tools package, the main programming tool for the course.
Read more
Discrete Fourier transform
The Discrete Fourier Transform equation; complex exponentials; scalar product in the DFT; DFT of complex sinusoids; DFT of real sinusoids; and inverse-DFT. Demonstrations on how to analyze a sound using the DFT; introduction to Freesound.org. Generating sinusoids and implementing the DFT in Python.
Fourier theorems
Linearity, shift, symmetry, convolution; energy conservation and decibels; phase unwrapping; zero padding; Fast Fourier Transform and zero-phase windowing; and analysis/synthesis. Demonstration of the analysis of simple periodic signals and of complex sounds; demonstration of spectrum analysis tools. Implementing the computation of the spectrum of a sound fragment using Python and presentation of the dftModel functions implemented in the sms-tools package.
Short-time Fourier transform
STFT equation; analysis window; FFT size and hop size; time-frequency compromise; inverse STFT. Demonstration of tools to compute the spectrogram of a sound and on how to analyze a sound using them. Implementation of the windowing of sounds using Python and presentation of the STFT functions from the sms-tools package, explaining how to use them.
Sinusoidal model
Sinusoidal model equation; sinewaves in a spectrum; sinewaves as spectral peaks; time-varying sinewaves in spectrogram; sinusoidal synthesis. Demonstration of the sinusoidal model interface of the sms-tools package and its use in the analysis and synthesis of sounds. Implementation of the detection of spectral peaks and of the sinusoidal synthesis using Python and presentation of the sineModel functions from the sms-tools package, explaining how to use them.
Harmonic model
Harmonic model equation; sinusoids-partials-harmonics; polyphonic-monophonic signals; harmonic detection; f0-detection in time and frequency domains. Demonstrations of pitch detection algorithm, of the harmonic model interface of the sms-tools package and of its use in the analysis and synthesis of sounds. Implementation of the detection of the fundamental frequency in the frequency domain using the TWM algorithm in Python and presentation of the harmonicModel functions from the sms-tools package, explaining how to use them.
Sinusoidal plus residual model
Stochastic signals; stochastic model; stochastic approximation of sounds; sinusoidal/harmonic plus residual model; residual subtraction; sinusoidal/harmonic plus stochastic model; stochastic model of residual. Demonstrations of the stochastic model, harmonic plus residual, and harmonic plus stochastic interfaces of the sms-tools package and of its use in the analysis and synthesis of sounds. Presentation of the stochasticModel, hprModel and hpsModel functions implemented in the sms-tools package, explaining how to use them.
Sound transformations
Filtering and morphing using the short-time Fourier transform; frequency and time scaling using the sinusoidal model; frequency transformations using the harmonic plus residual model; time scaling and morphing using the harmonic plus stochastic model. Demonstrations of the various transformation interfaces of the sms-tools package and of Audacity. Presentation of the stftTransformations, sineTransformations and hpsTransformations functions implemented in the sms-tools package, explaining how to use them.
Sound and music description
Extraction of audio features using spectral analysis methods; describing sounds, sound collections, music recordings and music collections. Clustering and classification of sounds. Demonstration of various plugins from SonicVisualiser to describe sound and music signals and demonstration of some advance features of freesound.org. Presentation of Essentia, a C++ library for sound and music description, explaining how to use it from Python. Programming with the Freesound API in Python to download sound collections and to study them.
Concluding topics
Audio signal processing beyond this course. Beyond audio signal processing. Review of the course topics. Where to learn more about the topics of this course. Presentation of MTG-UPF. Demonstration of Dunya, a web browser to explore several audio music collections, and of AcousticBrainz, a collaborative initiative to collect and share music data.
Concluding topics: Lesson Choices

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Taught by recognized instructors in this field: Xavier Serra, Prof Julius O Smith, III
Develops core skills in spectral processing techniques
Uses freely available software and materials
Focuses on music related applications of audio signal processing
Introduces students to relevant open software libraries such as Essentia and STOMP
Provides interactive materials with demonstrations of various signal processing techniques

Save this course

Save Audio Signal Processing for Music Applications to your list so you can find it easily later:
Save

Reviews summary

Audio signal processing for music enthusiasts

Learners say this engaging course provides a comprehensive introduction to audio signal processing. It particularly benefits those with a background in Python programming, mathematics, or music. Students appreciate the clear explanations of theory and the hands-on exercises that solidify concepts. While the programming assignments can be challenging, they are an opportunity for significant learning. Overall, it's a rewarding course for those interested in pursuing computational musicology.
Course introduces current research in computational musicology.
"It also introduces you to...current research in computational musicology."
Programming assignments provide a steep learning curve.
"Assignments were challenging, but they make for a great learning experience."
Practical exercises reinforce theoretical concepts.
"A lot of excercies and practices that helps you to solidify the concepts."
"It makes Python programming approachable (basic previous knowledge is needed) with hands-on exercises."
Theory lectures provide easy-to-understand explanations.
"Excellent course with great explanations of the theory..."
"The lectures walk you through the theory, the application..."
Assumes background knowledge in Python, mathematics, and possibly music.
"As the course page mentions, this is an intermediate level course."
"It doesn't cover filter design and other signal processing topics in detail."

Activities

Coming soon We're preparing activities for Audio Signal Processing for Music Applications. These are activities you can do either before, during, or after a course.

Career center

Learners who complete Audio Signal Processing for Music Applications will develop knowledge and skills that may be useful to these careers:
Data Scientist
Data Scientists use data to solve problems and make predictions. This course may be useful for aspiring Data Scientists by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Data Scientist in the field of audio signal processing.
Software Engineer
Software Engineers design, develop, and maintain software systems. This course may be useful for aspiring Software Engineers by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Software Engineer in the field of audio signal processing.
Professor
Professors teach and conduct research at colleges and universities. This course may be useful for aspiring Professors by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Professor in the field of audio signal processing.
Speech-Language Pathologist
Speech-Language Pathologists assess and treat speech, language, and swallowing disorders. This course may be useful for aspiring Speech-Language Pathologists by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Speech-Language Pathologist.
Audio Engineer
As an Audio Engineer, you will be responsible for recording, mixing, and mastering audio for a variety of purposes, such as music, film, and television. This course may help you develop your skills as an Audio Engineer by providing you with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. You will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of an Audio Engineer.
Researcher
Researchers conduct scientific research in a variety of fields, including audio signal processing. This course may be useful for aspiring Researchers by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Researcher in the field of audio signal processing.
Acoustical Engineer
Acoustical Engineers use their knowledge of sound and vibration to design and improve the acoustic environment in buildings, vehicles, and other spaces. This course may be useful for aspiring Acoustical Engineers by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of an Acoustical Engineer.
Audiologist
Audiologists assess and treat hearing and balance disorders. This course may be useful for aspiring Audiologists by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of an Audiologist.
Musician
Musicians create, perform, and teach music. This course may be useful for aspiring Musicians by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Musician.
Sound Designer
In the role of a Sound Designer, you will be responsible for creating, editing and mixing sound effects and music for film, television, radio, video games and other media. An understanding of audio signal processing is essential for a Sound Designer, and this course may help you build that understanding by teaching you the relevant spectral processing techniques used to describe and transform sounds. It also covers the sinusoidal and harmonic models used in sound synthesis, as well as sound transformations and sound and music description, all of which are relevant to the work of a Sound Designer.
Music Producer
Music Producers are responsible for overseeing all aspects of music production, from recording and editing to mixing and mastering. This course may be useful for aspiring Music Producers by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Music Producer.
Music Therapist
Music Therapists use music to help people improve their physical, emotional, and mental health. This course may be useful for aspiring Music Therapists by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Music Therapist.
Composer
Composers create original music. This course may be useful for aspiring Composers by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Composer.
Music Teacher
Music Teachers teach music to students of all ages. This course may be useful for aspiring Music Teachers by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Music Teacher.
Music Technologist
Music Technologists use their knowledge of music, technology, and acoustics to design, develop, and maintain music technology systems. This course may be useful for aspiring Music Technologists by providing them with a foundation in audio signal processing techniques, including the discrete Fourier transform, Fourier theorems, and short-time Fourier transform. They will also learn about sinusoidal and harmonic models, as well as sound transformations and sound and music description, all of which are relevant to the work of a Music Technologist.

Reading list

We've selected 16 books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Audio Signal Processing for Music Applications.
This textbook covers a wide range of topics in sound and music computing, including sound synthesis, processing, and analysis, which would complement this course.
This textbook provides a comprehensive foundation for learning about various signal processing techniques used in audio engineering. Thus, it can be used as a background before taking this course.
Provides a comprehensive overview of computer music, including topics such as sound synthesis, processing, and composition, which would provide additional context for this course.
Provides a comprehensive overview of algorithmic music, including topics such as music generation, analysis, and performance, which would complement this course.
Provides a comprehensive overview of music signal processing, covering topics such as digital audio, audio feature extraction, and music information retrieval. It offers a solid foundation for understanding the fundamental concepts and algorithms used in this field.
Provides a detailed exploration of sound synthesis and sampling techniques, covering topics such as oscillators, filters, and granular synthesis. It offers a practical approach to sound design and discusses the latest developments in the field.
Provides a practical guide to audio programming in Python, which could be used as a reference for students interested in implementing audio signal processing algorithms.
Offers a mathematical perspective on music theory, exploring topics such as pitch, rhythm, harmony, and form. It provides a deep understanding of the mathematical principles underlying musical structures.
This textbook covers a variety of audio effects used in music production, which would provide practical insights to complement this course.
Provides a comprehensive introduction to digital signal processing using MATLAB, which could be used as a reference for students interested in the implementation of audio signal processing algorithms.
Provides a practical guide to designing and implementing audio effect plugins in C++, which could be used as a reference for students interested in the technical aspects of audio signal processing.
Provides a comprehensive overview of the field of computer music, covering topics such as sound synthesis, algorithmic composition, and interactive performance systems. It offers a historical perspective and discusses the latest advances in the field.
This textbook is more focused on the mathematics and algorithms used in audio signal processing and coding, which could be used as a reference.
This textbook provides an in-depth treatment of advanced signal processing techniques, including noise reduction, which could be used as a reference for students interested in more advanced topics.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Audio Signal Processing for Music Applications.
Game Music Composition: Make Music For Games From Scratch
Most relevant
The Technology of Music Production
Most relevant
Ableton Live 11 - Music Production in Ableton Live
Most relevant
FL Studio 21 - Music Production In FL Studio 21 for Mac &...
Most relevant
Ableton Live 12 - Music Production in Ableton Live
Most relevant
Audio Engineering Fundamentals
YouTube Creator Course: Audio & Video Production for...
Music + Audio Production in Logic Pro X - The Complete...
Avid Pro Tools: Audio Recording For Beginners In Pro Tools
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser