ENEE 632: Speech and Audio Processing
Course Goals:
The objective of this course is to study different aspects of the speech communication process and the principles of discrete-time processing of speech and music.
Course Prerequisite(s):
ENEE 620 and ENEE 630.
Topics Prerequisite(s):
Textbook(s):
Reference(s):
- Quatieri, Discrete-time Speech Signal Processing, Prentice-Hall
- Stevens, Acoustic Phonetics, MIT Press
- Flanagan, Speech Analysis, Synthesis, and Perception, Springer-Verlag
- Rabiner & Juang, Fundamentals of Speech Recognition, Prentice-Hall
Core Topics:
- Review of DSP (Chap. 2, Quatieri or Chaps. 6 & 7, Gold and Morgan)
- Discrete-time Fourier Transform, z-Transform and Discrete Fourier Transform
- Upsampling, Downsampling
- Speech Production and Acoustic Phonetics (Chaps. 3 & 4, Quatieri or Chaps. 10 & 11, Gold and Morgan)
- Articulatory Phonetics, Acoustic Theory of Speech Production, Prosody
- Vocal tract Modeling, Discrete-time Modeling of Speech Production
- Music Production (Chap. 12, Gold and Morgan)
- Auditory Perception (Chaps. 14 & 15, Gold and Morgan)
- Peripheral Auditory System
- Psychoacoustics
- Speech Perception (Chap. 3, Quatieri or Chap. 17, Gold and Morgan)
- Signal Processing Techniques (Chaps. 5, 6 & 7, Quatieri or Chaps. 19, 20 & 21, Gold and Morgan)
- Short-time Fourier Transform
- Linear Prediction Analysis of Speech
- Cepstral Analysis of Speech
- Speech Analysis Tools (Chap. 10, Quatieri or Chap. 30, Gold and Morgan)
- Pitch Detection, Formant Tracking
- Music Analysis
- Pitch Detection, Feature Analysis for Recognition
- Speech Coding (Chap. 12, Quatieri or Chaps. 31, 32 & 33, Gold and Morgan)
- Waveform Coding, Model-Based Coding, LPC Residual
- Speech Synthesis (Chap. 6, Quatieri or Chap. 29, Gold and Morgan)
- Music Synthesis (Chap. 32, Gold and Morgan)
Optional Topics:
Course Structure:
Grading Method: