National Institute of Technology Rourkela

राष्ट्रीय प्रौद्योगिकी संस्थान राउरकेला

ଜାତୀୟ ପ୍ରଯୁକ୍ତି ପ୍ରତିଷ୍ଠାନ ରାଉରକେଲା

An Institute of National Importance

Syllabus

Course Details

Subject {L-T-P / C} : EE6145 : Digital Speech Processing {3-0-0 / 3}

Subject Nature : Theory

Coordinator : Prasanna Kumar Sahu

Syllabus

Module 1 (4 Hours)
Fundamentals of Speech:
Introduction to Human Speech; Parameters of Speech (Pitch Frequency, Pitch in the Cepstral Domain, Pitch Period Measurement using the Cepstral Domain, etc.)

Module 2 (6 Hours)
Spectral Parameters of Speech:
Mel Frequency Cepstral Coefficients, Perceptual Linear Prediction, Wavelet Transform Analysis of Speech, Linear Prediction of Speech

Module 3 (8 Hours)
Speech Quantization and Coding

Module 4 (8 Hours)
Speech Processing Applications

Module 5 (10 Hours)
Speech Synthesis:
Text-to-Speech Systems, Synthesizer Technologies, Speech Synthesis using Other Methods, Emotion Recognition from Speech, Watermarking for Authentication of a Speech/Music Signal, etc.

Course Objective

1. To provide students with knowledge of the basic characteristics of the speech signal in relation to human speech production and hearing.
2. To describe basic speech analysis algorithms common to many applications.
3. To give an overview of applications (recognition, synthesis, coding) and to cover practical aspects of implementing speech algorithms.
4. To give an overview of speech processing applications, including speech enhancement, speech recognition, and speaker recognition.

Course Outcome

1. Students will become familiar with the basic characteristics of speech signals as they relate to human speech production and hearing.
2. They will understand basic speech analysis algorithms common to many applications.
3. They will gain an overview of applications (recognition, synthesis, coding) and of the practical aspects of implementing speech algorithms.
4. Students will be able to design a simple speech processing system (a speech activity detector or a recognizer for a limited number of isolated words) and implement it in application programs.
5. Students will be able to understand the applications of speech processing, including speaker recognition and speech recognition.

Essential Reading

1. Ben Gold, Nelson Morgan, and Dan Ellis, Speech and Audio Signal Processing: Processing and Perception of Speech and Music, John Wiley & Sons, 2nd Edition, August 2011
2. S. D. Apte, Speech and Audio Processing, Wiley India, 2015

Supplementary Reading

1. Lawrence R. Rabiner and Biing-Hwang Juang, Fundamentals of Speech Recognition, Prentice Hall International, 1993
2. Jacob Benesty, M. Mohan Sondhi, and Yiteng Huang, Handbook of Speech Processing, Springer, 2007