Information Package / Course Catalogue
Automatic Speech Recognition and Synthesis
Course Code: MTK639
Course Type: Area Elective
Couse Group: Third Cycle (Doctorate Degree)
Education Language: Turkish
Work Placement: N/A
Theory: 3
Prt.: 0
Credit: 3
Lab: 0
ECTS: 8
Objectives of the Course

In this course, the subjects of Speech Recognition and Synthesis will be taught and the applications about the course will be developed. Firstly, speech production and acoustic modeling will be presented. Then, The methods of Speech Recognition and Synthesis will be explained.

Course Content

Acoustic Theory of Speech Production, Human hearing, acoustics, and phonetics. Signal Representation, Vector Quantization. Speech spectrum analysis (Fourier analysis, cepstral analysis, spectrogram reading). Fundamental frequency analysis (F0 estimation, prosody models). Speech synthesis. Linear Prediction (all-pole model, LPC, PARCOR, LSP analysis). Learning algorithms and application (Viterbi algorithm, Bayes’ Theorem). Speech coding (waveform coding, PCM, LPC). Dynamic time warping and acoustic modeling. Hidden Markov Modeling, expectation-maximization, and search. Language Modeling. Graphical Models. Segment-Based ASR, Finite State Transducers.

Name of Lecturer(s)