11-756 / 18799D Design and Implementation of ASR Systems

11-756/18799D ASR: Assignment 1, Feature Computation

Problem

Write a routine for computing MFCC from audio

Some suggestions

You are allowed to use code from the web

However, we recommend doing your own code if you can.

Regardless of what you use, the feature computation code must be integrated with the audio capture routine.

How to visualize the spectrogram represented by cepstra

The Mel-log spectrum can be directly visualized as a matrix.

However, the cepstrum is a dimensionality-reduced and transformed version of the log spectrum. It is not visually meaningful. However, the truncated cepstrum can be converted back to a log spectrum by zeropadding it to 64 or 128 poitns and computing an inverse DCT (if you used a DCT to derive cepstra from log spectra). The IDCT-derived logspectrum is what the cepstrum really represents.

Due: Wednesday, 8 Feb 2011.