Significance of Frame Size
Overlapping Frames
8000Hz = 8000 samples for 1 second of speech
Sampling Rate
Number of samples per second in Hz
Sample Period
1/sampling rate
Seconds per sample
Frame Length
Frame Shift
Windowing Purpose
Window Functions
Energy
Normalised Energy
Removes sensitivity based on no. of samples in analysis frame
Zero-Crossing Rate
Speech/Non-speech detection
Simple version can be constructed using short-time energy (high in voiced speech) and zero-crossing rate (high in unvoiced speech)
Auto-Correlation
Auto-Correlation Function
r[k] = 1/N sum (s[i] * s[i+k])
Cepstral Analysis
Pitch Estimation with Cepstrum
Cepstrum - compute log spectrum of relationship
pseudo-frequency domain
Quefrency graph - find peak cepstral value in high quefrency components
Frame Rate
fr = 1/RT