Extraction of words

For segmentation of the words' data from the sound data that we are receiving from the microphone, we are using a simple algorithm. We find the average of the amplitude of the input data over a sampling window (i.e. over a small time interval) compare it with a threshold value. As soon as the average becomes greater than the threshold, we assume the word to have started. And as soon as the average falls below another threshold level, which is lower than the earlier threshold, we assume the word to have ended. In this way we segment the word data into an array which is different from the input array as we do not include the quiet periods. The threshold values were chosen by hit and trial, taking into account the ambient noise levels.
Graphs of words
Back