VAD Voice activity detection (also known as speech activity detection or, more simply, speech detection) is a technique used in speech processing wherein the presence or absence of human speech is detected in regions of audio (which may also contain music, noise, or other sound) . The main uses of VAD are in speech coding and speech recognition. It can facilitate speech processing, and can also be used to deactivate some processes during non-speech segments: it can avoid unnecessary coding/transmission of silence packets in VOIP, saving on computation and on network bandwidth.
VAD is an important enabling technology for a variety of speech-based applications. Therefore various VAD algorithms have been proposed that provide different compromises between latency, sensitivity, accuracy and computational cost. Some VAD algorithms also provide further analyses, for example whether the speech is voiced, unvoiced or sustained. Voice activity detection is usually language independent.
It was first investigated for use on time-assignment speech interpolation (TASI) systems
Answered by
Manoj
, an ibibo Specialist,
at
7:28 PM on June 25, 2009