Introduction - If you have any usage issues, please Google them yourself
To evaluate a VAD, its output using test recordings is compared with those of an "ideal" VAD - created by hand-annotating the presence/absence of voice in the recordings. The performance of a VAD is commonly evaluated on the basis of the following four parameters[2]: