add min_speech_samples param

adamnsandle
2021-01-26 11:54:09 +00:00
parent e4b7006fee
commit d1d9e6368f
2 changed files with 3 additions and 1 deletions

@@ -322,6 +322,7 @@ Since our VAD (only VAD, other networks are more flexible) was trained on chunks
- `neg_trig_sum` - same as `trig_sum`, but for switching from triggered to non-triggered state (non-speech)
- `num_steps` - number of overlapping windows to split each audio chunk into (we recommend 4 or 8)
- `num_samples_per_window` - number of samples in each window; our models were trained using `4000` samples (250 ms) per window, so this is the preferred value (smaller values reduce [quality](https://github.com/snakers4/silero-vad/issues/2#issuecomment-750840434))
- `min_speech_samples` - minimum speech chunk duration in samples; shorter chunks are not treated as speech
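
To make the sample-based units above concrete, here is a minimal sketch of how a minimum-duration filter could behave. It is an illustration only, not the library's implementation: `samples_to_ms`, `drop_short_chunks`, and the `start`/`end` dictionary layout are hypothetical names, and the 16 kHz sample rate is an assumption inferred from `4000` samples being 250 ms.

```python
# Assumed sample rate: 4000 samples == 250 ms implies 16 kHz.
SAMPLE_RATE = 16000


def samples_to_ms(num_samples, sample_rate=SAMPLE_RATE):
    """Convert a duration in samples to milliseconds."""
    return 1000 * num_samples / sample_rate


def drop_short_chunks(chunks, min_speech_samples):
    """Keep only chunks that last at least min_speech_samples (hypothetical helper)."""
    return [c for c in chunks if c["end"] - c["start"] >= min_speech_samples]


# Hypothetical detected chunks, with boundaries given in samples.
chunks = [
    {"start": 0, "end": 3000},       # 3000 samples, too short
    {"start": 8000, "end": 24000},   # 16000 samples
    {"start": 30000, "end": 40000},  # 10000 samples, exactly at the threshold
]

kept = drop_short_chunks(chunks, min_speech_samples=10000)
window_ms = samples_to_ms(4000)
```

With a threshold of `10000` samples, only the first chunk is discarded; a chunk exactly at the threshold is kept in this sketch, which is a design choice of the example rather than a documented behavior.
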
### How VAD Works