Silero VAD
Single Image Why our VAD is better than WebRTC
Silero VAD: pre-trained enterprise-grade Voice Activity Detector (VAD), Number Detector and Language Classifier. Enterprise-grade Speech Products made refreshingly simple (all see our STT).
Currently, there are hardly any high quality / modern / free / public voice activity detectors except for WebRTC Voice Activity Detector (link).
Also in enterprise it is crucial to be able to anonymize large-scale spoken corpora (i.e. remove personal data). Typically personal data is considered to be private / sensitive if it contains (i) a name (ii) some private ID. Name recognition is highly subjective and would depend on locale and business case, but Voice Activity and Number detections are quite general tasks.
Key advantages / features:
- Modern, portable;
- Small memory footprint;
- Trained on huge spoken corpora and noise / sound libraries;
- Slower than WebRTC, but sufficiently fast for IOT / edge / mobile applications;
- Superior metrics to WebRTC;
Typical use cases:
- Spoken corpora anonymization;
- Voice detection for IOT / edge / mobile use cases;
- Data cleaning and preparation, number and voice detection in general;
Getting Started
The models are small enough to be included directly into this repository. Newer models will supersede older models directly.
Currently we provide the following models:
| Released | PyTorch | ONNX | VAD | Number Detector | Language Classifier | Languages | Colab | |
|---|---|---|---|---|---|---|---|---|
| v1 | 2020-12-15 | ✔️ | ✔️ | ✔️ | ru, en, de, es |
Version history:
- v1, 2020-12-15, initial release, no Number Detector or Language Classifier heads yet;
PyTorch
TBD
ONNX
You can run our model everywhere, where you can import the ONNX model or run ONNX runtime.
TBD
Metrics
Performance Metrics
Speed metrics here.
Quality Metrics
Quality metrics here.
Contact
Get in Touch
Try our models, create an issue, start a discussion, join our telegram chat, email us.
Commercial Inquiries
Please see our wiki and tiers for relevant information and email us directly.
