Files
silero-vad/README.md
2020-12-15 14:26:28 +00:00

4.8 KiB

Mailing list : test Mailing list : test License: CC BY-NC 4.0

Open on Torch Hub (coming soon)

Open In Colab

header

Silero VAD

Single Image Why our VAD is better than WebRTC

Silero VAD: pre-trained enterprise-grade Voice Activity Detector (VAD), Number Detector and Language Classifier. Enterprise-grade Speech Products made refreshingly simple (see our STT models).

Currently, there are hardly any high quality / modern / free / public voice activity detectors except for WebRTC Voice Activity Detector (link).

Also in enterprise it is crucial to be able to anonymize large-scale spoken corpora (i.e. remove personal data). Typically personal data is considered to be private / sensitive if it contains (i) a name (ii) some private ID. Name recognition is highly subjective and would depend on locale and business case, but Voice Activity and Number detections are quite general tasks.

Key features:

  • Modern, portable;
  • Lowe memory footprint;
  • Superior metrics to WebRTC;
  • Trained on huge spoken corpora and noise / sound libraries;
  • Slower than WebRTC, but fast enough for IOT / edge / mobile applications;

Typical use cases:

  • Spoken corpora anonymization;
  • Voice activity detection for IOT / edge / mobile use cases;
  • Data cleaning and preparation, number and voice detection in general;

Getting Started

The models are small enough to be included directly into this repository. Newer models will supersede older models directly.

Currently we provide the following functionality:

PyTorch ONNX VAD Number Detector Language Clf Languages Colab
✔️ ✔️ ✔️ ru, en, de, es Open In Colab

Version history:

Version Date Comment
v1 2020-12-15 initial release
v2 coming soon Add Number Detector or Language Classifier heads

PyTorch

Open In Colab

Open on Torch Hub (coming soon)

TBD

ONNX

Open In Colab

You can run our model everywhere, where you can import the ONNX model or run ONNX runtime.

TBD

Metrics

Performance Metrics

Speed metrics here.

Quality Metrics

Quality metrics here.

Contact

Get in Touch

Try our models, create an issue, start a discussion, join our telegram chat, email us.

Commercial Inquiries

Please see our wiki and tiers for relevant information and email us directly.