diff --git a/README.md b/README.md index 6f00094..7ff6baf 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,4 @@ -[![Mailing list : test](http://img.shields.io/badge/Email-gray.svg?style=for-the-badge&logo=gmail)](mailto:hello@silero.ai) [![Mailing list : test](http://img.shields.io/badge/Telegram-blue.svg?style=for-the-badge&logo=telegram)](https://t.me/silero_speech) [![License: CC BY-NC 4.0](https://img.shields.io/badge/License-MIT-lightgrey.svg?style=for-the-badge)](https://github.com/snakers4/silero-vad/blob/master/LICENSE) +[![Mailing list : test](http://img.shields.io/badge/Email-gray.svg?style=for-the-badge&logo=gmail)](mailto:hello@silero.ai) [![Mailing list : test](http://img.shields.io/badge/Telegram-blue.svg?style=for-the-badge&logo=telegram)](https://t.me/silero_speech) [![License: CC BY-NC 4.0](https://img.shields.io/badge/License-MIT-lightgrey.svg?style=for-the-badge)](https://github.com/snakers4/silero-vad/blob/master/LICENSE) [![downloads](https://img.shields.io/pypi/dm/silero-vad?style=for-the-badge)](https://pypi.org/project/silero-vad/) [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/snakers4/silero-vad/blob/master/silero-vad.ipynb) @@ -22,6 +22,8 @@ https://user-images.githubusercontent.com/36505480/144874384-95f80f6d-a4f1-42cc-9be7-004c891dd481.mp4 +Please note, that video loads only if you are logged in your GitHub account. +
@@ -64,7 +66,11 @@ If you are planning to run the VAD using solely the `onnx-runtime`, it will run from silero_vad import load_silero_vad, read_audio, get_speech_timestamps model = load_silero_vad() wav = read_audio('path_to_audio_file') -speech_timestamps = get_speech_timestamps(wav, model) +speech_timestamps = get_speech_timestamps( + wav, + model, + return_seconds=True, # Return speech timestamps in seconds (default is samples) +) ``` **Using torch.hub**: @@ -76,7 +82,11 @@ model, utils = torch.hub.load(repo_or_dir='snakers4/silero-vad', model='silero_v (get_speech_timestamps, _, read_audio, _, _) = utils wav = read_audio('path_to_audio_file') -speech_timestamps = get_speech_timestamps(wav, model) +speech_timestamps = get_speech_timestamps( + wav, + model, + return_seconds=True, # Return speech timestamps in seconds (default is samples) +) ```
@@ -165,4 +175,4 @@ Please see our [wiki](https://github.com/snakers4/silero-models/wiki) for releva - Voice activity detection for the [browser](https://github.com/ricky0123/vad) using ONNX Runtime Web -- [Rust](https://github.com/snakers4/silero-vad/tree/master/examples/rust-example), [Go](https://github.com/snakers4/silero-vad/tree/master/examples/go), [Java](https://github.com/snakers4/silero-vad/tree/master/examples/java-example) and [other](https://github.com/snakers4/silero-vad/tree/master/examples) examples +- [Rust](https://github.com/snakers4/silero-vad/tree/master/examples/rust-example), [Go](https://github.com/snakers4/silero-vad/tree/master/examples/go), [Java](https://github.com/snakers4/silero-vad/tree/master/examples/java-example), [C++](https://github.com/snakers4/silero-vad/tree/master/examples/cpp), [C#](https://github.com/snakers4/silero-vad/tree/master/examples/csharp) and [other](https://github.com/snakers4/silero-vad/tree/master/examples) community examples diff --git a/pyproject.toml b/pyproject.toml index 589e78b..b0c76cd 100644 --- a/pyproject.toml +++ b/pyproject.toml @@ -3,7 +3,7 @@ requires = ["hatchling"] build-backend = "hatchling.build" [project] name = "silero-vad" -version = "5.1.1" +version = "5.1.2" authors = [ {name="Silero Team", email="hello@silero.ai"}, ]