Added HumAwareVAD to VAD Gallery (#194)

2026-02-05 18:09:23 +08:00 · 2025-03-21 00:11:39 +05:30
parent 3fc441a6f0
commit ea36257abf
1 changed files with 32 additions and 1 deletions
--- a/docs/turn_taking_gallery.md
+++ b/docs/turn_taking_gallery.md
@@ -45,6 +45,37 @@ document.querySelectorAll('.tag-button').forEach(button => {
 <div class="grid cards" markdown>
 -   :speaking_head:{ .lg .middle }:eyes:{ .lg .middle } __HumAware VAD__
 {: data-tags="vad-models"}
    ---
    Description
    **HumAware-VAD** is a fine-tuned version of **Silero-VAD**, specifically trained to **distinguish humming from actual speech**.
    Standard VAD models often misclassify humming as speech, leading to inaccurate speech segmentation.
    **HumAware-VAD** improves detection accuracy in environments with background humming, music, and vocal sounds.  
    **Install Instructions**  
    ```sh
    pip install humaware-vad
    ```  
    **Use with FastRTC**  
    ```sh
    git clone https://github.com/CuriousMonkey7/HumAwareVad.git
    cd HumAwareVad
    python app.py
    ```
    <video width=98% src="https://github.com/user-attachments/assets/ea06a06f-d9e8-4203-a982-56326d160c9e" controls style="text-align: center"></video>  
    [:octicons-arrow-right-24: Demo](https://github.com/CuriousMonkey7/HumAwareVad/blob/main/app.py)  
    [:octicons-code-16: Repository](https://github.com/CuriousMonkey7/HumAwareVad/blob/main/humaware_vad/__init__.py)  
 </div>
 <div class="grid cards" markdown>
 -   :speaking_head:{ .lg .middle }:eyes:{ .lg .middle } __Walkie Talkie__
 {: data-tags="turn-taking-algorithm"}