Rebrand to FastRTC (#60)

* Add code * add code * add code * Rename messages * rename * add code * Add demo * docs + demos + bug fixes * add code * styles * user guide * Styles * Add code * misc docs updates * print nit * whisper + pr * url for images * whsiper update * Fix bugs * remove demo files * version number * Fix pypi readme * Fix * demos * Add llama code editor * Update llama code editor and object detection cookbook * Add more cookbook demos * add code * Fix links for PR deploys * add code * Fix the install * add tts * TTS docs * Typo * Pending bubbles for reply on pause * Stream redesign (#63) * better error handling * Websocket error handling * add code --------- Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local> * remove docs from dist * Some docs typos * more typos * upload changes + docs * docs * better phone * update docs * add code * Make demos better * fix docs + websocket start_up * remove mention of FastAPI app * fastphone tweaks * add code * ReplyOnStopWord fixes * Fix cookbook * Fix pypi readme * add code * bump versions * sambanova cookbook * Fix tags * Llm voice chat * kyutai tag * Add error message to all index.html * STT module uses Moonshine * Not required from typing extensions * fix llm voice chat * Add vpn warning * demo fixes * demos * Add more ui args and gemini audio-video * update cookbook * version 9 --------- Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2026-02-04 17:39:23 +08:00 · 2025-02-24 01:13:42 -05:00
parent 36190066ec
commit 853d6a06b5
131 changed files with 12349 additions and 4741 deletions
--- a/docs/cookbook.md
+++ b/docs/cookbook.md
@@ -1,6 +1,65 @@
+
+
+<style>
+.tag-button {
+    cursor: pointer;
+    opacity: 0.5;
+    transition: opacity 0.2s ease;
+}
+
+.tag-button > code {
+    color: var(--supernova);
+}
+
+.tag-button.active {
+    opacity: 1;
+}
+</style>
+
+A collection of applications built with FastRTC. Click on the tags below to find the app you're looking for!
+
+
+<div class="tag-buttons">
+  <button class="tag-button" data-tag="audio"><code>audio</code></button>
+  <button class="tag-button" data-tag="video"><code>video</code></button>
+  <button class="tag-button" data-tag="llm"><code>llm</code></button>
+  <button class="tag-button" data-tag="computer-vision"><code>computer-vision</code></button>
+  <button class="tag-button" data-tag="real-time-api"><code>real-time-api</code></button>
+  <button class="tag-button" data-tag="voice-chat"><code>voice chat</code></button>
+  <button class="tag-button" data-tag="code-generation"><code>code generation</code></button>
+  <button class="tag-button" data-tag="stopword"><code>stopword</code></button>
+  <button class="tag-button" data-tag="transcription"><code>transcription</code></button>
+  <button class="tag-button" data-tag="sambanova"><code>SambaNova</code></button>
+  <button class="tag-button" data-tag="groq"><code>Groq</code></button>
+  <button class="tag-button" data-tag="elevenlabs"><code>ElevenLabs</code></button>
+  <button class="tag-button" data-tag="elevenlabs"><code>Kyutai</code></button>
+</div>
+
+<script>
+function filterCards() {
+    const activeButtons = document.querySelectorAll('.tag-button.active');
+    const selectedTags = Array.from(activeButtons).map(button => button.getAttribute('data-tag'));
+    const cards = document.querySelectorAll('.grid.cards > ul > li > p[data-tags]');
+    
+    cards.forEach(card => {
+        const cardTags = card.getAttribute('data-tags').split(',');
+        const shouldShow = selectedTags.length === 0 || selectedTags.some(tag => cardTags.includes(tag));
+        card.parentElement.style.display = shouldShow ? 'block' : 'none';
+    });
+}
+document.querySelectorAll('.tag-button').forEach(button => {
+    button.addEventListener('click', () => {
+        button.classList.toggle('active');
+        filterCards();
+    });
+});
+</script>
+
+
 <div class="grid cards" markdown>

 -   :speaking_head:{ .lg .middle }:eyes:{ .lg .middle } __Gemini Audio Video Chat__
+{: data-tags="audio,video,real-time-api"}

    ---

@@ -8,35 +67,130 @@

    <video width=98% src="https://github.com/user-attachments/assets/9636dc97-4fee-46bb-abb8-b92e69c08c71" controls style="text-align: center"></video>

-    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/gemini-audio-video-chat)
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/gemini-audio-video)
+
+    [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/gemini-audio-video)
    
-    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/gemini-audio-video-chat/blob/main/app.py)
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/gemini-audio-video)

 -   :speaking_head:{ .lg .middle } __Google Gemini Real Time Voice API__
+{: data-tags="audio,real-time-api,voice-chat"}

    ---

    Talk to Gemini in real time using Google's voice API.

-    <video width=98% src="https://github.com/user-attachments/assets/da8c8a2a-5d99-4ac7-8927-0f7812e4146f" controls style="text-align: center"></video>
+    <video width=98% src="https://github.com/user-attachments/assets/ea6d18cb-8589-422b-9bba-56332d9f61de" controls style="text-align: center"></video>

-    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/gemini-voice)
-    
-    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/gemini-voice/blob/main/app.py)
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-gemini)
+
+    [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-gemini-gradio)
+
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-gemini/blob/main/app.py)

 -   :speaking_head:{ .lg .middle } __OpenAI Real Time Voice API__
+{: data-tags="audio,real-time-api,voice-chat"}

    ---

    Talk to ChatGPT in real time using OpenAI's voice API.

-    <video width=98% src="https://github.com/user-attachments/assets/41a63376-43ec-496a-9b31-4f067d3903d6" controls style="text-align: center"></video>
+    <video width=98% src="https://github.com/user-attachments/assets/178bdadc-f17b-461a-8d26-e915c632ff80" controls style="text-align: center"></video>

-    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/openai-realtime-voice)
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-openai)
+
+    [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-openai-gradio)
    
-    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/openai-realtime-voice/blob/main/app.py)
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-openai/blob/main/app.py)
+
+-   :robot:{ .lg .middle } __Hello Computer__
+{: data-tags="llm,stopword,sambanova"}
+
+    ---
+
+    Say computer before asking your question!
+    <video width=98% src="https://github.com/user-attachments/assets/afb2a3ef-c1ab-4cfb-872d-578f895a10d5" controls style="text-align: center"></video>
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/hello-computer)
+
+    [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/hello-computer-gradio)
+    
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/hello-computer/blob/main/app.py)
+
+-   :robot:{ .lg .middle } __Llama Code Editor__
+{: data-tags="audio,llm,code-generation,groq,stopword"}
+
+    ---
+
+    Create and edit HTML pages with just your voice! Powered by Groq!
+
+    <video width=98% src="https://github.com/user-attachments/assets/98523cf3-dac8-4127-9649-d91a997e3ef5" controls style="text-align: center"></video>
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/llama-code-editor)
+    
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/llama-code-editor/blob/main/app.py)
+
+-   :speaking_head:{ .lg .middle } __Talk to Claude__
+{: data-tags="audio,llm,voice-chat"}
+
+    ---
+
+    Use the Anthropic and Play.Ht APIs to have an audio conversation with Claude.
+
+    <video width=98% src="https://github.com/user-attachments/assets/fb6ef07f-3ccd-444a-997b-9bc9bdc035d3" controls style="text-align: center"></video>
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-claude)
+
+    [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-claude-gradio)
+    
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-claude/blob/main/app.py)
+
+-   :musical_note:{ .lg .middle } __LLM Voice Chat__
+{: data-tags="audio,llm,voice-chat,groq,elevenlabs"}
+
+    ---
+
+    Talk to an LLM with ElevenLabs!
+
+    <video width=98% src="https://github.com/user-attachments/assets/584e898b-91af-4816-bbb0-dd3216eb80b0" controls style="text-align: center"></video>
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/llm-voice-chat)
+
+    [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/llm-voice-chat-gradio)
+
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/llm-voice-chat/blob/main/app.py)
+
+-   :musical_note:{ .lg .middle } __Whisper Transcription__
+{: data-tags="audio,transcription,groq"}
+
+    ---
+
+    Have whisper transcribe your speech in real time!
+
+    <video width=98% src="https://github.com/user-attachments/assets/87603053-acdc-4c8a-810f-f618c49caafb" controls style="text-align: center"></video>
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/whisper-realtime)
+
+    [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/whisper-realtime-gradio)
+    
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/whisper-realtime/blob/main/app.py)
+
+-   :robot:{ .lg .middle } __Talk to Sambanova__
+{: data-tags="llm,stopword,sambanova"}
+
+    ---
+
+    Talk to Llama 3.2 with the SambaNova API.
+    <video width=98% src="https://github.com/user-attachments/assets/92e4a45a-b5e9-45cd-b7f4-9339ceb343e1" controls style="text-align: center"></video>
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-sambanova)
+
+    [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-sambanova-gradio)
+    
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-sambanova/blob/main/app.py)

 -   :speaking_head:{ .lg .middle } __Hello Llama: Stop Word Detection__
+{: data-tags="audio,llm,code-generation,stopword,sambanova"}

    ---

@@ -49,19 +203,9 @@
    
    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/hey-llama-code-editor/blob/main/app.py)

-   :robot:{ .lg .middle } __Llama Code Editor__
-
-    ---
-
-    Create and edit HTML pages with just your voice! Powered by SambaNova systems.
-
-    <video width=98% src="https://github.com/user-attachments/assets/a09647f1-33e1-4154-a5a3-ffefda8a736a" controls style="text-align: center"></video>
-
-    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/llama-code-editor)
-    
-    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/llama-code-editor/blob/main/app.py)

 -   :speaking_head:{ .lg .middle } __Audio Input/Output with mini-omni2__
+{: data-tags="audio,llm,voice-chat"}

    ---

@@ -73,19 +217,8 @@
    
    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/mini-omni2-webrtc/blob/main/app.py)

-   :speaking_head:{ .lg .middle } __Talk to Claude__
-
-    ---
-
-    Use the Anthropic and Play.Ht APIs to have an audio conversation with Claude.
-
-    <video width=98% src="https://github.com/user-attachments/assets/650bc492-798e-4995-8cef-159e1cfc2185" controls style="text-align: center"></video>
-
-    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-claude)
-    
-    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-claude/blob/main/app.py)
-
 -   :speaking_head:{ .lg .middle } __Kyutai Moshi__
+{: data-tags="audio,llm,voice-chat,kyutai"}

    ---

@@ -98,6 +231,7 @@
    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-moshi/blob/main/app.py)

 -   :speaking_head:{ .lg .middle } __Talk to Ultravox__
+{: data-tags="audio,llm,voice-chat"}

    ---

@@ -111,6 +245,7 @@


 -   :speaking_head:{ .lg .middle } __Talk to Llama 3.2 3b__
+{: data-tags="audio,llm,voice-chat"}

    ---

@@ -124,6 +259,7 @@


 -   :robot:{ .lg .middle } __Talk to Qwen2-Audio__
+{: data-tags="audio,llm,voice-chat"}

    ---

@@ -137,18 +273,20 @@


 -   :camera:{ .lg .middle } __Yolov10 Object Detection__
+{: data-tags="video,computer-vision"}

    ---

    Run the Yolov10 model on a user webcam stream in real time!

-    <video width=98% src="https://github.com/user-attachments/assets/c90d8c9d-d2d5-462e-9e9b-af969f2ea73c" controls style="text-align: center"></video>
+    <video width=98% src="https://github.com/user-attachments/assets/f82feb74-a071-4e81-9110-a01989447ceb" controls style="text-align: center"></video>

-    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/webrtc-yolov10n)
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/object-detection)

-    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/webrtc-yolov10n/blob/main/app.py)
+    [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/object-detection/blob/main/app.py)

 -   :camera:{ .lg .middle } __Video Object Detection with RT-DETR__
+{: data-tags="video,computer-vision"}

    ---

@@ -159,6 +297,7 @@
    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/rt-detr-object-detection-webrtc/blob/main/app.py)

 -   :speaker:{ .lg .middle } __Text-to-Speech with Parler__
+{: data-tags="audio"}

    ---