A collection of applications built with FastRTC. Click on the tags below to find the app you're looking for!
- :speaking_head:{ .lg .middle }:eyes:{ .lg .middle } **Gemini Audio Video Chat**
{: data-tags="audio,video,real-time-api"}
---
Stream BOTH your webcam video and audio feeds to Google Gemini. You can also upload images to augment your conversation!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/gemini-audio-video)
[:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/gemini-audio-video)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/gemini-audio-video/blob/main/app.py)
- :speaking_head:{ .lg .middle } **Google Gemini Real Time Voice API**
{: data-tags="audio,real-time-api,voice-chat"}
---
Talk to Gemini in real time using Google's voice API.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-gemini)
[:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-gemini-gradio)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-gemini/blob/main/app.py)
- :speaking_head:{ .lg .middle } **OpenAI Real Time Voice API**
{: data-tags="audio,real-time-api,voice-chat"}
---
Talk to ChatGPT in real time using OpenAI's voice API.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-openai)
[:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-openai-gradio)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-openai/blob/main/app.py)
- :robot:{ .lg .middle } **Hello Computer**
{: data-tags="llm,stopword,sambanova"}
---
Say "computer" before asking your question!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/hello-computer)
[:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/hello-computer-gradio)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/hello-computer/blob/main/app.py)
- :robot:{ .lg .middle } **Llama Code Editor**
{: data-tags="audio,llm,code-generation,groq,stopword"}
---
Create and edit HTML pages with just your voice! Powered by Groq!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/llama-code-editor)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/llama-code-editor/blob/main/app.py)
- :speaking_head:{ .lg .middle } **SmolAgents with Voice**
{: data-tags="audio,llm,voice-chat,agentic"}
---
Build a voice-based smolagent to find a coworking space!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/burtenshaw/coworking_agent/)
[:octicons-code-16: Code](https://huggingface.co/spaces/burtenshaw/coworking_agent/blob/main/app.py)
- :speaking_head:{ .lg .middle } **Talk to Claude**
{: data-tags="audio,llm,voice-chat"}
---
Use the Anthropic and Play.Ht APIs to have an audio conversation with Claude.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-claude)
[:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-claude-gradio)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-claude/blob/main/app.py)
- :musical_note:{ .lg .middle } **LLM Voice Chat**
{: data-tags="audio,llm,voice-chat,groq,elevenlabs"}
---
Talk to an LLM with ElevenLabs!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/llm-voice-chat)
[:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/llm-voice-chat-gradio)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/llm-voice-chat/blob/main/app.py)
- :musical_note:{ .lg .middle } **Whisper Transcription**
{: data-tags="audio,transcription,groq"}
---
Have Whisper transcribe your speech in real time!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/whisper-realtime)
[:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/whisper-realtime-gradio)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/whisper-realtime/blob/main/app.py)
- :robot:{ .lg .middle } **Talk to SambaNova**
{: data-tags="llm,stopword,sambanova"}
---
Talk to Llama 3.2 with the SambaNova API.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-sambanova)
[:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-sambanova-gradio)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-sambanova/blob/main/app.py)
- :speaking_head:{ .lg .middle } **Hello Llama: Stop Word Detection**
{: data-tags="audio,llm,code-generation,stopword,sambanova"}
---
A code editor built with Llama 3.3 70b that is triggered by the phrase "Hello Llama".
Build a Siri-like coding assistant in 100 lines of code!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/hey-llama-code-editor)
[:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/hey-llama-code-editor/blob/main/app.py)
- :speaking_head:{ .lg .middle } **Audio Input/Output with mini-omni2**
{: data-tags="audio,llm,voice-chat"}
---
Build a GPT-4o-like experience with mini-omni2, an audio-native LLM.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/mini-omni2-webrtc)
[:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/mini-omni2-webrtc/blob/main/app.py)
- :speaking_head:{ .lg .middle } **Kyutai Moshi**
{: data-tags="audio,llm,voice-chat,kyutai"}
---
Kyutai's Moshi is a novel speech-to-speech model for modeling human conversations.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-moshi)
[:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-moshi/blob/main/app.py)
- :speaking_head:{ .lg .middle } **Talk to Ultravox**
{: data-tags="audio,llm,voice-chat"}
---
Talk to Fixie.AI's audio-native Ultravox LLM with the transformers library.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-ultravox)
[:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-ultravox/blob/main/app.py)
- :speaking_head:{ .lg .middle } **Talk to Llama 3.2 3b**
{: data-tags="audio,llm,voice-chat"}
---
Use the Lepton API to make Llama 3.2 talk back to you!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/llama-3.2-3b-voice-webrtc)
[:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/llama-3.2-3b-voice-webrtc/blob/main/app.py)
- :robot:{ .lg .middle } **Talk to Qwen2-Audio**
{: data-tags="audio,llm,voice-chat"}
---
Qwen2-Audio is a state-of-the-art audio-to-text LLM developed by Alibaba.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-qwen-webrtc)
[:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-qwen-webrtc/blob/main/app.py)
- :camera:{ .lg .middle } **YOLOv10 Object Detection**
{: data-tags="video,computer-vision"}
---
Run the YOLOv10 model on a user's webcam stream in real time!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/object-detection)
[:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/object-detection/blob/main/app.py)
- :camera:{ .lg .middle } **Video Object Detection with RT-DETR**
{: data-tags="video,computer-vision"}
---
Upload a video and stream out frames with detected objects, powered by the RT-DETR model.
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/rt-detr-object-detection-webrtc)
[:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/rt-detr-object-detection-webrtc/blob/main/app.py)
- :speaker:{ .lg .middle } **Text-to-Speech with Parler**
{: data-tags="audio"}
---
Stream out audio generated by Parler TTS!
[:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/parler-tts-streaming-webrtc)
[:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/parler-tts-streaming-webrtc/blob/main/app.py)
- :speaking_head:{ .lg .middle } **Real Time Transcription with On-device Whisper 🤗**
{: data-tags="audio,transcription,local"}
---
Transcribe speech in real time using Whisper via the Transformers library, running on your device!
[:octicons-code-16: Code](https://github.com/sofi444/realtime-transcription-fastrtc/blob/main/main.py)
- :speaking_head:{ .lg .middle } **Talk to Claude - Electron App**
{: data-tags="audio,electron"}
---
An Electron desktop application that uses FastRTC to enable voice conversations with Claude.
[:octicons-arrow-right-24: Demo](https://github.com/swairshah/voice-agent)
[:octicons-code-16: Code](https://github.com/swairshah/voice-agent)