A collection of applications built with FastRTC. Click on the tags below to find the app you're looking for!
- :speaking_head:{ .lg .middle }:eyes:{ .lg .middle } __Gemini Audio Video Chat__ {: data-tags="audio,video,real-time-api"} --- Stream BOTH your webcam video and audio feeds to Google Gemini. You can also upload images to augment your conversation! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/gemini-audio-video) [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/gemini-audio-video) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/gemini-audio-video) - :speaking_head:{ .lg .middle } __Google Gemini Real Time Voice API__ {: data-tags="audio,real-time-api,voice-chat"} --- Talk to Gemini in real time using Google's voice API. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-gemini) [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-gemini-gradio) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-gemini/blob/main/app.py) - :speaking_head:{ .lg .middle } __OpenAI Real Time Voice API__ {: data-tags="audio,real-time-api,voice-chat"} --- Talk to ChatGPT in real time using OpenAI's voice API. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-openai) [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-openai-gradio) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-openai/blob/main/app.py) - :robot:{ .lg .middle } __Hello Computer__ {: data-tags="llm,stopword,sambanova"} --- Say computer before asking your question! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/hello-computer) [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/hello-computer-gradio) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/hello-computer/blob/main/app.py) - :robot:{ .lg .middle } __Llama Code Editor__ {: data-tags="audio,llm,code-generation,groq,stopword"} --- Create and edit HTML pages with just your voice! Powered by Groq! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/llama-code-editor) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/llama-code-editor/blob/main/app.py) - :speaking_head:{ .lg .middle } __Talk to Claude__ {: data-tags="audio,llm,voice-chat"} --- Use the Anthropic and Play.Ht APIs to have an audio conversation with Claude. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-claude) [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-claude-gradio) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-claude/blob/main/app.py) - :musical_note:{ .lg .middle } __LLM Voice Chat__ {: data-tags="audio,llm,voice-chat,groq,elevenlabs"} --- Talk to an LLM with ElevenLabs! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/llm-voice-chat) [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/llm-voice-chat-gradio) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/llm-voice-chat/blob/main/app.py) - :musical_note:{ .lg .middle } __Whisper Transcription__ {: data-tags="audio,transcription,groq"} --- Have whisper transcribe your speech in real time! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/whisper-realtime) [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/whisper-realtime-gradio) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/whisper-realtime/blob/main/app.py) - :robot:{ .lg .middle } __Talk to Sambanova__ {: data-tags="llm,stopword,sambanova"} --- Talk to Llama 3.2 with the SambaNova API. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/talk-to-sambanova) [:octicons-arrow-right-24: Gradio UI](https://huggingface.co/spaces/fastrtc/talk-to-sambanova-gradio) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/talk-to-sambanova/blob/main/app.py) - :speaking_head:{ .lg .middle } __Hello Llama: Stop Word Detection__ {: data-tags="audio,llm,code-generation,stopword,sambanova"} --- A code editor built with Llama 3.3 70b that is triggered by the phrase "Hello Llama". Build a Siri-like coding assistant in 100 lines of code! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/hey-llama-code-editor) [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/hey-llama-code-editor/blob/main/app.py) - :speaking_head:{ .lg .middle } __Audio Input/Output with mini-omni2__ {: data-tags="audio,llm,voice-chat"} --- Build a GPT-4o like experience with mini-omni2, an audio-native LLM. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/mini-omni2-webrtc) [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/mini-omni2-webrtc/blob/main/app.py) - :speaking_head:{ .lg .middle } __Kyutai Moshi__ {: data-tags="audio,llm,voice-chat,kyutai"} --- Kyutai's moshi is a novel speech-to-speech model for modeling human conversations. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-moshi) [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-moshi/blob/main/app.py) - :speaking_head:{ .lg .middle } __Talk to Ultravox__ {: data-tags="audio,llm,voice-chat"} --- Talk to Fixie.AI's audio-native Ultravox LLM with the transformers library. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-ultravox) [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-ultravox/blob/main/app.py) - :speaking_head:{ .lg .middle } __Talk to Llama 3.2 3b__ {: data-tags="audio,llm,voice-chat"} --- Use the Lepton API to make Llama 3.2 talk back to you! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/llama-3.2-3b-voice-webrtc) [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/llama-3.2-3b-voice-webrtc/blob/main/app.py) - :robot:{ .lg .middle } __Talk to Qwen2-Audio__ {: data-tags="audio,llm,voice-chat"} --- Qwen2-Audio is a SOTA audio-to-text LLM developed by Alibaba. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-qwen-webrtc) [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-qwen-webrtc/blob/main/app.py) - :camera:{ .lg .middle } __Yolov10 Object Detection__ {: data-tags="video,computer-vision"} --- Run the Yolov10 model on a user webcam stream in real time! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/fastrtc/object-detection) [:octicons-code-16: Code](https://huggingface.co/spaces/fastrtc/object-detection/blob/main/app.py) - :camera:{ .lg .middle } __Video Object Detection with RT-DETR__ {: data-tags="video,computer-vision"} --- Upload a video and stream out frames with detected objects (powered by RT-DETR) model. [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/rt-detr-object-detection-webrtc) [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/rt-detr-object-detection-webrtc/blob/main/app.py) - :speaker:{ .lg .middle } __Text-to-Speech with Parler__ {: data-tags="audio"} --- Stream out audio generated by Parler TTS! [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/parler-tts-streaming-webrtc) [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/parler-tts-streaming-webrtc/blob/main/app.py)