🗣️ Audio Input/Output with mini-omni2: Build a GPT-4o-like experience with mini-omni2, an audio-native LLM.
🗣️ Talk to Claude: Use the Anthropic and Play.Ht APIs to have an audio conversation with Claude.
🗣️ Kyutai Moshi: Kyutai's Moshi is a novel speech-to-speech model for modeling human conversations.
🗣️ Hello Llama: Stop Word Detection: A code editor built with Llama 3.3 70B that is triggered by the phrase "Hello Llama". Build a Siri-like coding assistant in 100 lines of code!
🤖 Llama Code Editor: Create and edit HTML pages with just your voice! Powered by SambaNova Systems.
🗣️ Talk to Ultravox: Talk to Fixie.AI's audio-native Ultravox LLM with the transformers library.
🗣️ Talk to Llama 3.2 3B: Use the Lepton API to make Llama 3.2 talk back to you!
🤖 Talk to Qwen2-Audio: Qwen2-Audio is a state-of-the-art audio-to-text LLM developed by Alibaba.
📷 YOLOv10 Object Detection: Run the YOLOv10 model on a user's webcam stream in real time!
📷 Video Object Detection with RT-DETR: Upload a video and stream out frames with detected objects (powered by the RT-DETR model).
🔊 Text-to-Speech with Parler: Stream out audio generated by Parler TTS!