Docs (#20)

* Docs
* code
2026-02-05 18:09:23 +08:00 · 2024-11-18 14:06:46 -05:00
parent d7acdd7eb4
commit 2434a65747
12 changed files with 575 additions and 3 deletions
--- a/docs/cookbook.md
+++ b/docs/cookbook.md
@@ -0,0 +1,87 @@
+<div class="grid cards" markdown>
+
+-   :speaking_head:{ .lg .middle } __Audio Input/Output with mini-omni2__
+
+    ---
+
+    Build a GPT-4o-like experience with mini-omni2, an audio-native LLM.
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/mini-omni2-webrtc)
+    
+    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/mini-omni2-webrtc/blob/main/app.py)
+
+-   :speaking_head:{ .lg .middle } __Talk to Claude__
+
+    ---
+
+    Use the Anthropic and Play.Ht APIs to have an audio conversation with Claude.
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-claude)
+    
+    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-claude/blob/main/app.py)
+
+-   :speaking_head:{ .lg .middle } __Talk to Llama 3.2 3b__
+
+    ---
+
+    Use the Lepton API to make Llama 3.2 talk back to you!
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/llama-3.2-3b-voice-webrtc)
+
+    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/llama-3.2-3b-voice-webrtc/blob/main/app.py)
+
+
+-   :speaking_head:{ .lg .middle } __Talk to Ultravox__
+
+    ---
+
+    Talk to Fixie.AI's audio-native Ultravox LLM with the transformers library.
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-ultravox)
+
+    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-ultravox/blob/main/app.py)
+
+
+-   :robot:{ .lg .middle } __Talk to Qwen2-Audio__
+
+    ---
+
+    Qwen2-Audio is a state-of-the-art audio-to-text LLM developed by Alibaba.
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/talk-to-qwen-webrtc)
+
+    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/talk-to-qwen-webrtc/blob/main/app.py)
+
+
+-   :camera:{ .lg .middle } __YOLOv10 Object Detection__
+
+    ---
+
+    Run the YOLOv10 model on a user's webcam stream in real time!
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/webrtc-yolov10n)
+
+    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/webrtc-yolov10n/blob/main/app.py)
+
+-   :camera:{ .lg .middle } __Video Object Detection with RT-DETR__
+
+    ---
+
+    Upload a video and stream out frames with detected objects (powered by the RT-DETR model).
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/rt-detr-object-detection-webrtc)
+
+    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/rt-detr-object-detection-webrtc/blob/main/app.py)
+
+-   :speaker:{ .lg .middle } __Text-to-Speech with Parler__
+
+    ---
+
+    Stream out audio generated by Parler TTS!
+
+    [:octicons-arrow-right-24: Demo](https://huggingface.co/spaces/freddyaboulton/parler-tts-streaming-webrtc)
+
+    [:octicons-code-16: Code](https://huggingface.co/spaces/freddyaboulton/parler-tts-streaming-webrtc/blob/main/app.py)
+
+
+</div>