diff --git a/.github/workflows/docs.yml b/.github/workflows/docs.yml index f58f8c4..3cb301a 100644 --- a/.github/workflows/docs.yml +++ b/.github/workflows/docs.yml @@ -3,8 +3,16 @@ on: push: branches: - main + pull_request: + branches: + - main + permissions: contents: write + pull-requests: write + deployments: write + pages: write + jobs: deploy: runs-on: ubuntu-latest @@ -24,5 +32,19 @@ jobs: path: .cache restore-keys: | mkdocs-material- - - run: pip install mkdocs-material - - run: mkdocs gh-deploy --force \ No newline at end of file + - run: pip install mkdocs-material + - name: Build docs + run: mkdocs build + + - name: Deploy to GH Pages (main) + if: github.event_name == 'push' + run: mkdocs gh-deploy --force + + - name: Deploy PR Preview + if: github.event_name == 'pull_request' + uses: rossjrw/pr-preview-action@v1 + with: + source-dir: ./site + preview-branch: gh-pages + umbrella-dir: pr-preview + action: auto \ No newline at end of file diff --git a/.gitignore b/.gitignore index 5a9f2b3..5ebb485 100644 --- a/.gitignore +++ b/.gitignore @@ -12,6 +12,9 @@ node_modules backend/**/templates/ demo/MobileNetSSD_deploy.caffemodel demo/MobileNetSSD_deploy.prototxt.txt +demo/scratch +.gradio +.vscode .DS_Store test/ .env \ No newline at end of file diff --git a/README.md b/README.md index 5ddb403..d7b85ca 100644 --- a/README.md +++ b/README.md @@ -1,57 +1,130 @@ -
+
-🗣️ Audio Input/Output with mini-omni2-Build a GPT-4o like experience with mini-omni2, an audio-native LLM. - +🗣️👀 Gemini Audio Video Chat+Stream BOTH your webcam video and audio feeds to Google Gemini. You can also upload images to augment your conversation! + + |
+
+🗣️ Google Gemini Real Time Voice API+Talk to Gemini in real time using Google's voice API. + + + |
+
+🗣️ OpenAI Real Time Voice API+Talk to ChatGPT in real time using OpenAI's voice API. + + + |
+
+🤖 Hello Computer+Say computer before asking your question! + + + |
+
+🤖 Llama Code Editor+Create and edit HTML pages with just your voice! Powered by SambaNova systems. + + |
🗣️ Talk to ClaudeUse the Anthropic and Play.Ht APIs to have an audio conversation with Claude. - + + |
+
+🎵 Whisper Transcription+Have whisper transcribe your speech in real time! + + + |
+
+📷 Yolov10 Object Detection+Run the Yolov10 model on a user webcam stream in real time! + + |
-🤖 Llama Code Editor-Create and edit HTML pages with just your voice! Powered by SambaNova systems. - - - |
-
-🗣️ Talk to Ultravox-Talk to Fixie.AI's audio-native Ultravox LLM with the transformers library. - - - |
-
-🗣️ Talk to Llama 3.2 3b-Use the Lepton API to make Llama 3.2 talk back to you! - - - |
-
-🤖 Talk to Qwen2-Audio-Qwen2-Audio is a SOTA audio-to-text LLM developed by Alibaba. - - - |
-
-📷 Yolov10 Object Detection-Run the Yolov10 model on a user webcam stream in real time! - - - |
-
-📷 Video Object Detection with RT-DETR-Upload a video and stream out frames with detected objects (powered by RT-DETR) model. - - |
-
-🔊 Text-to-Speech with Parler-Stream out audio generated by Parler TTS! - - |
-- | -
This may take a few moments
+ + +