Commit Graph

221 Commits

Author SHA1 Message Date
bingochaos
4bf0fbc716 add whl 2025-06-17 20:58:54 +08:00
bingochaos
909dbddf08 Merge remote-tracking branch 'origin/main' into open-avatar-chat-0.4.0 2025-06-17 20:39:40 +08:00
Freddy Boulton
0264cbd133 version 28 (#351) 2025-06-13 12:23:27 -04:00
Freddy Boulton
3abe0a4d8a Fix interactive video (#350)
* n

* remove template

* Add templates

* remove print
2025-06-13 12:22:38 -04:00
Freddy Boulton
b9c3f01e9f version 0.0.27 (#348) 2025-06-12 18:51:08 -04:00
AlbertMingXu
8780265659 chore: dispatch starting_recording and stop_recording. (#342)
Co-authored-by: Ming Xu <albertxu@amazon.com>
2025-06-09 18:36:32 -04:00
Freddy Boulton
6875d6610a Add Integrated Textbox to the docs + spaces (#343)
* Add to docs

* Fix requirements
2025-06-09 18:30:17 -04:00
Freddy Boulton
c97b1885c0 Version 0.0.26 (#339)
* bump

* Fix
2025-06-05 18:58:18 -04:00
Shane Blair
f45b23c770 [FIX] Allow usage of Cloudflare tokens if hf_token is missing (#338) 2025-06-05 09:42:56 -04:00
Freddy Boulton
1877720231 Add text mode (#321)
* Pretty good spot

* Working draft

* Fix other mode

* Add js to git

* Working

* Add code

* fix

* Fix

* Add code

* Fix submit race condition

* demo

* fix

* Fix

* Fix
2025-06-03 19:24:21 -04:00
Mahimai Raja
1179f8ef21 feat: added whisper cpp to speech to text documentation page (#324) 2025-05-30 13:53:29 -04:00
Freddy Boulton
0c146ee45e Pass Websocket to the context if available (#329)
* Add code

* Code

* Fix

* Add code
2025-05-30 13:38:59 -04:00
Freddy Boulton
3fc258cd1b Code (#332) 2025-05-29 17:46:19 -04:00
omahs
b74c372afd Fix typos (#330) 2025-05-29 14:27:27 -04:00
Sofia Casadei
6f02a2f2a9 chunk speech after s if no pause detected by VAD (#328)
* chunk speech after s if no pause detected by VAD

* add attr descriptions in AlgoOptions

* Fix

---------

Co-authored-by: Freddy Boulton <41651716+freddyaboulton@users.noreply.github.com>
2025-05-27 14:54:33 -04:00
Freddy Boulton
db6d411538 Fix (#322) 2025-05-21 11:09:10 -04:00
Freddy Boulton
c191f1ce90 Surpress Startup Logs (#319)
* Add code

* code
2025-05-20 12:30:36 -04:00
Freddy Boulton
ae95e973f6 Code (#313) 2025-05-13 12:11:58 -04:00
Freddy Boulton
8f61ad855d Add code (#311) 2025-05-12 10:22:14 -04:00
Mohamed Ted Meftah
bf71b2b0e9 fix: fail to use CLOUDFLARE_TURN_KEY_* even if HF_TOKEN is missing (#307) 2025-05-12 09:24:39 -04:00
Freddy Boulton
4ac69ee219 Increase timeout (#310)
* Increase timeout

* Version 24

* Build
2025-05-12 08:56:10 -04:00
Freddy Boulton
a6a093740f Add code (#300) 2025-04-23 16:23:32 -04:00
Freddy Boulton
02aef9da58 Add ability to Hide Title in Built-in UI + llama 4 cartesia tweaks (#299)
* merge title

* Fix
2025-04-23 16:01:54 -04:00
Freddy Boulton
745701c79c Add first-class support for Cartesia text-to-speech (#298)
* Demo

* patient intake

* cartesia

* Add cartesia

* Fix

* lint

* Move test

* Fix

* Fix

* Fix

* Fix
2025-04-23 15:15:57 -04:00
Freddy Boulton
24349dee0c Fix TURN credentials for interactive video + other Gemini Audio Video demo tweaks (#297)
* Gemini

* Add code

* demo tweaks
2025-04-23 12:52:47 -04:00
Aman Chauhan
f3308b6e81 Fixed path for telephone/handler in handle_incoming_call (#280)
Co-authored-by: Freddy Boulton <41651716+freddyaboulton@users.noreply.github.com>
2025-04-23 12:39:45 -04:00
Shubham Rasal
16bfb5be13 Update text_to_speech_gallery.md (#296) 2025-04-23 12:20:10 -04:00
Freddy Boulton
f8a214f132 Release (#294)
* commit

* Fix
2025-04-22 14:45:48 -04:00
Freddy Boulton
b2a01d5fe9 commit (#293) 2025-04-22 14:44:13 -04:00
Freddy Boulton
074e9c9345 Fix websocket interruption (#291)
* Code

* Fix

* add code

* interruptions

* Add code

* code

* Add code

* Add code

* code
2025-04-22 14:40:19 -04:00
Freddy Boulton
a68023101d Fix Websocket Client Processing (#286)
* Fix

* Add code
2025-04-17 12:21:13 -04:00
Freddy Boulton
c9bca428af Set ice candidates server (#285)
* Add code

* Add code

* Code
2025-04-17 10:20:53 -04:00
neil.xh
f476f9cf29 gs对话接入
本次代码评审新增并完善了gs视频聊天功能,包括前后端接口定义、状态管理及UI组件实现,并引入了新的依赖库以支持更多互动特性。
Link: https://code.alibaba-inc.com/xr-paas/gradio_webrtc/codereview/21273476
* 更新python 部分

* 合并videochat前端部分

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 替换audiowave

* 导入路径修改

* 合并websocket mode逻辑

* feat: gaussian avatar chat

* 增加其他渲染的入参

* feat: ws连接和使用

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 右边距离超出容器宽度,则向左移动

* 配置传递

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 高斯包异常

* 同步webrtc_utils

* 更新webrtc_utils

* 兼容on_chat_datachannel

* 修复设备名称列表没有正常显示的问题

* copy 传递 webrtc_id

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 保证webrtc 完成后再进行websocket连接

* feat: 音频表情数据接入

* dist 上传

* canvas 隐藏

* feat: 高斯文件下载进度透出

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 修改无法获取权限问题

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 先获取权限再获取设备

* fix: gs资源下载完成前不处理ws数据

* fix: merge

* 话术调整

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 修复设备切换后重新对话,又切换回默认设备的问题

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 更新localvideo 尺寸

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 不能默认default

* 修改音频权限问题

* 更新打包结果

* fix: 对话按钮状态跟gs资源挂钩,删除无用代码

* fix: merge

* feat: gs渲染模块从npm包引入

* fix

* 新增对话记录

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 样式修改

* 更新包

* fix: gs数字人初始化位置和静音

* 对话记录滚到底部

* 至少100%高度

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 略微上移文本框

* 开始连接时清空对话记录

* fix: update gs render npm

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 逻辑保证

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* feat: 音频初始化配置是否静音

* actionsbar在有字幕时调整位置

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 样式优化

* feat: 增加readme

* fix: 资源图片

* fix: docs

* fix: update gs render sdk

* fix: gs模式下画面位置计算

* fix: update readme

* 设备判断,太窄处理

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 是否有权限和是否有设备分开

* feat: gs 下载和加载钩子函数分离

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* fix: update gs render sdk

* 替换

* dist

* 上传文件

* del
2025-04-16 19:09:04 +08:00
Freddy Boulton
72fdc74e82 Change Repo URL (#282)
* Cartesia

* README
2025-04-15 21:48:38 -04:00
Freddy Boulton
b0a666ef55 Add a Medical Agent Example to showcase function calling (#281)
* Demo

* patient intake
2025-04-15 18:37:54 -04:00
Freddy Boulton
d710c06210 Fix openai demo (#279) 2025-04-15 09:42:22 -04:00
Freddy Boulton
54d07bc3c8 Add code (#276) 2025-04-14 09:57:15 -04:00
Shaon Debnath
5835e74377 Add docs for outbound calls with twilio (#273)
* Add docs for outbound calls with twilio

* Add code

---------

Co-authored-by: Freddy Boulton <41651716+freddyaboulton@users.noreply.github.com>
2025-04-14 09:30:48 -04:00
Freddy Boulton
dcde624449 Fix msg (#275) 2025-04-14 09:20:32 -04:00
Marcus Valtonen Örnhag
d42740372c Update old links in pyproject.toml (#270)
* Update old links

* Add email + lint

---------

Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
Co-authored-by: Freddy Boulton <41651716+freddyaboulton@users.noreply.github.com>
2025-04-10 10:12:22 -04:00
Freddy Boulton
73153cb3c9 cookbook (#267) 2025-04-09 10:41:13 -04:00
Václav Volhejn
58bccddd93 Fix audio type conversion (#259)
* Fix conversion between audio dtypes

* Run Pytest in CI

* Add pytest tests path in pyproject.toml

* Fix usages

* Use other PR's test format (more or less)

* Support legacy arguments

* Fix pyproject.toml and test location

* Omit `test` arg in CI, given by pyproject.toml

---------

Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-04-09 10:00:23 -04:00
Freddy Boulton
fdf6bea1c6 code (#265) 2025-04-09 09:38:18 -04:00
Freddy Boulton
837330dcd8 Cloudflare turn integration (#264)
* Turn integration

* Add code:

* type hint

* Fix typehint

* add code

* format

* WIP

* trickle ice

* bump version

* Better docs

* Modify

* code

* Mute icon for whisper

* Add code

* llama 4 demo

* code

* OpenAI interruptions

* fix docs
2025-04-09 09:36:51 -04:00
Marcus Valtonen Örnhag
f70b27bd41 Enforce modern typing (#258)
* Allow UP

* Upgrade typing

* test smolagents

* Change to contextlib

---------

Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
2025-04-08 16:46:12 -04:00
Erik Wasmosy
a07e9439b6 Add started_talking log message in ReplyOnPause and in api.md (#260) 2025-04-07 17:35:53 -04:00
Marcus Valtonen Örnhag
2331079c0f Introduce unit tests (#248)
* Proof-of-concept: unittests

* Add pytest-asyncio dep

* Import Body from stream

* Add test for allow_extra_tracks

* Cleanup decorators

* add test to linting

* fix ruff issues

* Run formatter

* fix

* Dont test every python version

---------

Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-04-07 17:35:25 -04:00
Marcus Valtonen Örnhag
0767030997 Introduce static type checking with pyright (#255) 2025-04-05 14:19:05 -04:00
Freddy Boulton
d7995b8116 Dont run docs ci on prs from forks (#257) 2025-04-04 15:38:55 -04:00
Freddy Boulton
3147b5979c Add API Reference and llms.txt (#256)
* stream api reference

* docs

* Add code

* Add code

* code
2025-04-04 15:32:06 -04:00