Commit Graph

41 Commits

Author SHA1 Message Date
neil.xh
f476f9cf29 gs对话接入
本次代码评审新增并完善了gs视频聊天功能,包括前后端接口定义、状态管理及UI组件实现,并引入了新的依赖库以支持更多互动特性。
Link: https://code.alibaba-inc.com/xr-paas/gradio_webrtc/codereview/21273476
* 更新python 部分

* 合并videochat前端部分

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 替换audiowave

* 导入路径修改

* 合并websocket mode逻辑

* feat: gaussian avatar chat

* 增加其他渲染的入参

* feat: ws连接和使用

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 右边距离超出容器宽度,则向左移动

* 配置传递

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 高斯包异常

* 同步webrtc_utils

* 更新webrtc_utils

* 兼容on_chat_datachannel

* 修复设备名称列表没有正常显示的问题

* copy 传递 webrtc_id

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 保证webrtc 完成后再进行websocket连接

* feat: 音频表情数据接入

* dist 上传

* canvas 隐藏

* feat: 高斯文件下载进度透出

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 修改无法获取权限问题

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 先获取权限再获取设备

* fix: gs资源下载完成前不处理ws数据

* fix: merge

* 话术调整

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 修复设备切换后重新对话,又切换回默认设备的问题

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 更新localvideo 尺寸

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 不能默认default

* 修改音频权限问题

* 更新打包结果

* fix: 对话按钮状态跟gs资源挂钩,删除无用代码

* fix: merge

* feat: gs渲染模块从npm包引入

* fix

* 新增对话记录

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 样式修改

* 更新包

* fix: gs数字人初始化位置和静音

* 对话记录滚到底部

* 至少100%高度

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 略微上移文本框

* 开始连接时清空对话记录

* fix: update gs render npm

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 逻辑保证

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* feat: 音频初始化配置是否静音

* actionsbar在有字幕时调整位置

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 样式优化

* feat: 增加readme

* fix: 资源图片

* fix: docs

* fix: update gs render sdk

* fix: gs模式下画面位置计算

* fix: update readme

* 设备判断,太窄处理

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 是否有权限和是否有设备分开

* feat: gs 下载和加载钩子函数分离

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* fix: update gs render sdk

* 替换

* dist

* 上传文件

* del
2025-04-16 19:09:04 +08:00
Václav Volhejn
06885d06c4 Ignore output_frame_size parameter (#210) 2025-04-01 14:10:27 -04:00
Marcus Valtonen Örnhag
1f0462371e Improve error message if track kind and modality mismatch (#230)
Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
2025-04-01 14:05:53 -04:00
Freddy Boulton
f742c93235 add code (#223) 2025-03-28 21:12:58 -04:00
Freddy Boulton
8ed27fba78 Close Stream from Backend (#222)
* Close from backend

* Add code
2025-03-28 20:47:34 -04:00
Freddy Boulton
6235b2de61 Add text-to-speech-gallery + reword galleries to be "Plugin Ecosystem" (#218)
* Add code

* Update docs/text_to_speech_gallery.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update docs/text_to_speech_gallery.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-03-27 19:06:52 -04:00
Freddy Boulton
7692ffad00 Add code (#211) 2025-03-25 14:42:46 -04:00
Freddy Boulton
e231f793e8 trigger release (#201)
* trigger release

* add code
2025-03-20 21:01:13 -04:00
Freddy Boulton
6742894d3d Add support for trickle ice (#193)
* cherry-pick trickle-ice

* Add code

* Add code

* format
2025-03-20 20:50:45 -04:00
Freddy Boulton
3fed4cb2ad Some Video Fixes (#200)
* FPS control:

* add code

* Add code
2025-03-20 20:45:46 -04:00
Václav Volhejn
3fc441a6f0 Create py.typed (#196) 2025-03-20 14:24:45 -04:00
Freddy Boulton
728a366924 Add js assets (#192) 2025-03-19 12:19:57 -04:00
Freddy Boulton
2a70b4f3ed add code (#189) 2025-03-18 21:38:00 -04:00
Freddy Boulton
44aac8d964 Fix issue when the audio stream mixes sample rates and numpy array data types (#188)
* Fix code

* Fix

* keep same
2025-03-18 18:53:47 -04:00
Freddy Boulton
93b14aae94 Fast phone (#183) 2025-03-17 12:22:03 -04:00
Sofian Mejjoute
66f0a81b76 feat: Add optional startup function to ReplyOnPause (#170)
* feat: Add optional startup function to ReplyOnPause

* feat: Implement startup_fn in ReplyOnStopWords

* refactor: Remove redundant startup_fn implementation in ReplyOnStopWords

* tweaks

* revert

---------

Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-03-11 19:11:29 -04:00
Freddy Boulton
514310691d Bump version (#164)
* Code'

* fix
2025-03-11 13:05:39 -04:00
Freddy Boulton
ee049cd4bc Add code (#160) 2025-03-10 17:03:54 -04:00
Sourabh
de95bc2caa fix: ensure 'model' is copied in ReplyOnPause.copy() (#155) 2025-03-10 11:44:43 -04:00
Freddy Boulton
dcad14768b Add code (#153) 2025-03-08 12:36:58 -05:00
Freddy Boulton
e26eb4567f Add code (#151) 2025-03-07 19:27:33 -05:00
Sourabh
f95c3c78be fix: remove unused user-provided Silero option (#150) 2025-03-07 18:19:26 -05:00
Freddy Boulton
504eb452f0 stt models (#147) 2025-03-07 17:03:11 -05:00
Freddy Boulton
cbbfa17679 Add Method for loading community Vad Models (#136)
* Add code

* add code
2025-03-07 16:27:18 -05:00
Freddy Boulton
6748a8df49 fixes (#145) 2025-03-07 14:15:37 -05:00
Freddy Boulton
11dae295da add code (#137) 2025-03-06 19:56:46 -05:00
Freddy Boulton
8f6287cea3 Improve Interruption Handling (#134)
* Clear websocket queue on interrupt

* add code
2025-03-06 13:42:56 -05:00
Aki Miyazaki
a0b46f4059 UnboundLocalError: local variable 'button' referenced before assignment (#126)
Co-authored-by: Aki <akxucker@gmail.com>
2025-03-06 12:43:30 -05:00
Freddy Boulton
df0706e048 Fix kokoro batch issue (#128)
* Fix kokoro batch issue

* code

* fix batch size

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-03-05 19:43:02 -05:00
Freddy Boulton
6517a93472 Clean up cancelled generators (#124)
* fix links

* fix upload

* add code

* Add code

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-03-04 18:08:10 -05:00
Freddy Boulton
6ea54777af ReplyOnPause and ReplyOnStopWords can be interrupted (#119)
* Add all this code

* add code

* Fix demo

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-03-03 21:47:16 -05:00
Freddy Boulton
d8396a2714 Fix async echo example (#118)
* Fix async example

* Version bump

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-03-03 14:06:24 -05:00
Freddy Boulton
146143efb7 Fix backup url (#116)
Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-03-02 18:35:32 -05:00
Freddy Boulton
9cc0278985 Video Bugfix + generator (#96)
* Code

* Fix demo

* move to init

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-02-27 12:30:19 -05:00
Freddy Boulton
43e42c1b22 Tidy up connection logic (#90)
* Add code:

* code

* code

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-02-26 18:21:26 -05:00
Freddy Boulton
da872627b9 Set border radius (#84)
Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-02-26 11:53:25 -05:00
Freddy Boulton
ff0cb7197f Fix credentials (#75)
Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-02-25 19:07:40 -05:00
Freddy Boulton
2604d9f7db More tweaks (#71)
* tweaks

* Make icon colors match

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-02-25 10:58:46 -05:00
Freddy Boulton
5a4693ee0b Raise errors automatically (#69)
* Add auto errors

* change code

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-02-24 20:21:25 -05:00
Freddy Boulton
3323ef28a8 Tweak (#66)
* thanks

* title fix

* version bump

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-02-24 01:30:45 -05:00
Freddy Boulton
853d6a06b5 Rebrand to FastRTC (#60)
* Add code

* add code

* add code

* Rename messages

* rename

* add code

* Add demo

* docs + demos + bug fixes

* add code

* styles

* user guide

* Styles

* Add code

* misc docs updates

* print nit

* whisper + pr

* url for images

* whsiper update

* Fix bugs

* remove demo files

* version number

* Fix pypi readme

* Fix

* demos

* Add llama code editor

* Update llama code editor and object detection cookbook

* Add more cookbook demos

* add code

* Fix links for PR deploys

* add code

* Fix the install

* add tts

* TTS docs

* Typo

* Pending bubbles for reply on pause

* Stream redesign (#63)

* better error handling

* Websocket error handling

* add code

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>

* remove docs from dist

* Some docs typos

* more typos

* upload changes + docs

* docs

* better phone

* update docs

* add code

* Make demos better

* fix docs + websocket start_up

* remove mention of FastAPI app

* fastphone tweaks

* add code

* ReplyOnStopWord fixes

* Fix cookbook

* Fix pypi readme

* add code

* bump versions

* sambanova cookbook

* Fix tags

* Llm voice chat

* kyutai tag

* Add error message to all index.html

* STT module uses Moonshine

* Not required from typing extensions

* fix llm voice chat

* Add vpn warning

* demo fixes

* demos

* Add more ui args and gemini audio-video

* update cookbook

* version 9

---------

Co-authored-by: Freddy Boulton <freddyboulton@hf-freddy.local>
2025-02-24 01:13:42 -05:00