Commit Graph

165 Commits

Author SHA1 Message Date
bingochaos
68e66d839e update lam version to 0.0.8 2025-06-30 23:11:02 +08:00
neil.xh
f476f9cf29 gs对话接入
本次代码评审新增并完善了gs视频聊天功能,包括前后端接口定义、状态管理及UI组件实现,并引入了新的依赖库以支持更多互动特性。
Link: https://code.alibaba-inc.com/xr-paas/gradio_webrtc/codereview/21273476
* 更新python 部分

* 合并videochat前端部分

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 替换audiowave

* 导入路径修改

* 合并websocket mode逻辑

* feat: gaussian avatar chat

* 增加其他渲染的入参

* feat: ws连接和使用

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 右边距离超出容器宽度,则向左移动

* 配置传递

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 高斯包异常

* 同步webrtc_utils

* 更新webrtc_utils

* 兼容on_chat_datachannel

* 修复设备名称列表没有正常显示的问题

* copy 传递 webrtc_id

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 保证webrtc 完成后再进行websocket连接

* feat: 音频表情数据接入

* dist 上传

* canvas 隐藏

* feat: 高斯文件下载进度透出

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 修改无法获取权限问题

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 先获取权限再获取设备

* fix: gs资源下载完成前不处理ws数据

* fix: merge

* 话术调整

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 修复设备切换后重新对话,又切换回默认设备的问题

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 更新localvideo 尺寸

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 不能默认default

* 修改音频权限问题

* 更新打包结果

* fix: 对话按钮状态跟gs资源挂钩,删除无用代码

* fix: merge

* feat: gs渲染模块从npm包引入

* fix

* 新增对话记录

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 样式修改

* 更新包

* fix: gs数字人初始化位置和静音

* 对话记录滚到底部

* 至少100%高度

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 略微上移文本框

* 开始连接时清空对话记录

* fix: update gs render npm

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 逻辑保证

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* feat: 音频初始化配置是否静音

* actionsbar在有字幕时调整位置

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 样式优化

* feat: 增加readme

* fix: 资源图片

* fix: docs

* fix: update gs render sdk

* fix: gs模式下画面位置计算

* fix: update readme

* 设备判断,太窄处理

* Merge branch 'feature/update-fastrtc-0.0.19' of gitlab.alibaba-inc.com:xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* 是否有权限和是否有设备分开

* feat: gs 下载和加载钩子函数分离

* Merge branch 'feature/update-fastrtc-0.0.19' of http://gitlab.alibaba-inc.com/xr-paas/gradio_webrtc into feature/update-fastrtc-0.0.19

* fix: update gs render sdk

* 替换

* dist

* 上传文件

* del
2025-04-16 19:09:04 +08:00
Václav Volhejn
06885d06c4 Ignore output_frame_size parameter (#210) 2025-04-01 14:10:27 -04:00
Marcus Valtonen Örnhag
1f0462371e Improve error message if track kind and modality mismatch (#230)
Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
2025-04-01 14:05:53 -04:00
Freddy Boulton
5636736c56 bump version (#226) 2025-03-31 13:03:27 -04:00
Freddy Boulton
f742c93235 add code (#223) 2025-03-28 21:12:58 -04:00
Freddy Boulton
8ed27fba78 Close Stream from Backend (#222)
* Close from backend

* Add code
2025-03-28 20:47:34 -04:00
Siddharth Garg
71743acb64 Add Kroko-ASR model to STT gallery (#219) 2025-03-28 12:37:58 -04:00
Marcus Valtonen Örnhag
e3f08f87f2 Remove twice instantiated event (#221)
Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
2025-03-28 12:25:24 -04:00
Freddy Boulton
6235b2de61 Add text-to-speech-gallery + reword galleries to be "Plugin Ecosystem" (#218)
* Add code

* Update docs/text_to_speech_gallery.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update docs/text_to_speech_gallery.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-03-27 19:06:52 -04:00
Freddy Boulton
b18f6331a3 Add code (#212) 2025-03-25 14:45:23 -04:00
Freddy Boulton
7692ffad00 Add code (#211) 2025-03-25 14:42:46 -04:00
Freddy Boulton
e231f793e8 trigger release (#201)
* trigger release

* add code
2025-03-20 21:01:13 -04:00
Freddy Boulton
6742894d3d Add support for trickle ice (#193)
* cherry-pick trickle-ice

* Add code

* Add code

* format
2025-03-20 20:50:45 -04:00
Freddy Boulton
3fed4cb2ad Some Video Fixes (#200)
* FPS control:

* add code

* Add code
2025-03-20 20:45:46 -04:00
Freddy Boulton
bce7cb95a6 Add code (#197) 2025-03-20 14:47:20 -04:00
Sourabh
ea36257abf Added HumAwareVAD to VAD Gallery (#194) 2025-03-20 14:41:39 -04:00
Václav Volhejn
3fc441a6f0 Create py.typed (#196) 2025-03-20 14:24:45 -04:00
Freddy Boulton
728a366924 Add js assets (#192) 2025-03-19 12:19:57 -04:00
Freddy Boulton
103381dac2 bump version (#190) 2025-03-18 21:43:58 -04:00
Freddy Boulton
2a70b4f3ed add code (#189) 2025-03-18 21:38:00 -04:00
Freddy Boulton
44aac8d964 Fix issue when the audio stream mixes sample rates and numpy array data types (#188)
* Fix code

* Fix

* keep same
2025-03-18 18:53:47 -04:00
Václav Volhejn
5a196868dd Fix outdated import (#185) 2025-03-18 12:12:28 -04:00
Freddy Boulton
93b14aae94 Fast phone (#183) 2025-03-17 12:22:03 -04:00
MechanicCoder
efff9d44dc Add example for "Talk to Azure OpenAi" (#181)
* Add example for "Talk to Azure OpenAi"

* Code

---------

Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-03-17 12:15:22 -04:00
Sofia Casadei
3ae8f89ad5 Add on-device whisper example to cookbook (#179) 2025-03-16 18:25:49 -04:00
swairshah
8a1b15d620 add fastrc with Elecron app example to cookbook (#178) 2025-03-16 18:24:49 -04:00
Freddy Boulton
c9b67726ba Add code (#173) 2025-03-13 19:56:37 -04:00
Freddy Boulton
4fb28f3bf2 0.0.15 (#172) 2025-03-13 19:01:27 -04:00
Sofian Mejjoute
66f0a81b76 feat: Add optional startup function to ReplyOnPause (#170)
* feat: Add optional startup function to ReplyOnPause

* feat: Implement startup_fn in ReplyOnStopWords

* refactor: Remove redundant startup_fn implementation in ReplyOnStopWords

* tweaks

* revert

---------

Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-03-11 19:11:29 -04:00
Freddy Boulton
514310691d Bump version (#164)
* Code'

* fix
2025-03-11 13:05:39 -04:00
Freddy Boulton
24ed2ca178 Add docs on how to contribute (#161)
* Add code

* add code

* Add code
2025-03-10 17:23:25 -04:00
Freddy Boulton
ee049cd4bc Add code (#160) 2025-03-10 17:03:54 -04:00
Ryan Ellis
7ad579c07a Added to gallery (#159)
* Added to gallery

* Add code

* Fix

---------

Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-03-10 15:17:07 -04:00
Freddy Boulton
51f1fafa3a Add microphone mute (#158)
* Add code

* add code
2025-03-10 14:53:08 -04:00
Freddy Boulton
ed59834378 Fix Warning (#157) 2025-03-10 13:17:47 -04:00
Sourabh
de95bc2caa fix: ensure 'model' is copied in ReplyOnPause.copy() (#155) 2025-03-10 11:44:43 -04:00
Freddy Boulton
dcad14768b Add code (#153) 2025-03-08 12:36:58 -05:00
Freddy Boulton
e26eb4567f Add code (#151) 2025-03-07 19:27:33 -05:00
Sourabh
f95c3c78be fix: remove unused user-provided Silero option (#150) 2025-03-07 18:19:26 -05:00
Freddy Boulton
2766a941d2 Community stt models (#149)
* stt models

* add code
2025-03-07 17:04:57 -05:00
Freddy Boulton
504eb452f0 stt models (#147) 2025-03-07 17:03:11 -05:00
Freddy Boulton
cbbfa17679 Add Method for loading community Vad Models (#136)
* Add code

* add code
2025-03-07 16:27:18 -05:00
Rohan Richard
6905810f37 Adding nextjs + 11labs + openai streaming demo (#139)
* adding nextjs + 11labs + openai streaming demo

* removing package-lock
2025-03-07 14:24:23 -05:00
Freddy Boulton
4cac472ff4 release (#146) 2025-03-07 14:21:12 -05:00
Freddy Boulton
6748a8df49 fixes (#145) 2025-03-07 14:15:37 -05:00
Freddy Boulton
11dae295da add code (#137) 2025-03-06 19:56:46 -05:00
Michael Hart
7dfee78261 Simplify Cloudflare config with new endpoint (#135)
Old instructions will still work, but we now have an endpoint that matches the `RTCPeerConnection` signature exactly.
2025-03-06 19:25:14 -05:00
Mahimai Raja
f59e8c3a49 feat: Added documentation for twilio integration (#125)
* feat: Added documentation for twilio integration

* Add code

---------

Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-03-06 17:23:56 -05:00
Freddy Boulton
8f6287cea3 Improve Interruption Handling (#134)
* Clear websocket queue on interrupt

* add code
2025-03-06 13:42:56 -05:00