73 Commits

Author SHA1 Message Date
Freddy Boulton
3abe0a4d8a Fix interactive video (#350)
* n

* remove template

* Add templates

* remove print
2025-06-13 12:22:38 -04:00
AlbertMingXu
8780265659 chore: dispatch starting_recording and stop_recording. (#342)
Co-authored-by: Ming Xu <albertxu@amazon.com>
2025-06-09 18:36:32 -04:00
Shane Blair
f45b23c770 [FIX] Allow usage of Cloudflare tokens if hf_token is missing (#338) 2025-06-05 09:42:56 -04:00
Freddy Boulton
1877720231 Add text mode (#321)
* Pretty good spot

* Working draft

* Fix other mode

* Add js to git

* Working

* Add code

* fix

* Fix

* Add code

* Fix submit race condition

* demo

* fix

* Fix

* Fix
2025-06-03 19:24:21 -04:00
Freddy Boulton
0c146ee45e Pass Websocket to the context if available (#329)
* Add code

* Code

* Fix

* Add code
2025-05-30 13:38:59 -04:00
Sofia Casadei
6f02a2f2a9 chunk speech after s if no pause detected by VAD (#328)
* chunk speech after s if no pause detected by VAD

* add attr descriptions in AlgoOptions

* Fix

---------

Co-authored-by: Freddy Boulton <41651716+freddyaboulton@users.noreply.github.com>
2025-05-27 14:54:33 -04:00
Freddy Boulton
c191f1ce90 Surpress Startup Logs (#319)
* Add code

* code
2025-05-20 12:30:36 -04:00
Freddy Boulton
ae95e973f6 Code (#313) 2025-05-13 12:11:58 -04:00
Mohamed Ted Meftah
bf71b2b0e9 fix: fail to use CLOUDFLARE_TURN_KEY_* even if HF_TOKEN is missing (#307) 2025-05-12 09:24:39 -04:00
Freddy Boulton
4ac69ee219 Increase timeout (#310)
* Increase timeout

* Version 24

* Build
2025-05-12 08:56:10 -04:00
Freddy Boulton
02aef9da58 Add ability to Hide Title in Built-in UI + llama 4 cartesia tweaks (#299)
* merge title

* Fix
2025-04-23 16:01:54 -04:00
Freddy Boulton
745701c79c Add first-class support for Cartesia text-to-speech (#298)
* Demo

* patient intake

* cartesia

* Add cartesia

* Fix

* lint

* Move test

* Fix

* Fix

* Fix

* Fix
2025-04-23 15:15:57 -04:00
Freddy Boulton
24349dee0c Fix TURN credentials for interactive video + other Gemini Audio Video demo tweaks (#297)
* Gemini

* Add code

* demo tweaks
2025-04-23 12:52:47 -04:00
Aman Chauhan
f3308b6e81 Fixed path for telephone/handler in handle_incoming_call (#280)
Co-authored-by: Freddy Boulton <41651716+freddyaboulton@users.noreply.github.com>
2025-04-23 12:39:45 -04:00
Freddy Boulton
074e9c9345 Fix websocket interruption (#291)
* Code

* Fix

* add code

* interruptions

* Add code

* code

* Add code

* Add code

* code
2025-04-22 14:40:19 -04:00
Freddy Boulton
a68023101d Fix Websocket Client Processing (#286)
* Fix

* Add code
2025-04-17 12:21:13 -04:00
Freddy Boulton
c9bca428af Set ice candidates server (#285)
* Add code

* Add code

* Code
2025-04-17 10:20:53 -04:00
Freddy Boulton
b0a666ef55 Add a Medical Agent Example to showcase function calling (#281)
* Demo

* patient intake
2025-04-15 18:37:54 -04:00
Freddy Boulton
54d07bc3c8 Add code (#276) 2025-04-14 09:57:15 -04:00
Marcus Valtonen Örnhag
d42740372c Update old links in pyproject.toml (#270)
* Update old links

* Add email + lint

---------

Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
Co-authored-by: Freddy Boulton <41651716+freddyaboulton@users.noreply.github.com>
2025-04-10 10:12:22 -04:00
Václav Volhejn
58bccddd93 Fix audio type conversion (#259)
* Fix conversion between audio dtypes

* Run Pytest in CI

* Add pytest tests path in pyproject.toml

* Fix usages

* Use other PR's test format (more or less)

* Support legacy arguments

* Fix pyproject.toml and test location

* Omit `test` arg in CI, given by pyproject.toml

---------

Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-04-09 10:00:23 -04:00
Freddy Boulton
837330dcd8 Cloudflare turn integration (#264)
* Turn integration

* Add code:

* type hint

* Fix typehint

* add code

* format

* WIP

* trickle ice

* bump version

* Better docs

* Modify

* code

* Mute icon for whisper

* Add code

* llama 4 demo

* code

* OpenAI interruptions

* fix docs
2025-04-09 09:36:51 -04:00
Marcus Valtonen Örnhag
f70b27bd41 Enforce modern typing (#258)
* Allow UP

* Upgrade typing

* test smolagents

* Change to contextlib

---------

Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
2025-04-08 16:46:12 -04:00
Erik Wasmosy
a07e9439b6 Add started_talking log message in ReplyOnPause and in api.md (#260) 2025-04-07 17:35:53 -04:00
Marcus Valtonen Örnhag
2331079c0f Introduce unit tests (#248)
* Proof-of-concept: unittests

* Add pytest-asyncio dep

* Import Body from stream

* Add test for allow_extra_tracks

* Cleanup decorators

* add test to linting

* fix ruff issues

* Run formatter

* fix

* Dont test every python version

---------

Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-04-07 17:35:25 -04:00
Marcus Valtonen Örnhag
0767030997 Introduce static type checking with pyright (#255) 2025-04-05 14:19:05 -04:00
Freddy Boulton
3147b5979c Add API Reference and llms.txt (#256)
* stream api reference

* docs

* Add code

* Add code

* code
2025-04-04 15:32:06 -04:00
Freddy Boulton
c93176d83a Add code (#254) 2025-04-04 12:08:46 -04:00
Freddy Boulton
8dd17d3216 Add ability to trigger ReplyOnPause without waiting for pause (#250)
* Add code

* Send text or audio demo
2025-04-03 20:19:50 -04:00
Marcus Valtonen Örnhag
aed34825e3 Allow extra tracks (#231) (#249)
* Allow extra tracks (#231)

* Add allow_extra_tracks to WebRTC component

---------

Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
2025-04-03 19:13:13 -04:00
Freddy Boulton
d07bb41a9e Fix (#242) 2025-04-02 18:31:08 -04:00
Freddy Boulton
cfde58fce6 Add code (#241) 2025-04-02 16:39:50 -04:00
Václav Volhejn
cc8d82f233 Improve error handling for websockets (#238)
* Improve error handling for websockets

* Correctly run clean_up
2025-04-02 16:29:03 -04:00
Václav Volhejn
06885d06c4 Ignore output_frame_size parameter (#210) 2025-04-01 14:10:27 -04:00
Marcus Valtonen Örnhag
1f0462371e Improve error message if track kind and modality mismatch (#230)
Co-authored-by: Marcus Valtonen Örnhag <marcus.valtonen.ornhag@ericsson.com>
2025-04-01 14:05:53 -04:00
Freddy Boulton
f742c93235 add code (#223) 2025-03-28 21:12:58 -04:00
Freddy Boulton
8ed27fba78 Close Stream from Backend (#222)
* Close from backend

* Add code
2025-03-28 20:47:34 -04:00
Freddy Boulton
6235b2de61 Add text-to-speech-gallery + reword galleries to be "Plugin Ecosystem" (#218)
* Add code

* Update docs/text_to_speech_gallery.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update docs/text_to_speech_gallery.md

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-03-27 19:06:52 -04:00
Freddy Boulton
7692ffad00 Add code (#211) 2025-03-25 14:42:46 -04:00
Freddy Boulton
e231f793e8 trigger release (#201)
* trigger release

* add code
2025-03-20 21:01:13 -04:00
Freddy Boulton
6742894d3d Add support for trickle ice (#193)
* cherry-pick trickle-ice

* Add code

* Add code

* format
2025-03-20 20:50:45 -04:00
Freddy Boulton
3fed4cb2ad Some Video Fixes (#200)
* FPS control:

* add code

* Add code
2025-03-20 20:45:46 -04:00
Václav Volhejn
3fc441a6f0 Create py.typed (#196) 2025-03-20 14:24:45 -04:00
Freddy Boulton
728a366924 Add js assets (#192) 2025-03-19 12:19:57 -04:00
Freddy Boulton
2a70b4f3ed add code (#189) 2025-03-18 21:38:00 -04:00
Freddy Boulton
44aac8d964 Fix issue when the audio stream mixes sample rates and numpy array data types (#188)
* Fix code

* Fix

* keep same
2025-03-18 18:53:47 -04:00
Freddy Boulton
93b14aae94 Fast phone (#183) 2025-03-17 12:22:03 -04:00
Sofian Mejjoute
66f0a81b76 feat: Add optional startup function to ReplyOnPause (#170)
* feat: Add optional startup function to ReplyOnPause

* feat: Implement startup_fn in ReplyOnStopWords

* refactor: Remove redundant startup_fn implementation in ReplyOnStopWords

* tweaks

* revert

---------

Co-authored-by: Freddy Boulton <alfonsoboulton@gmail.com>
2025-03-11 19:11:29 -04:00
Freddy Boulton
514310691d Bump version (#164)
* Code'

* fix
2025-03-11 13:05:39 -04:00
Freddy Boulton
ee049cd4bc Add code (#160) 2025-03-10 17:03:54 -04:00