CosyVoice

Mirror of https://github.com/FunAudioLLM/CosyVoice.git, synced 2026-02-05 18:09:24 +08:00

Author	SHA1	Message	Date
lyuxiang.lx	39ffc50dec	add flow cache inference code	2025-04-07 21:23:09 +08:00
Xiang Lyu	08312f4c46	Merge pull request #1100 from jingfelix/fix/dockerfile-dependency fix: missing dependency in `runtime/python/Dockerfile`	2025-03-25 08:30:26 +08:00
jingfelix	c6d8737336	fix: missing dependency in runtime/python/Dockerfile Signed-off-by: jingfelix <jingfelix@outlook.com>	2025-03-22 23:34:59 +08:00
ShengqiangLi	a22873e360	feat: Support DPO	2025-03-16 18:33:42 +08:00
Xiang Lyu	c97b445df4	Merge pull request #1036 from hwangsihu/main Fixed an issue where onnxruntime would not install on Windows	2025-03-10 15:48:04 +08:00
hwangsihu	265507f213	Fixed an issue where onnxruntime would not install on Windows	2025-03-10 16:07:01 +09:00
lyuxiang.lx	a69b7e275d	fix vocoder train	2025-03-07 17:34:13 +08:00
lyuxiang.lx	fcc054f64e	fix hifigan bug	2025-02-18 14:45:43 +08:00
Xiang Lyu	fd45708e4b	Merge pull request #977 from hanasay/main Convert audio to mono while extract speech token	2025-02-16 12:51:04 +08:00
hanasay	296ed4f526	Convert audio to mono while extract speech token modified： tools/extract_speech_token.py	2025-02-14 15:25:45 +08:00
lyuxiang.lx	890300513c	fix bug	2025-02-11 15:56:20 +08:00
lyuxiang.lx	f77c6a85aa	fix bug	2025-02-11 00:07:12 +08:00
lyuxiang.lx	b6d66ce2e3	update	2025-02-08 13:58:41 +08:00
lyuxiang.lx	8e4f252d32	add llm bistream	2025-02-08 12:15:37 +08:00
lyuxiang.lx	79b7dff8d2	add llm train	2025-02-07 17:17:12 +08:00
Xiang Lyu	95e99e0417	Merge pull request #940 from c4fun/fastapi-cosyvoice2 Add a inference_instruct2 route to support and defaultly supports cosyvoice2 in fastapi server	2025-02-07 16:30:27 +08:00
c4fun	ba6d8c07ba	revert the main function to original and only preserve the inference endpoint	2025-02-07 16:28:30 +08:00
lyuxiang.lx	2a3e033ee1	fix lint	2025-02-07 16:23:03 +08:00
Xiang Lyu	da3f129977	Merge pull request #936 from huyyxy/main feat(docker): update CUDA base image to 12.4.1 for TensorRT support	2025-02-06 11:39:53 +08:00
c4fun	2889c25863	supports and defaultly supports cosyvoice2 in fastapi server	2025-01-27 20:51:57 +08:00
lyuxiang.lx	24f796a2b1	Merge branch 'main' into dev/lyuxiang.lx	2025-01-26 16:56:18 +08:00
lyuxiang.lx	fd1a951a6c	add flow unified training	2025-01-26 16:56:06 +08:00
huyyxy	aa65200713	feat(docker): update CUDA base image to 12.4.1 for TensorRT support - Upgrade base image from nvidia/cuda:11.8.0-cudnn8-devel-ubuntu22.04 to nvidia/cuda:12.4.1-cudnn-devel-ubuntu22.04 - Enable CUDA 12.4 runtime environment - Ensure TensorRT dependency compatibility - Validation steps: - Verify CUDA version via nvidia-smi after build - Test import tensorrt in container without errors Closes #935	2025-01-26 12:33:50 +08:00
Xiang Lyu	86e26f54c7	Merge pull request #930 from sd0ric4/main feat: add POST endpoints to resolve browser error about GET request wi…	2025-01-26 10:17:15 +08:00
sd0ric4	f1c214377c	fix: add POST endpoints to resolve browser error about GET request with body	2025-01-24 19:41:14 +08:00
lyuxiang.lx	aea75207dd	fix cache bug	2025-01-24 11:07:26 +08:00
Xiang Lyu	369ea80bd4	Merge pull request #926 from Vinkle-hzt/main fix bistream extra token	2025-01-23 22:02:25 +08:00
huzetao.hzt	69518b2bde	fix bistream extra token	2025-01-23 19:08:18 +08:00
lyuxiang.lx	1c062ab381	add flow decoder cache	2025-01-23 16:48:13 +08:00
Xiang Lyu	276cfa02b6	Merge pull request #925 from FunAudioLLM/dev/lyuxiang.lx fix pitch computation	2025-01-23 15:45:42 +08:00
lyuxiang.lx	190840b8dc	fix pitch computation	2025-01-23 15:44:03 +08:00
lyuxiang.lx	c6c3f27ecc	fix typo	2025-01-23 11:27:10 +08:00
Xiang Lyu	49761d2474	Merge pull request #924 from FunAudioLLM/dev/lyuxiang.lx add llm bistream	2025-01-23 10:19:21 +08:00
lyuxiang.lx	07e477519b	add llm bistream	2025-01-23 10:12:06 +08:00
Xiang Lyu	41c5e8cd6d	Merge pull request #887 from Wauplin/patch-1 Fix diffusers / huggingface_hub compatibility in requirements.txt	2025-01-15 18:23:05 +08:00
Lucain	66ceaff472	Fix diffusers / huggingface_hub compatibility in requirements.txt As mentioned in https://github.com/FunAudioLLM/CosyVoice/issues/516#issuecomment-2592067949 and https://github.com/FunAudioLLM/CosyVoice/issues/527#issuecomment-2592067100, it is more future-proof to upgrade `diffusers` version rather than downgrading `huggingface_hub` to an old one. This will also fix the `cannot import name 'cached_download' from 'huggingface_hub'` issue without relying on outdated packages. Sorry again for the inconvenience 🙏	2025-01-15 10:21:08 +01:00
Xiang Lyu	07a314767f	Merge pull request #884 from FunAudioLLM/dev/lyuxiang.lx update	2025-01-14 22:56:21 +08:00
lyuxiang.lx	0b75c3a03f	update	2025-01-14 22:55:13 +08:00
Xiang Lyu	b4dea3d64a	Merge pull request #878 from FunAudioLLM/dev/lyuxiang.lx update	2025-01-13 10:31:12 +08:00
lyuxiang.lx	43f9e9ab20	update	2025-01-13 10:30:13 +08:00
Xiang Lyu	025f6f0f7f	Merge pull request #875 from lsby/main fix docker python version	2025-01-13 10:27:49 +08:00
Xiang Lyu	69051d11ec	Merge pull request #876 from FunAudioLLM/dev/lyuxiang.lx fix bug	2025-01-12 21:21:25 +08:00
lyuxiang.lx	59fa786769	fix bug	2025-01-12 21:18:41 +08:00
hbybyyang	f38f594303	fix docker python version	2025-01-12 15:59:58 +08:00
Xiang Lyu	eb4d5d053f	Merge pull request #868 from FunAudioLLM/dev/lyuxiang.lx move prompt wav to asset	2025-01-10 17:53:32 +08:00
lyuxiang.lx	d450c32296	update	2025-01-10 17:52:25 +08:00
lyuxiang.lx	e84d72a4d9	move prompt wav to asset	2025-01-10 17:51:21 +08:00
Xiang Lyu	06e86619c2	Merge pull request #867 from FunAudioLLM/dev/lyuxiang.lx Dev/lyuxiang.lx	2025-01-10 16:46:11 +08:00
lyuxiang.lx	e257c16796	Merge branch 'dev/lyuxiang.lx' of github.com:FunAudioLLM/CosyVoice into dev/lyuxiang.lx	2025-01-10 16:44:01 +08:00
lyuxiang.lx	87475ccf41	fix conflict	2025-01-10 16:43:31 +08:00

528 Commits