Xiang Lyu
634edfadf0
Merge pull request #983 from Shengqiang-Li/main
...
feat: Support DPO
2025-04-08 12:14:51 +08:00
lyuxiang.lx
f0b8e892f6
fix lint
2025-04-07 23:04:01 +08:00
lyuxiang.lx
cfc68f379c
only keep online export
2025-04-07 22:54:13 +08:00
lyuxiang.lx
4951d2ad1a
update
2025-04-07 22:45:47 +08:00
lyuxiang.lx
d9ffd592f6
use static_chunk_size in flow training
2025-04-07 22:34:45 +08:00
lyuxiang.lx
7902d1c17f
set use_flow_cache=True when export
2025-04-07 22:27:17 +08:00
lyuxiang.lx
39ffc50dec
add flow cache inference code
2025-04-07 21:23:09 +08:00
ShengqiangLi
a22873e360
feat: Support DPO
2025-03-16 18:33:42 +08:00
Xiang Lyu
c97b445df4
Merge pull request #1036 from hwangsihu/main
...
Fixed an issue where onnxruntime would not install on Windows
2025-03-10 15:48:04 +08:00
hwangsihu
265507f213
Fixed an issue where onnxruntime would not install on Windows
2025-03-10 16:07:01 +09:00
lyuxiang.lx
a69b7e275d
fix vocoder train
2025-03-07 17:34:13 +08:00
lyuxiang.lx
fcc054f64e
fix hifigan bug
2025-02-18 14:45:43 +08:00
Xiang Lyu
fd45708e4b
Merge pull request #977 from hanasay/main
...
Convert audio to mono while extract speech token
2025-02-16 12:51:04 +08:00
hanasay
296ed4f526
Convert audio to mono while extract speech token
...
modified: tools/extract_speech_token.py
2025-02-14 15:25:45 +08:00
lyuxiang.lx
890300513c
fix bug
2025-02-11 15:56:20 +08:00
lyuxiang.lx
f77c6a85aa
fix bug
2025-02-11 00:07:12 +08:00
lyuxiang.lx
b6d66ce2e3
update
2025-02-08 13:58:41 +08:00
lyuxiang.lx
8e4f252d32
add llm bistream
2025-02-08 12:15:37 +08:00
lyuxiang.lx
79b7dff8d2
add llm train
2025-02-07 17:17:12 +08:00
Xiang Lyu
95e99e0417
Merge pull request #940 from c4fun/fastapi-cosyvoice2
...
Add a inference_instruct2 route to support and defaultly supports cosyvoice2 in fastapi server
2025-02-07 16:30:27 +08:00
c4fun
ba6d8c07ba
revert the main function to original and only preserve the inference endpoint
2025-02-07 16:28:30 +08:00
lyuxiang.lx
2a3e033ee1
fix lint
2025-02-07 16:23:03 +08:00
Xiang Lyu
da3f129977
Merge pull request #936 from huyyxy/main
...
feat(docker): update CUDA base image to 12.4.1 for TensorRT support
2025-02-06 11:39:53 +08:00
c4fun
2889c25863
supports and defaultly supports cosyvoice2 in fastapi server
2025-01-27 20:51:57 +08:00
lyuxiang.lx
24f796a2b1
Merge branch 'main' into dev/lyuxiang.lx
2025-01-26 16:56:18 +08:00
lyuxiang.lx
fd1a951a6c
add flow unified training
2025-01-26 16:56:06 +08:00
huyyxy
aa65200713
feat(docker): update CUDA base image to 12.4.1 for TensorRT support
...
- Upgrade base image from nvidia/cuda:11.8.0-cudnn8-devel-ubuntu22.04 to nvidia/cuda:12.4.1-cudnn-devel-ubuntu22.04
- Enable CUDA 12.4 runtime environment
- Ensure TensorRT dependency compatibility
- Validation steps:
- Verify CUDA version via nvidia-smi after build
- Test import tensorrt in container without errors
Closes #935
2025-01-26 12:33:50 +08:00
Xiang Lyu
86e26f54c7
Merge pull request #930 from sd0ric4/main
...
feat: add POST endpoints to resolve browser error about GET request wi…
2025-01-26 10:17:15 +08:00
sd0ric4
f1c214377c
fix: add POST endpoints to resolve browser error about GET request with body
2025-01-24 19:41:14 +08:00
lyuxiang.lx
aea75207dd
fix cache bug
2025-01-24 11:07:26 +08:00
Xiang Lyu
369ea80bd4
Merge pull request #926 from Vinkle-hzt/main
...
fix bistream extra token
2025-01-23 22:02:25 +08:00
huzetao.hzt
69518b2bde
fix bistream extra token
2025-01-23 19:08:18 +08:00
lyuxiang.lx
1c062ab381
add flow decoder cache
2025-01-23 16:48:13 +08:00
Xiang Lyu
276cfa02b6
Merge pull request #925 from FunAudioLLM/dev/lyuxiang.lx
...
fix pitch computation
2025-01-23 15:45:42 +08:00
lyuxiang.lx
190840b8dc
fix pitch computation
2025-01-23 15:44:03 +08:00
lyuxiang.lx
c6c3f27ecc
fix typo
2025-01-23 11:27:10 +08:00
Xiang Lyu
49761d2474
Merge pull request #924 from FunAudioLLM/dev/lyuxiang.lx
...
add llm bistream
2025-01-23 10:19:21 +08:00
lyuxiang.lx
07e477519b
add llm bistream
2025-01-23 10:12:06 +08:00
Xiang Lyu
41c5e8cd6d
Merge pull request #887 from Wauplin/patch-1
...
Fix diffusers / huggingface_hub compatibility in requirements.txt
2025-01-15 18:23:05 +08:00
Lucain
66ceaff472
Fix diffusers / huggingface_hub compatibility in requirements.txt
...
As mentioned in https://github.com/FunAudioLLM/CosyVoice/issues/516#issuecomment-2592067949 and https://github.com/FunAudioLLM/CosyVoice/issues/527#issuecomment-2592067100 , it is more future-proof to upgrade `diffusers` version rather than downgrading `huggingface_hub` to an old one. This will also fix the `cannot import name 'cached_download' from 'huggingface_hub'` issue without relying on outdated packages.
Sorry again for the inconvenience 🙏
2025-01-15 10:21:08 +01:00
Xiang Lyu
07a314767f
Merge pull request #884 from FunAudioLLM/dev/lyuxiang.lx
...
update
2025-01-14 22:56:21 +08:00
lyuxiang.lx
0b75c3a03f
update
2025-01-14 22:55:13 +08:00
Xiang Lyu
b4dea3d64a
Merge pull request #878 from FunAudioLLM/dev/lyuxiang.lx
...
update
2025-01-13 10:31:12 +08:00
lyuxiang.lx
43f9e9ab20
update
2025-01-13 10:30:13 +08:00
Xiang Lyu
025f6f0f7f
Merge pull request #875 from lsby/main
...
fix docker python version
2025-01-13 10:27:49 +08:00
Xiang Lyu
69051d11ec
Merge pull request #876 from FunAudioLLM/dev/lyuxiang.lx
...
fix bug
2025-01-12 21:21:25 +08:00
lyuxiang.lx
59fa786769
fix bug
2025-01-12 21:18:41 +08:00
hbybyyang
f38f594303
fix docker python version
2025-01-12 15:59:58 +08:00
Xiang Lyu
eb4d5d053f
Merge pull request #868 from FunAudioLLM/dev/lyuxiang.lx
...
move prompt wav to asset
2025-01-10 17:53:32 +08:00
lyuxiang.lx
d450c32296
update
2025-01-10 17:52:25 +08:00