Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
audio-generation
cantonese
chatbot
chatgpt
chinese
cosyvoice
cross-lingual
english
fine-grained
fine-tuning
gpt-4o
japanese
korean
multi-lingual
natural-language-generation
python
text-to-speech
tts
voice-cloning
Updated 2026-01-30 01:31:04 +08:00
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
deep-learning
diffusion-model
diffusion-models
flow-matching
machine-learning
non-autoregressive
probabilistic
probabilistic-machine-learning
text-to-speech
tts
tts-api
tts-engines
Updated 2026-01-20 06:11:39 +08:00
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
onnx
onnx-runtime
onnxruntime
pytorch
speech
speech-processing
vad
voice-activity-detection
voice-commands
voice-control
voice-detection
voice-recognition
Updated 2025-12-30 12:05:45 +08:00
Generate ARKit expression from audio in realtime
Updated 2025-10-24 13:53:58 +08:00
Realtime Video and Audio Streaming with WebRTC and Gradio
Updated 2025-06-30 23:11:02 +08:00
Updated 2025-06-30 11:09:20 +08:00