# Matcha-TTS: A fast TTS architecture with conditional flow matching ##### [Shivam Mehta][shivam_profile], [Ruibo Tu][ruibo_profile], [Jonas Beskow][jonas_profile], [Éva Székely][eva_profile], and [Gustav Eje Henter][gustav_profile]

We propose 🍵 Matcha-TTS, a new approach to non-autoregressive neural TTS, that uses [conditional flow matching](https://arxiv.org/abs/2210.02747) (similar to [rectified flows](https://arxiv.org/abs/2209.03003)) to speed up ODE-based speech synthesis. Our method: - Is probabilistic - Has compact memory footprint - Sounds highly natural - Is very fast to synthesise from See below for audio examples, or read [our ICASSP 2024 paper][arxiv_link] for more details. Code is available in our [GitHub repository][github_link], along with pre-trained models. You can also [try 🍵 Matcha-TTS in your browser on HuggingFace 🤗 spaces][hf_space]. [shivam_profile]: https://www.kth.se/profile/smehta [ruibo_profile]: https://www.kth.se/profile/ruibo [jonas_profile]: https://www.kth.se/profile/beskow [eva_profile]: https://www.kth.se/profile/szekely [gustav_profile]: https://people.kth.se/~ghe/ [this_page]: https://shivammehta25.github.io/Matcha-TTS [arxiv_link]: https://arxiv.org/abs/2309.03199 [grad_tts_paper]: https://arxiv.org/abs/2105.06337 [vits_paper]: https://arxiv.org/abs/2106.06103 [fastspeech2_paper]: https://arxiv.org/abs/2006.04558 [github_link]: https://github.com/shivammehta25/Matcha-TTS [hf_space]: https://huggingface.co/spaces/shivammehta25/Matcha-TTS ## Stimuli from the listening test > Click the buttons in the table to load and play the different stimuli. Currently loaded stimulus: MAT-10 : Sentence 1

Audio player:

Transcription:

It had established periodic regular review of the status of four hundred individuals;

System Condition Sentence 1 Sentence 2 Sentence 3 Sentence 4 Sentence 5 Sentence 6
Vocoded
speech
VOC
Matcha-TTS MAT-10
MAT-4
MAT-2
Grad-TTS GRAD-10
GRAD-4
Grad-TTS+CFM GCFM-4
FastSpeech 2 FS2
VITS VITS
## Effect of the number of ODE solver steps

Steps:

System Sentence 1 Sentence 2 Sentence 3
Matcha-TTS
Grad-TTS
Grad-TTS + CFM
## Citation information ``` @inproceedings{mehta2024matcha, title={Matcha-{TTS}: A fast {TTS} architecture with conditional flow matching}, author={Mehta, Shivam and Tu, Ruibo and Beskow, Jonas and Sz{\'e}kely, {\'E}va and Henter, Gustav Eje}, booktitle={Proc. ICASSP}, year={2024} } ``` [![MatchaTTS](https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https://shivammehta25.github.io/Matcha-TTS&count_bg=%23409CFF&title_bg=%23555555&icon=&icon_color=%23E7E7E7&title=Matcha-TTS&edge_flat=false)][this_page]