Are there any free/open-source TTS options out there that are on the same level as Google Cloud’s? I tried a lot of free ones, but they are absolutely awful and still sound like my Amiga did 30 years ago. With LLMs being available as open source, I am hoping there’s also a good TTS offering I just haven’t found yet.
Piper is my choice. Very easy to use from the command line, fairly good sounding voices. Prior to that, for years (decades?) I used espeak-ng, had a very robotic voice but articulated almost everything very clearly, and I got used to it so didn’t actually mind.
Came here to recommend Piper. It’s an excellent TTS engine.
Espeak doesn’t get better, but nor does it get worse
Wow.
Have you tried Piper?
Yes, but if you compare it to https://cloud.google.com/text-to-speech?hl=en (scroll down a bit and you can try it) and the Neural2 model, it sounds like shit. I mean, it’s great to see that there are efforts, but it just pales in comparison.
Well, it’s about as good as you’re going to get right now.
https://github.com/rsxdalv/tts-generation-webui and https://github.com/gitmylo/audio-webui. I use them all the time. Taking a sample of 10s i get amazing results.
Cool, I’ll give those a try!
Balabolka was/is my go to for TTS. It creates audio files as well for later if you need. Used it to make plenty of audio books in the past.
I would say Elevenlabs is the best but unfortunately not free.
If you need it for a short while it might be worth it.
I tried Piper with different models, and a couple of FOSS alternatives but the output quality was definitely subpar.
I would say soon we will have good FOSS models, but for the time being that’s not the case.