lucian.harhata
Member
Heya all
I've been testing so far best TTS tools out there that I can use in videos:
My findings:
Eleven Labs
XTTS-v2 & other open source
I also used TTS Arena https://huggingface.co/spaces/TTS-AGI/TTS-Arena to check current live leader board, and Fish Audio gets pretty close.
My conclusion:
Anybody else tested TTS so far? What are your findings / feedback so far?
I've been testing so far best TTS tools out there that I can use in videos:
- Eleven Labs (proprietary https://elevenlabs.io/, cloud)
- Fish Audio (open source https://fish.audio/, cloud)
- XTTS-v2 (open source, locally)
- other open source solutions (Tortoise TTS etc)
My findings:
Eleven Labs
- great voice, can change intonation and understand text
- very good end result
- paid version
- limited number of chars, i.e. 30k chars for $5; you end up using the chars quite quickly
- good voice, it gets close to Eleven Labs
- but it does not understand all text, if there are some characters in the text it goes wild
- i got some strange shussh type of sounds that were very weird (no voice, just like a wind sound)
- paid version so far shows on their pricing page that it is Unlimited usage
- so far i have not been charged more and created heaps
XTTS-v2 & other open source
- quite time consuming to set them up
- sometimes you'll get errors with missing packages
- a loooooong time to produce small amounts of text to voice, i.e. for 2 sentences took like 2 minutes on a powerful PC
- I gave up bcz of wasted time
- quality of audio was mediocre compared to the other two above
- the only one that gets close to Fish Audio is XTTS-v2, but I am not impressed personally
I also used TTS Arena https://huggingface.co/spaces/TTS-AGI/TTS-Arena to check current live leader board, and Fish Audio gets pretty close.
My conclusion:
- Eleven Labs is good for something high quality sound, quite impressive.
- But for going at scale, I think costs might build up quite fast.
- Fish Audio gets close in quality to Eleven Labs but there is more to tweaking it / and looks to have some bugs.
- I'll report back as i get more insights into their bugs.
Anybody else tested TTS so far? What are your findings / feedback so far?