lucian.harhata
Member
Heya all
I've been testing so far best TTS tools out there that I can use in videos:
My findings:
Eleven Labs
XTTS-v2 & other open source
I also used TTS Arena https://huggingface.co/spaces/TTS-AGI/TTS-Arena to check current live leader board, and Fish Audio gets pretty close.
My conclusion:
Anybody else tested TTS so far? What are your findings / feedback so far?
				
			I've been testing so far best TTS tools out there that I can use in videos:
- Eleven Labs (proprietary https://elevenlabs.io/, cloud)
 - Fish Audio (open source https://fish.audio/, cloud)
 - XTTS-v2 (open source, locally)
 - other open source solutions (Tortoise TTS etc)
 
My findings:
Eleven Labs
- great voice, can change intonation and understand text
 - very good end result
 - paid version
 - limited number of chars, i.e. 30k chars for $5; you end up using the chars quite quickly
 
- good voice, it gets close to Eleven Labs
 - but it does not understand all text, if there are some characters in the text it goes wild
 - i got some strange shussh type of sounds that were very weird (no voice, just like a wind sound)
 - paid version so far shows on their pricing page that it is Unlimited usage
- so far i have not been charged more and created heaps
 
 
XTTS-v2 & other open source
- quite time consuming to set them up
 - sometimes you'll get errors with missing packages
 - a loooooong time to produce small amounts of text to voice, i.e. for 2 sentences took like 2 minutes on a powerful PC
 - I gave up bcz of wasted time
 - quality of audio was mediocre compared to the other two above
 - the only one that gets close to Fish Audio is XTTS-v2, but I am not impressed personally
 
I also used TTS Arena https://huggingface.co/spaces/TTS-AGI/TTS-Arena to check current live leader board, and Fish Audio gets pretty close.
My conclusion:
- Eleven Labs is good for something high quality sound, quite impressive.
- But for going at scale, I think costs might build up quite fast.
 
 - Fish Audio gets close in quality to Eleven Labs but there is more to tweaking it / and looks to have some bugs.
- I'll report back as i get more insights into their bugs.
 
 
Anybody else tested TTS so far? What are your findings / feedback so far?