This open text-to-speech model needs just seconds of audio to clone your voice
El Reg shows you how to run Zyphra's speech-replicating AI on your own box. Hands on: Palo Alto-based AI startup Zyphra unveiled a pair of open text-to-speech (TTS) models this week that are said to be capable of cloning your voice with as little as five seconds of sample audio. In our testing, we generated realistic results with less than half a minute of recorded speech.