Imini
UltraI-TTS ephathelene nezingxoxo nezwi lokuklonya kanye nesandi esingasho lutho
Ngo Imini
ing the ability to generate a text-to-speech model that is 100% accurate and accurate to the real-world. Dia is a 1.6B parameter text-to-speech model that is 100% accurate and accurate to the real-world. Dia is a 1.6B parameter text-to-speech model that is 100% accurate and accurate to the real-world. Dia is a 1.6B parameter model
Izici ezibalulekile
Ukukhiqizwa kwebhokisi lenkulumo
Yenza ukuxhumana okujwayelekile okuningi-okusho ngezwi elihlukile nokuthatha isikhala.
Izisindo ezingasho lutho
Engeza [ukuthanda], [ukuthanda], [ubuhlungu], (ubuhlungu) ukuze uveze amagama ajwayelekile.
Ukuklonywa kwezwi
Uhlu lwezinhlamvu ezisuka kumasekondi angama-5-10 wezinhlamvu ezibhekiswe kuzo ukuze ukhulume ngokuzimela.
Ukuxhumana okujwayelekile
Amapharamitha ka-1.6B akhiqiza ukukhuluma okujwayelekile nokulalela okujwayelekile.
Sebenzisa izimo
Indlela yokusetshenziswa Imini
-
1
Ubhalise mahhala noma uvule idemo
Dala i-akhawunti emahhala ye-TextToSpeechAI ukuze ufune ama-credits akho okuqala, noma uvule idemo engabhaliswanga ukuzama ukuxhumana kwe-Dia ngokushesha.
-
2
Khetha i-engine ye-Dia
Ku-TTS dashboard khetha i-Dia kusuka ku-engine list. I-Dia iyi-dialogue-oriented, ultra-tier model ne-multi-speaker ne-voice-cloning support.
-
3
Bhala iskripti lezingxoxo ngezithonjana
Yenza ingxoxo yakho usebenzisa [S1] ne [S2] ukuphawula umsindo wesikhulumi ngayinye, bese ufaka izixhumanisi ezingasho lutho ezifana ne [laughs], [sighs], [coughs], noma (gasps) lapho ufuna khona umphumela ojwayelekile.
-
4
Dala umsindo
Chofoza yenza ukuthumela iskripthi sakho seDia ku-GPUs zethu ezihoxisiwe. I-Dia inikeza umsindo we-dialog ophindwe kabili nge-turn-taking kanye ne-nonverbal tags yakho kwifayela le-audio elilodwa.
-
5
Layisha phansi noma thinta i-API
Layisha ngezansi umbhalo oqediwe wezingxoxo kufomethi oyikhethile, noma uyisebenzise ngokuzenzakalela ngokushicilela iskripthi esifanayo [S1]/[S2] ku-TextToSpeechAI API nge-akhawunti yakho.
Imini I-API
Yenza ulwimi ngokuzenzakalela usebenzisa i-TextToSpeechAI REST API.
curl -X POST "https://api.texttospeechai.com/v1/generate/" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"text": "[S1] Ngikubonga! Unjani namuhla? [udlala] [S2] Ngisebenza kahle, ngiyabonga ngokubuza!",
"voice": "en_US-lessac-medium"
}'
Imibuzo ebuzwa kaningi
Technical Specs
- Generation Speed Medium
- Output Quality Excellent
- Voice Cloning Supported
- Languages 1
- GPU VRAM 10GB
- Credits/1000 chars 50