Ibhokisi lokuxoxa

Premium

Uklonyeliswa kwezwi losuku-zero ngezwi elizwakalayo ngeelwimi ezingu-23

Fast Isivinini
Very Good Ubunjani
Yebo Ukuklonya
23 Izilimi

Ngo Ibhokisi lokuxoxa

s, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone, voice clone

Izici ezibalulekile

Ukulungiswa kwezwi

Uhlu lwezinhlamvu ezingu-1000

Izilimi

Kusuka ku-Arabic kuya ku-Chinese, kufaka phakathi izilimi eziningi ezinkulu zezwe.

Amathegi achazayo

Engeza [laugh], [cough], [chuckle] izibizo ezijwayelekile ze-paralinguistic.

Ukuqonda okukhawulelwe

I-Sub-200ms latency ne-Turbo variant yezinhlelo zesikhathi sangempela.

Sebenzisa izimo

Ukuklonywa kwezwi lokuhlela okuqukethwe Izisebenziso zomsindo eziningi Uhlobo lwezwi lokudweba lombhalo lwemidlalo Ama-assistants omsindo akhethekile

Indlela yokusetshenziswa Ibhokisi lokuxoxa

  1. 1

    Ukungena noma ukuvula idemo

    Dala i-akhawunti emahhala engu-TextToSpeechAI ukuze uqinisekise ama-credits aqalayo angama-200, noma sebenzisa ikhasi lokubonisa ukuzama i-Chatterbox ngaphandle kokungenisa.

  2. 2

    Khetha ibhokisi lokuxoxa bese ungeza i-clip yokubhekisa

    Khetha i-Chatterbox engine, bese ufaka umsindo omncane (imizuzu eminingana) wezwi ofuna ukulisebenzisa. I-Chatterbox zero-shot isebenzisa ngokushesha - akukho qeqesho oludingekayo.

  3. 3

    Faka umbhalo wakho ngezithonjana ezikhethiwe

    Bhala noma chofoza umbhalo okhuluma nganoma yisiphi isilimi esixhasiwe esingu-23, bese ufaka [laugh], [cough], noma [chuckle] tags lapho ufuna khona umsindo ojwayelekile we-paralinguistic.

  4. 4

    Dala umsindo

    Chofoza ukwakha bese u-TextToSpeechAI unikeza umbhalo wakho umsindo we-Chatterbox oklonyeziwe ku-GPU ehostelwe, uchitha ama-credits angama-25 ngamagama angama-1,000.

  5. 5

    Layisha phezulu noma sebenzisa i-API

    Layisha ngezansi ifayela lomsindo eliqediwe, noma usebenzise ukukhishwa okuzenzakalelayo nge-TextToSpeechAI REST API ku-api.texttospeechai.com usebenzisa i-akhawunti yakho.

Ibhokisi lokuxoxa I-API

Yenza ulwimi ngokuzenzakalela usebenzisa i-TextToSpeechAI REST API.

curl -X POST "https://api.texttospeechai.com/v1/generate/" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "I\u002DChatterbox ingaku\u002Dklonela umsindo wakho kusuka emizuzwini embalwa yosandi futhi ikhulume ngezilimi ezingu\u002D23.",
    "voice": "en_US-lessac-medium"
  }'

Imibuzo ebuzwa kaningi

Ibhokisi lokuxoxa liyimodeli yokukhuluma esebenzisa umbhalo-ku-ukukhuluma eyenziwe yi-Resemble AI. Iyakwazi ukuveza noma yisiphi isikhulumi kusuka kumasekondi ambalwa kuphela wokuxhumana kwezwi futhi ikhiqize isikhulumi esicacile, esizwakalayo ngezilimi ezingu-23, zonke ngaphandle kokuzivocavoca kwezwi ngalinye.

Yebo, i-Chatterbox i-MIT licensed - both the code and the model weights - so you can use it freely in commercial products. I-audio ekhiqizwe ifaka i-neural watermark ekhethiwe engakwazi ukukhubazeka, futhi akukho kusetshenziswa kwe-royalties.

Unikeza isiqephu esincane sokuxhumana nganoma iyiphi into ekhulumayo (imizuzu embalwa ilungile) futhi i-Chatterbox ikhipha i-timbre nesimo sokukhuluma esingenayo isikhulumi. Iphinde ikhiqize amagama amasha kakhulu kulo msindo ngaphandle kokuhlela noma uqeqesho, okuwukuthi "i-zero-shot" isho ukuthini.

Ibhokisi lokuxoxa lifunda amathegi akhethekile elinemigqa embhalweni wakho ukungeza umsindo ojwayelekile ongasho lutho: [laugh] ufaka umsindo, [cough] ufaka umsindo, futhi [chuckle] ufaka umsindo oqinile. Faka nje ithegi lapho ufuna khona umsindo, isibonelo "Kuhle kakhulu [laugh] kodwa ngempela...".

Bhala isihloko ngokuqondile ngaphakathi kwesihloko sakho esingeniswe endaweni lapho umsindo kufanele ukwenzeka khona, obhekene nengxenye encane yesihloko sakho. Ibhokisi lokuxoxa linikeza umsindo we-paralinguistic emlanjeni ohlobene, lihlanganisa umsindo obhekene nokukhuluma ukuze kuzwakale ngokuzenzakalela ngaphezu kokuxhumeka.

I-Chatterbox isekela izilimi ezingu-23, kufaka phakathi isi-Arabhu, isi-Danish, isiJalimane, isiGreki, isiNgisi, isiSpanishi, isiFinnish, isiFrentshi, isiHebhere, isiHindi, isi-Italian, isiJapane, isiKorea, isiMalay, isiDutch, isiNorway, isiPolish, isiPutukezi, isiRussia, isiSwedish, isiSwahili, isiTurkish, nesiChinese. Umsindo ofanayo ohlonywe ungakhuluma ngalezi zici zesiNgisi.

Ibhokisi lokuxoxa likhiqiza amagama ngokushesha ku-GPU, futhi i-Turbo ifinyelela ngezansi kwe-200ms latency yokusetshenziswa kwesikhathi sangempela sokuxhumana. Umgangatho ulungile kakhulu, nge-prosody ejwayelekile ne-voice reproduction ethembekile kusuka kuma-reference clip aphansi.

I-Chatterbox idinga cishe i-4-8GB ye-VRAM ngokuya ngezinhlobo, nemodeli ye-Turbo esebenza ngokunethezeka ku-4GB. Ku-TextToSpeechAI awudingi noma yiziphi i-GPU zasendaweni - ukukhiqizwa kusebenza ku-infrastructure yethu ehostelwe.

Ibhokisi lokuxoxa liyinjini esezingeni eliphakeme ebiza ama-credits angama-25 ngamagama angama-1,000. Ama-akhawunti amasha athola ama-credits angama-200 amahhala ukuzama ukuklonya umsindo, futhi uchitha ama-credits kuphela ku-text owenzayo.

Zonke zixhasa ukuklonywa kwezwi lokushaya-ngokwesikhashana, kodwa iChatterbox ifaka izinhlelo eziningi zesiNgisi (23 vs 2) futhi ifaka izixhumanisi ezichazayo zesiNgisi. I-F5-TTS ingakha i-prosodia yaseNgisini ejwayelekile, ngakho-ke khetha i-Chatterbox ukuklonywa kwesiNgisi esiningi kanye namazwi achazayo, ne-F5-TTS ukuhlonishwa kwesiNgisi kuphela.

Zonke zinikeza ukuklonywa kwezwi okusezingeni eliphakeme. I-Chatterbox isekela amagama angu-23 namathegi angaphakathi, ngenkathi i-OpenVoice ifaka ukulawulwa kwe-tone-style (okuhle, obuhlungu, obuhlungu, nokuningi) okuncane kwe-Chatterbox. Khetha i-Chatterbox ukufaka amagama abanzi futhi i-OpenVoice uma ufuna ukudweba amagama acacile.

Yebo. Bhala i-akhawunti emahhala engu-TextToSpeechAI ukuze uthole ama-credits aqalayo angama-200, noma sebenzisa ikhasi lokubonisa ukulalela i-Chatterbox ngaphandle kokungenisa. Layisha phezulu i-clip encane yokubhekisa, bhala umbhalo wakho, futhi wenze umsindo ohlobene emaminithini.

Technical Specs

  • Generation Speed Fast
  • Output Quality Very Good
  • Voice Cloning Supported
  • Languages 23
  • GPU VRAM 4-8GB
  • Credits/1000 chars 25

Try Ibhokisi lokuxoxa Now

Generate your first audio free. No credit card required.

Start Free