About Bark
Bark is a transformer-based text-to-audio model that generates expressive speech, including laughter, sighs, and other non-verbal sounds. Unlike traditional TTS, Bark takes cues from the context of the prompt, so its output can carry emotion rather than a flat reading. It supports multiple languages and can also generate simple music and sound effects.
Key Features
Emotional Expression
Generate speech with laughter, sighs, gasps, and genuine emotions.
Emotion Markers
Use [laughter], [sighs], CAPS for emphasis, and ... for hesitation.
Multilingual
Supports 13 languages with natural accents and pronunciation.
Music & Effects
Can generate simple music and environmental sounds.
Speaker Presets
Multiple pre-trained speaker voices with different styles.
Open Source
MIT licensed with full commercial use rights.
Bark Voices
Showing 12 of 130 voices:
- Bark Chinese Speaker 0 (ZH)
- Bark Chinese Speaker 1 (ZH)
- Bark Chinese Speaker 2 (ZH)
- Bark Chinese Speaker 3 (ZH)
- Bark Chinese Speaker 4 (ZH)
- Bark Chinese Speaker 5 (ZH)
- Bark Chinese Speaker 6 (ZH)
- Bark Chinese Speaker 7 (ZH)
- Bark Chinese Speaker 8 (ZH)
- Bark Chinese Speaker 9 (ZH)
- Bark English Speaker 0 (EN)
- Bark English Speaker 1 (EN)
How to Use Bark
1. Sign up free and open the demo
Create a free TextToSpeechAI account to claim your starter credits, or use the no-signup demo to try Bark right away. Free credits are enough to generate several expressive Bark clips before you upgrade.
2. Pick a Bark voice
Open the voice library and select a Bark speaker preset that matches the tone you want. Bark voices are tagged as the premium tier (25 credits per 1,000 characters) and are tuned for emotional, character-style narration.
3. Enter text with emotion markers
Type your script and embed Bark emotion markers inline: [laughter] for laughs, [sighs] for sighs, [gasps] for gasps, ... for a pause, and CAPS for emphasis. For example: "Oh wow! [laughter] This is AMAZING... I can't believe it!"
4. Generate the audio
Click Generate and Bark renders your text into expressive speech, turning each marker into the matching sound. Generation is slower than lightweight engines because of Bark's transformer model, so allow a few extra seconds per sentence.
5. Download or use the API
Preview the result, then download it as MP3, WAV, or OGG. To automate Bark in your own app, call the TextToSpeechAI API with a Bark voice and the same marker-rich text to get back the expressive audio.
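Before you generate audio from a script with markers (step 3), it can help to check for mistyped ones. The Python sketch below is only an illustration: the helper names are hypothetical, and the set of known markers contains just the three named in the steps above. Bark may accept other markers, so treat an "unknown" result as a prompt to double-check, not as an error.

```python
import re

# Markers named in the steps above. Assumption: this set is incomplete,
# so extend it with any other markers you have confirmed work.
KNOWN_MARKERS = {"[laughter]", "[sighs]", "[gasps]"}

# Matches any bracketed token such as [laughter].
MARKER_PATTERN = re.compile(r"\[[^\[\]]+\]")


def find_markers(script: str) -> list[str]:
    """Return every bracketed marker in the script, in order of appearance."""
    return MARKER_PATTERN.findall(script)


def unknown_markers(script: str) -> list[str]:
    """Return bracketed markers that are not in KNOWN_MARKERS."""
    return [m for m in find_markers(script) if m.lower() not in KNOWN_MARKERS]


example = "Oh wow! [laughter] This is AMAZING... I can't believe it!"
print(find_markers(example))                       # ['[laughter]']
print(unknown_markers("Hmm [sigh] that is odd."))  # ['[sigh]']
```

In the second call, [sigh] is flagged because the marker documented above is [sighs].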
Bark API
Generate speech programmatically using the TextToSpeechAI REST API.
curl -X POST "https://api.texttospeechai.com/v1/generate/" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Oh wow! [laughter] This is amazing... I just LOVE how expressive this sounds!",
    "voice": "bark-zh_0"
  }'
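The same request can be built in Python using only the standard library. This is a minimal sketch that mirrors the curl call above: the endpoint, headers, and JSON fields are taken from that example, while the function name build_bark_request is hypothetical. The response format is not described here, so the sketch stops at building the request and leaves the send step commented out.

```python
import json
import urllib.request

API_URL = "https://api.texttospeechai.com/v1/generate/"


def build_bark_request(text: str, voice: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl example."""
    payload = json.dumps({"text": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_bark_request(
    "Oh wow! [laughter] This is amazing... I just LOVE how expressive this sounds!",
    "bark-zh_0",
    "YOUR_API_TOKEN",
)

# Sending requires a real token. How the audio is returned is an assumption
# to verify against the API documentation before parsing the response.
# with urllib.request.urlopen(request) as response:
#     body = response.read()
```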
Technical Specs
- Generation Speed: Slow
- Output Quality: Very Good
- Voice Cloning: Not Supported
- Languages: 13
- GPU VRAM: 8-12 GB
- Credits per 1,000 characters: 25
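To budget credits before generating, the rate of 25 credits per 1,000 characters can be turned into a rough estimate. The sketch below makes two assumptions that are not confirmed here: every character counts toward the total, including emotion markers and punctuation, and partial credits are rounded up. The function name estimate_credits is hypothetical.

```python
def estimate_credits(text: str, credits_per_1000: int = 25) -> int:
    """Estimate the credit cost of a script at the Bark premium-tier rate.

    Assumptions: all characters are billed, and fractions round up.
    """
    # Integer ceiling division avoids floating-point rounding surprises.
    return (len(text) * credits_per_1000 + 999) // 1000


script = "Oh wow! [laughter] This is AMAZING... I can't believe it!"
print(estimate_credits(script))
print(estimate_credits("a" * 1000))  # 25
```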