Parler-TTS

Premium

Text-Described Voice Generation

Moderate Speed
Very Good Quality
No Cloning
1 Languages

About Parler-TTS

Parler-TTS is a unique text-to-speech model that generates voices based on text descriptions. Instead of selecting from pre-defined voices, you describe the voice you want: "A young woman speaks clearly with an American accent" or "An elderly British man speaks slowly in a deep voice." Parler-TTS then generates speech matching your description.

Key Features

Text Descriptions

Generate voices by describing desired characteristics.

Creative Control

Specify age, gender, accent, speed, and speaking style.

Unique Voices

Create voices that do not exist in pre-made libraries.

Natural Output

Generates high-quality, natural-sounding speech.

Efficient

Fast inference for described voice generation.

Open Source

Apache 2.0 licensed for commercial use.

Use Cases

Character Voice Design Creative Projects Prototype Voiceovers Game Development Audiobook Characters Custom Voice Creation

Parler-TTS Voices

View All 10
American Female
EN
American Male
EN
British Female
EN
British Male
EN
Calm Voice
EN
Cheerful Voice
EN
Conversational Voice
EN
Female Narrator
EN
Male Narrator
EN
Professional Voice
EN

How to Use Parler-TTS

  1. 1

    Sign up free or try the demo

    Create a free TextToSpeechAI account for 200 starter credits, or open the demo to try Parler-TTS instantly without signing up.

  2. 2

    Select Parler-TTS and write a voice description

    Choose Parler-TTS as your engine, then write a plain-text voice description such as "A young woman speaks clearly with an American accent." Include age, gender, accent, pace, and mood to shape the voice.

  3. 3

    Enter the text to speak

    Type or paste the script you want spoken. Parler-TTS renders this text in the voice defined by your description, so keep the description and the script in the same language (English works best).

  4. 4

    Generate the speech

    Click generate to send the job to our GPU backend. Parler-TTS synthesizes natural-sounding audio matching your described voice, billed at the Premium tier of 25 credits per 1000 characters.

  5. 5

    Download or call the API

    Download the finished audio as MP3, WAV, or OGG, or automate generation through the TextToSpeechAI API by passing your text and saved voice description in each request.

Parler-TTS API

Generate speech programmatically using the TextToSpeechAI REST API.

curl -X POST "https://api.texttospeechai.com/v1/generate/" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "text": "A cheerful young woman speaks with an American accent.",
    "voice": "parler-female_american"
  }'

Frequently Asked Questions

Parler-TTS is a text-to-speech model that generates voices from text descriptions. Instead of choosing pre-made voices, you describe what you want: "A calm, mature woman with an Australian accent speaking at a moderate pace."

Parler-TTS is open-source under Apache 2.0 license. On TextToSpeechAI, we charge 25 credits per 1000 characters (Premium tier) for its unique voice generation capabilities.

Parler-TTS primarily supports English. The voice descriptions work best in English, though the model can handle various English accents (American, British, Australian, etc.).

Describe voice characteristics naturally: "A young woman speaks clearly with a British accent" or "An elderly man with a deep voice speaks slowly and carefully." Include age, gender, accent, speed, and mood.

Parler-TTS has moderate generation speed, typically 2-5 seconds per sentence on GPU. The voice description processing adds minimal overhead compared to the actual speech generation.

No, Parler-TTS generates voices from descriptions rather than cloning existing voices. For voice cloning, use StyleTTS2, F5-TTS, OpenVoice, or Tortoise.

Parler-TTS requires 4-8GB of VRAM depending on the model size. The mini version works with 4GB, while the full model benefits from 8GB for optimal performance.

Yes, Parler-TTS is Apache 2.0 licensed and supports commercial use. Since voices are generated from descriptions, there are no voice ownership concerns.

Include your voice description in the API request along with your text. Our API processes the description and generates matching speech. You can save favorite descriptions for reuse.

Parler-TTS produces very good, natural-sounding audio with prosody that matches your described voice. It outputs WAV natively, and on TextToSpeechAI you can download it as MP3, WAV, or OGG with automatic conversion.

Both are expressive, open-source engines, but they differ in control. Parler-TTS lets you steer the voice with a plain-text description (age, accent, pace, mood), while Bark adds nonverbal cues like [laughter] and music. Choose Parler-TTS when you want a specific described voice and Bark when you want spontaneous emotional delivery.

Yes. Sign up for a free account on TextToSpeechAI to receive 200 starter credits, or use the demo to hear Parler-TTS without an account. That is enough to test several voice descriptions before choosing a credit pack.

Technical Specs

  • Generation Speed Moderate
  • Output Quality Very Good
  • Voice Cloning Not Supported
  • Languages 1
  • GPU VRAM 4-8GB
  • Credits/1000 chars 25

Try Parler-TTS Now

Generate your first audio free. No credit card required.

Start Free