Kubeez · Audio · ElevenLabs V3-class

    Text-to-speech

    100+ voices, seven categories, previews before you spend credits

    Kubeez ships a full voice catalogue—more than 100 professional AI voices you can audition in the picker, grouped into seven categories from Conversational and Narration to Characters, Social Media, Entertainment, Educational, and Advertisement. Run a single voiceover or multi-line dialogue with Text-to-Dialogue V3. Section images below are Nano Banana 2 marketing stills; audio comes from the dialogue engine.

    Showcase

    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)
    Text-to-speech context (illustrative Nano Banana 2 still)

    A VOICE LIBRARY, NOT A SINGLE ROBOT

    Pick a voice that matches the script—then iterate lines without leaving the Audio workspace.

    100+ VOICES

    Browse a large, production-oriented catalogue with search, categories, and favorites—so brand spots, tutorials, and characters do not all sound the same.

    SEVEN CATEGORIES

    Filter by Conversational, Narration, Characters, Social Media, Entertainment, Educational, or Advertisement to shortlist faster.

    SOLO OR DIALOGUE

    Single-speaker reads or multi-line scripts with different voices per line—built for explainers, games, and ads.

    Text-to-speech in the app and MCP

    Use MCP tool "generate_dialogue" for automated dialogue jobs—the same voice ids and engine behavior as Audio → Dialogue.

    API & integrations

    Where TTS earns its keep

    BRAND & ADS

    Consistent read, flexible takes

    Lock a voice that fits the campaign, then revise copy without re-casting—credits stay in Kubeez.

    TTS context: voice booth and studio microphone
    LEARNING & INTERNAL

    Clear narration at scale

    Educational category voices for courses and onboarding—preview before you render long lessons.

    TTS context: e-learning desk with headset mic
    AUTOMATION

    MCP tool "{{modelId}}"

    Script dialogue generation from agents using the same pipeline as the Kubeez UI—pair with your existing MCP setup.

    TTS context: game VO session with script at mic

    One plan. Every video model.

    Pay for what you generate. Cancel anytime.

    Pro

    Most popular

    Access full experience.

    ~01234567890123456789,01234567890123456789 USDmonth
    0123456789,012345678901234567890123456789credits / month

    What you'll generate

    • Images~012345678901234567890123456789
    • Songs~012345678901234567890123456789
    • Videos~012345678901234567890123456789
    • Separations~012345678901234567890123456789
    • Ads~01234567890123456789
    • Caption Minutes~01234567890123456789

    Starter

    Low tier generations for regular use.

    ~01234567890123456789,01234567890123456789 USDmonth
    0123456789,012345678901234567890123456789credits / month

    What you'll generate

    • Images~012345678901234567890123456789
    • Songs~01234567890123456789
    • Videos~01234567890123456789
    • Separations~012345678901234567890123456789
    • Ads~01234567890123456789
    • Caption Minutes~01234567890123456789

    Studio Pro

    Studio value with extra headroom.

    ~01234567890123456789,01234567890123456789 USDmonth
    0123456789,012345678901234567890123456789credits / month

    What you'll generate

    • Images~0123456789,012345678901234567890123456789
    • Songs~012345678901234567890123456789
    • Videos~012345678901234567890123456789
    • Separations~0123456789,012345678901234567890123456789
    • Ads~012345678901234567890123456789
    • Caption Minutes~012345678901234567890123456789

    Powerhouse

    Most cost efficient.

    ~012345678901234567890123456789,01234567890123456789 USDmonth
    01234567890123456789,012345678901234567890123456789credits / month

    What you'll generate

    • Images~0123456789,012345678901234567890123456789
    • Songs~012345678901234567890123456789
    • Videos~012345678901234567890123456789
    • Separations~0123456789,012345678901234567890123456789
    • Ads~012345678901234567890123456789
    • Caption Minutes~012345678901234567890123456789
    Or pay-as-you-go

    Test any model with one-time credit packs

    Credits never expire and don't require a subscription — buy a pack, generate, see if Kubeez fits.

    Top-up credits

    Quick refill for short bursts of generation.

    Credits

    500

    You pay

    $7.24

    Save up to 3% with a subscription

    Top-up credits

    Mid-size pack at the standard per-credit rate.

    Credits

    1,500

    You pay

    $21.73

    Save up to 3% with a subscription
    −5%

    Top-up credits

    Larger pack with a 5% volume discount built in.

    Credits

    5,000

    You pay

    $68.83

    Save up to 3% with a subscription
    −10%

    Top-up credits

    Best per-credit price for sustained production.

    Credits

    15,000

    You pay

    $195.61

    Save up to 3% with a subscription

    Text-to-speech

    When the script is ready but the mic is not

    Run text-to-speech in KubeeztextToSpeechIntro.finalCta.titleEm

    Open Audio → Dialogue, pick from 100+ voices, and keep credits in one wallet with music and video.

    textToSpeechIntro.finalCta.nudge