AI Voice — Sesame CSM-1B

Kommersiële gebruik OK 380+modelle Geen watermerk nie Geen teken-op benodig
Model:
+ GPT-5, Claude, Gemini
TTS-enjin Self-hosted Apache 2.0
Sesame CSM-1B — Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.
0 karakters ~0 tokens
Duur skubbe met karaktertelling
Genererende spraak...

Wat doen Sesame CSM-1B klink so?

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.

Probeer die boks hierbo met: Dag, my naam is Sam, en ek lees hierdie voorbeeld om die stem te toon. takies wat die kanonieke TTS depreteer frase is.

Wanneer om te gebruik Sesame CSM-1B

Media controller element

Lang-vorm vertelling met konsekwente toon. Plak 'n hoofstuk op' n slag, aflaai as WAV of MP3, en stice eksterne.

Verdonkerte intro's

Kort open buffers en ad-lees. Verstel spoed vir energie, formaat-switch na MP3 vir kleiner lêers.

IVR + stempos

Telefoon-stelsel gee aanleiding tot. Studio-quality uitset sonder 'n boekring, opname of NDAs met stemtalent.

Toeganklikheid

Voeg klank by geskrewe inhoud vir lae-vision- en dislektiese lesers. Val-in op enige bladsy.

Voeg lÃaer by...

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Verf met patroon

Self-hosted op ons GPUs. Generation trek van jou daaglikse vry swembad eerste, wanneer dit opraak, betaalde goedere begin by $5 → 200 (R5 000) forms. R5s per karakter, minimum 100 per clip.

Volledige model verwysing → · Sien alle TTS-stemme → · Vergelyk 2 stemme sy- by-side →

Gevorderde opsies
Resultaat
Tokens loop laag. Get More Tokens
Want better results? Premium modelle (GPT-5, Claude, Gemini) deliver higher quality. View Plans

❤️ Love Free.ai? Tell your friends!

Teken op om 'n verwysingale skakel te kry en verdien 25 000 briewe per vriend.

Wil jy meer hê? Sign up free for 10,000 tokens
Meld aan om vry te wees

Proses jou versoek...

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio …

Hoe om te gebruik AI Voice — Sesame CSM-1B

1
Tik jou invoer in

Tik teks, oplaai 'n lêer, of beskryf wat jy wil hê. Nee rekening benodig.

2
Kliek genereer

Ons kunsmatige intelligensie verwerk jou versoek in sekondes deur die beste ope-bou modelle te gebruik.

3
Aflaai klaar gemaak

Laai af, kopieer of deel jou resultaat. Vry vir persoonlike en kommersiële gebruik.

Gebruik hierdie program deur middel van API

Outomate hierdie program van jou eie kode. OpenAI- compatibleREST- end point, Beer-token auth, nee ekstra SDK benodig. Token kos ooreenstem die web koppelvlak.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once depleted, paid tokens start at $5 → 200,000 tokens. Roughly ~5 tokens per character, minimum 100 per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.

Yes — POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

Yes — POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). Returns WAV or MP3. See /api/ for full reference + SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Yes — Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in-scope.

Yes — failed jobs auto-refund to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

Teken gratis op vir 10 000 tekens

Skep vrye rekening

Geen kredietkaart benodig nie

Hoe sal jy hierdie instrument uitwerk?

4.3/5 from 3 ratings

Like this tool? Share it!