AI Voice — Sesame CSM-1B

Commercial use OK · 380+ models · No watermark · No sign-up required
TTS engine · Self-hosted · Apache 2.0
Billing scales with the number of characters.

What does Sesame CSM-1B sound like?

Sesame CSM-1B is an Apache 2.0-licensed Conversational Speech Model designed for low-latency, real-time voice. It produces 24 kHz output and sounds best when given a short reference-audio context turn. It is self-hosted on Free.ai for the /voice/realtime/ tool.

Fill the box above with: "Hello, my name is Sam, and I am reading this sample to demonstrate the voice." That is the canonical TTS demo phrase.

When to use Sesame CSM-1B

Audiobooks

Narrate a book chapter by chapter, download each part as WAV or MP3, and assemble the files externally.

Podcast intros

Short opening bumpers and ad reads. Adjust speed for energy, and switch the format to MP3 for smaller files.

IVR + voicemail

Studio-quality output without booking, recording sessions, or NDAs with voice talent.

Accessibility

Paste in any page to get an audio version of written content for visually impaired and dyslexic readers.

Sample phrases

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Pricing

Generation draws from your daily free pool first; once that runs out, paid token packs start at $5 for 200,000 tokens. Cost is roughly 5 tokens per character, with a minimum of 100 tokens per clip.
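
As a rough illustration of the arithmetic above, the following sketch estimates the token and dollar cost of a clip. It assumes exactly 5 tokens per character (the page only says "roughly"), the 100-token minimum per clip, and the entry pack price of $5 per 200,000 tokens; the function names are hypothetical and actual billing may round differently.

```python
# Rates taken from the pricing text; treat them as approximations.
TOKENS_PER_CHAR = 5        # "roughly 5 tokens per character"
MIN_TOKENS_PER_CLIP = 100  # stated minimum per clip
PACK_PRICE_USD = 5.0       # entry pack price
PACK_TOKENS = 200_000      # tokens in the entry pack


def estimate_tokens(text: str) -> int:
    """Estimate tokens charged for one clip (hypothetical helper)."""
    return max(MIN_TOKENS_PER_CLIP, TOKENS_PER_CHAR * len(text))


def estimate_cost_usd(text: str) -> float:
    """Estimate the paid-token cost in USD at the entry pack rate."""
    return estimate_tokens(text) * PACK_PRICE_USD / PACK_TOKENS
```

Under these assumptions, a 1,000-character passage costs about 5,000 tokens, and very short clips are billed at the 100-token minimum.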



How to use AI Voice: Sesame CSM-1B

1. Enter your input

Type a message, upload a file, or describe what you need. No account is required.

2. Click Generate

Our AI processes your request in seconds using the best open-source models.

3. Download & use

Download, copy, or share your result. Free for personal and commercial use.

Use this tool via the API

Automate this tool from your own code. It exposes an OpenAI-compatible REST endpoint with Bearer-token auth, and no SDK is required. Token costs match the web interface.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'
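
The same request can be sketched in Python using only the standard library. This mirrors the curl call above (same URL, headers, and JSON fields); the helper names `build_tts_request` and `synthesize` are hypothetical, and the assumption that the response body is raw audio bytes is not confirmed by this page.

```python
import json
import urllib.request

API_URL = "https://api.free.ai/v1/tts/"  # endpoint from the curl example


def build_tts_request(text, api_key, model="kokoro", voice="af_heart"):
    """Build (but do not send) a POST request matching the curl example."""
    payload = {"text": text, "voice": voice, "model": model}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text, api_key, **kwargs):
    """Send the request and return the response body.

    Assumption: the body is the audio file itself; check /api/ for the
    actual response format before relying on this.
    """
    request = build_tts_request(text, api_key, **kwargs)
    with urllib.request.urlopen(request, timeout=60) as response:
        return response.read()
```

A call such as `synthesize("Hello from Free.ai", api_key="sk-free-...")` would then return the bytes to write to disk, with your own key substituted for the elided one.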

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once that is depleted, paid token packs start at $5 for 200,000 tokens. Cost is roughly 5 tokens per character, with a minimum of 100 tokens per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.
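
If you call the API in a loop yourself, the text first has to be split into pieces under the limit. The sketch below is one simple way to do that, preferring sentence boundaries; it assumes the 5,000-character web UI limit also applies to API requests, which this page does not state, and `chunk_text` is a hypothetical helper rather than part of any Free.ai SDK.

```python
import re

MAX_CHARS = 5000  # web UI limit; assumed (not confirmed) for the API as well


def chunk_text(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # A single sentence longer than the limit is hard-split.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk can then be sent as its own request and the resulting clips joined in order.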

Batch generation is supported: POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

API access is available: POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). The endpoint returns WAV or MP3. See /api/ for the full reference and SDK snippets.
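
Because the endpoint may return either WAV or MP3, client code that saves the result needs to pick the right file extension. The following is a minimal sketch that inspects the leading bytes of the returned data; it is a heuristic based on the standard RIFF/WAVE and MP3 headers, not a Free.ai API feature, and `detect_audio_format` is a hypothetical helper.

```python
def detect_audio_format(data: bytes) -> str:
    """Return "wav", "mp3", or "unknown" based on the leading bytes."""
    # WAV files are RIFF containers whose form type is "WAVE".
    if len(data) >= 12 and data[:4] == b"RIFF" and data[8:12] == b"WAVE":
        return "wav"
    # MP3 files usually start with an ID3v2 tag...
    if data[:3] == b"ID3":
        return "mp3"
    # ...or directly with an MPEG audio frame sync (11 set bits).
    if len(data) >= 2 and data[0] == 0xFF and (data[1] & 0xE0) == 0xE0:
        return "mp3"
    return "unknown"
```

The frame-sync check also matches some other MPEG audio streams, so treat the result as a hint and prefer the response's Content-Type header when one is provided.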

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Commercial use is allowed: Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in scope.

Failed jobs are refunded automatically to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

Sign up for free to get 10,000 tokens. No credit card required.
