AI Voice — Sesame CSM-1B


What is Sesame CSM-1B?

Sesame CSM-1B is an Apache 2.0-licensed Conversational Speech Model designed for low-latency, real-time voice. It produces 24 kHz output and sounds best when given a short reference-audio context turn. It is self-hosted on Free.ai for the /voice/realtime/ tool.

Try the text box on this page with: "Hello, my name is Sam, and I am reading this sample to demonstrate the voice." That is the canonical TTS demo sentence.

When to use Sesame CSM-1B

Audiobooks

Long-form narration in a consistent voice. Render one chapter at a time, download it as WAV or MP3, and stitch the clips together locally.

Podcast intros

Bumpers and ad reads. Adjust the speed for energy, and switch the format to MP3 for smaller files.

IVR + voicemail

Phone-system prompts. Studio-quality output without studio bookings, scheduling, or NDAs with voice talent.

Accessibility

Add audio to written content for low-vision readers and for readers who are not native speakers of the language. Embed it on any page.
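For the audiobook workflow described above (one chapter per clip, stitched locally), the following is a minimal sketch of the stitching step using only Python's standard `wave` module. It assumes every clip was downloaded as WAV with the same channel count, sample width, and sample rate (the page states 24 kHz output); the function name `stitch_wavs` is illustrative and is not part of any Free.ai tooling. MP3 clips would need a separate audio library.

```python
import wave


def stitch_wavs(paths, out_path):
    """Concatenate WAV clips that share one audio format into a single file."""
    if not paths:
        raise ValueError("no input clips")

    # Use the first clip as the reference format for the output file.
    with wave.open(paths[0], "rb") as first:
        params = first.getparams()

    with wave.open(out_path, "wb") as out:
        out.setparams(params)  # frame count in the header is fixed up on close
        for path in paths:
            with wave.open(path, "rb") as clip:
                fmt = (clip.getnchannels(), clip.getsampwidth(), clip.getframerate())
                if fmt != (params.nchannels, params.sampwidth, params.framerate):
                    raise ValueError(f"{path} has a different audio format")
                out.writeframes(clip.readframes(clip.getnframes()))
    return out_path
```

Calling `stitch_wavs(["ch01.wav", "ch02.wav"], "book.wav")` would write the chapters back to back; the file names here are placeholders.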

Sample scripts

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Pricing

Self-hosted on our GPUs. Generation draws from your daily free pool first; once that is depleted, paid token packs start at $5 for 200,000 tokens. Roughly 5 tokens per character, with a minimum of 100 per clip.
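The pricing rule above (roughly 5 tokens per character, a 100-token minimum per clip, and packs starting at $5 for 200,000 tokens) can be turned into a quick estimate. This is a rough sketch only: the page gives the per-character rate as approximate, and counting characters with `len()` is an assumption about how the service measures text.

```python
TOKENS_PER_CHAR = 5          # approximate rate stated on the page
MIN_TOKENS_PER_CLIP = 100    # stated per-clip minimum
PACK_PRICE_USD = 5.00        # entry pack price
PACK_TOKENS = 200_000        # tokens in the entry pack


def estimate_tokens(text: str) -> int:
    """Approximate token cost of one clip, applying the per-clip minimum."""
    return max(MIN_TOKENS_PER_CLIP, TOKENS_PER_CHAR * len(text))


def estimate_cost_usd(tokens: int) -> float:
    """Approximate paid cost at the entry pack rate ($5 per 200,000 tokens)."""
    return tokens * PACK_PRICE_USD / PACK_TOKENS
```

Under these assumptions a 1,000-character clip costs about 5,000 tokens, or roughly $0.125 at the entry pack rate, and one 200,000-token pack covers about 40,000 characters.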


How to use AI Voice — Sesame CSM-1B

1. Enter your input

Type text, upload a file, or describe what you need. No account is required.

2. Click Generate

Our AI processes your request in seconds using the best open-source models.

3. Download and share

Download, copy, or share your result. Free for personal and commercial use.

Use this tool via the API

Call this tool from your own code. It exposes an OpenAI-compatible REST endpoint with Bearer-token auth, and no additional SDK is required. Token costs match the web interface.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'
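The same request can be sketched in Python with only the standard library. The endpoint, headers, and JSON field names come from the curl example above, and the model value "Sesame CSM-1B" follows the FAQ on this page. Two things are assumptions: that a successful response body is the raw audio bytes, and that `voice` can be omitted for this engine. The helper names `build_tts_request` and `synthesize` are illustrative.

```python
import json
import urllib.request

API_URL = "https://api.free.ai/v1/tts/"  # endpoint from the curl example


def build_tts_request(text, api_key, model="Sesame CSM-1B", voice=None):
    """Build (but do not send) a POST request mirroring the curl example."""
    payload = {"text": text, "model": model}
    if voice is not None:
        payload["voice"] = voice  # optional; the curl example passes one
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text, api_key, out_path="clip.wav"):
    """Send the request and save the response body to out_path.

    Assumption: the response body is the audio file itself (WAV or MP3).
    """
    req = build_tts_request(text, api_key)
    with urllib.request.urlopen(req, timeout=60) as resp:
        audio = resp.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

Keeping request construction separate from sending makes it easy to inspect the payload before spending tokens; check /api/ for the actual response format.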

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.
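A minimal sketch of consuming a streamed response, for engines where streaming is available. It assumes `stream=true` is sent as a JSON body field named `stream` and that the server sends audio bytes incrementally over the open HTTP response; neither detail is confirmed on this page. The `model` argument must be the slug of a premium engine that supports streaming, and the `opener` parameter is only there so the function can be exercised without a network call.

```python
import json
import urllib.request

API_URL = "https://api.free.ai/v1/tts/"


def stream_tts(text, api_key, model, out_path, chunk_size=4096,
               opener=urllib.request.urlopen):
    """Write audio to out_path as it arrives; return the number of bytes written."""
    # Assumption: streaming is requested with a boolean "stream" field in the body.
    payload = {"text": text, "model": model, "stream": True}
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    written = 0
    with opener(req, timeout=60) as resp, open(out_path, "wb") as out:
        while True:
            chunk = resp.read(chunk_size)  # read whatever has arrived so far
            if not chunk:
                break
            out.write(chunk)
            written += len(chunk)
    return written
```

In a real-time player the chunks would be handed to an audio sink instead of a file; writing to disk keeps the sketch short.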

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once that is depleted, paid token packs start at $5 for 200,000 tokens. Roughly 5 tokens per character, with a minimum of 100 per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.
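If you take the "call the API in a loop" route, the text first has to be split into pieces that fit the limit. The sketch below splits on sentence boundaries and falls back to a hard cut for any single sentence longer than the limit. The 5,000-character figure is the web UI limit stated above; whether the API enforces the same limit is an assumption, and `chunk_text` is an illustrative helper name.

```python
import re

MAX_CHARS = 5000  # web UI per-request limit stated above


def chunk_text(text, max_chars=MAX_CHARS):
    """Split text into chunks of at most max_chars, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # A single oversized sentence is cut hard at the limit.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be sent as its own request and the resulting clips joined in order. Splitting at sentence ends rather than at a fixed offset avoids cutting a word or phrase in half at a clip boundary.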

Batch generation is supported: POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

An API is available: POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). It returns WAV or MP3. See /api/ for the full reference and SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice, voiceovers, ads, podcasts, and apps are all in scope.

Failed jobs are refunded automatically to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.
