AI Voice — Sesame CSM-1B

Commercial use OK · 380+ styles · No watermark · No login required
TTS engine · Self-hosted · Apache 2.0
Cost scales with character count.

What is Sesame CSM-1B, and what does it sound like?

Sesame CSM-1B is an Apache 2.0 licensed Conversational Speech Model designed for low-latency, real-time voice. It produces 24 kHz output and sounds best when given a short reference-audio context turn. On Free.ai it is self-hosted for the /voice/realtime/ tool.


When to use Sesame CSM-1B

Audiobooks

Long-form narration with a consistent tone. Paste one chapter at a time, download as WAV or MP3, and combine the files externally.

Podcast intros

Short bumper openings and ad reads. Adjust the speed for more energy, and switch the format to MP3 for smaller files.

IVR + voicemail

Phone-system greetings. Studio-quality output without booking, recording sessions, or NDAs with voice talent.

Accessibility

Add audio alongside written content for readers with comprehension difficulties or dyslexia. Embed it on any page.

Sample phrases

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Plans

Self-hosted on our GPUs. Generation draws from your daily free pool first; once that runs out, paid token packs start at $5 → 200,000 tokens. About 5 tokens per character, with a minimum of 100 tokens per clip.
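As a rough worked example of the pricing above, the sketch below estimates the token cost of a single clip. It assumes exactly 5 tokens per character (the page only says "about"), that the 100-token minimum applies per clip, and that paid tokens cost a flat $5 per 200,000. The function names are illustrative, and actual billing may differ.

```python
TOKENS_PER_CHAR = 5          # assumption: the page says "about 5"
MIN_TOKENS_PER_CLIP = 100    # minimum charge per clip, per the page
USD_PER_TOKEN = 5 / 200_000  # entry-level pack: $5 for 200,000 tokens


def estimate_tokens(text: str) -> int:
    """Estimated token cost for one clip of `text`."""
    return max(MIN_TOKENS_PER_CLIP, TOKENS_PER_CHAR * len(text))


def estimate_usd(text: str) -> float:
    """Estimated paid cost in USD, ignoring the daily free pool."""
    return estimate_tokens(text) * USD_PER_TOKEN


if __name__ == "__main__":
    sample = "Welcome to the show, today we are exploring the future of AI."
    print(estimate_tokens(sample), round(estimate_usd(sample), 5))
```

Under these assumptions, a 5,000-character request costs about 25,000 tokens, so a $5 pack covers roughly 40,000 characters.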

Full model reference → · See all TTS voices → · Compare 2 voices side by side →


❤️ Love Free.ai? Tell your friends!

Sign up to get a referral link and earn 25,000 tokens per friend.

Want more? Sign up free for 10,000 tokens


How to use AI Voice — Sesame CSM-1B

1. Enter your input

Type text, upload a file, or describe what you want. No account needed.

2. Click to generate

Our AI processes your request in seconds using the best open-source models.

3. Download & share

Play, copy, or share your result. Free for personal and commercial use.

Use this tool via the API

Automate this tool from your own code: an OpenAI-compatible REST endpoint with Bearer-token authentication, no extra SDK required. Token costs match the web interface.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'
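The same request can be sketched in Python using only the standard library. The URL, headers, and JSON fields are copied from the curl example; how the response is delivered is not stated on this page, so `synthesize` assumes the body is the raw audio file, which may not match the real API. Both function names are illustrative.

```python
import json
import urllib.request

API_URL = "https://api.free.ai/v1/tts/"  # endpoint from the curl example


def build_tts_request(text, api_key, voice="af_heart", model="kokoro"):
    """Build (but do not send) the same POST request as the curl example."""
    payload = {"text": text, "voice": voice, "model": model}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text, api_key, out_path="speech.wav", **kwargs):
    """Send the request and save the response body to disk.

    Assumption: the response body is the raw audio file; the real API
    may instead return JSON containing a URL or encoded audio.
    """
    req = build_tts_request(text, api_key, **kwargs)
    with urllib.request.urlopen(req, timeout=60) as resp:
        audio = resp.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

Keeping request construction separate from sending makes it easy to inspect the payload before spending tokens, for example `build_tts_request("Hello from Free.ai", "sk-free-...").data`.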

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.
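A minimal sketch of consuming a streamed response in Python follows. It assumes `stream=true` is sent as a field in the JSON body (it could equally be a query parameter) and that the response is raw audio bytes delivered incrementally. The `model` argument is a placeholder, since this page does not name the premium engines' slugs, and all function names are hypothetical.

```python
import io
import json
import urllib.request

API_URL = "https://api.free.ai/v1/tts/"  # endpoint named in the text


def stream_payload(text: str, model: str) -> bytes:
    """JSON body with streaming enabled (assumed to be a body field)."""
    return json.dumps({"text": text, "model": model, "stream": True}).encode("utf-8")


def copy_stream(source, sink, chunk_size: int = 4096) -> int:
    """Copy bytes from source to sink as they arrive; return total bytes."""
    total = 0
    while True:
        chunk = source.read(chunk_size)
        if not chunk:
            break
        sink.write(chunk)
        total += len(chunk)
    return total


def stream_to_file(text, model, api_key, out_path):
    """Hypothetical end-to-end call: POST, then write audio as it arrives."""
    req = urllib.request.Request(
        API_URL,
        data=stream_payload(text, model),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=60) as resp, open(out_path, "wb") as f:
        return copy_stream(resp, f)


if __name__ == "__main__":
    # Offline demo: an in-memory buffer stands in for the HTTP response.
    fake_response = io.BytesIO(b"\x00" * 10_000)
    out = io.BytesIO()
    print(copy_stream(fake_response, out))
```

Reading in fixed-size chunks means playback or writing can begin before the full clip has rendered, which is the point of streaming over the full-clip behavior of the web UI.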

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once that pool is depleted, paid token packs start at $5 → 200,000 tokens. Roughly 5 tokens per character, with a minimum of 100 tokens per clip.

The web UI accepts up to 5,000 characters per request. For longer pieces (audiobooks, full chapters), use /voice/audiobook/, which chunks and stitches automatically, or call the API in a loop.
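For the "call the API in a loop" route, long text first has to be split into pieces that fit the 5,000-character limit. The sketch below is one simple way to do that, breaking at sentence boundaries where possible. `split_text` is a hypothetical helper, not part of the Free.ai API; it assumes the API shares the web UI's limit, and /voice/audiobook/ may chunk differently.

```python
import re

MAX_CHARS = 5000  # limit stated for the web UI; assumed to hold for the API too


def split_text(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # Hard-split any single sentence that exceeds the limit on its own.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be sent as its own request and the resulting audio files joined afterwards. Splitting at sentence ends avoids cutting a word or phrase in half, which would otherwise produce an audible break.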

Batch generation is supported: POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

API access is available: POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). The endpoint returns WAV or MP3. See /api/ for the full reference and SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Commercial use is allowed: Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in scope.

Failed jobs are refunded automatically to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

10,000 free tokens

Free account

No credit card required
