AI Voice — Sesame CSM-1B

Penggunaan komersial OK 380+ model Tiada tanda air Tiada pendaftaran diperlukan
Model:
+ GPT-5, Claude, Gemini
Enjin TTS Dihost sendiri Apache 2.0
Sesame CSM-1B — Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.
0 aksara ~0 token
Skala kos dengan kiraan aksara
Menjana ucapan...

Apa yang Sesame CSM-1B Bunyi seperti?

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.

Cuba kotak di atas dengan: Hello, nama saya Sam, dan saya membaca sampel ini untuk mendemonstrasikan suara. — itu frasa demo TTS kanonikal.

Bila untuk digunakan Sesame CSM-1B

Buku Audio

Narasi bentuk panjang dengan nada konsisten. Tampal satu bab pada satu masa, muat turun sebagai WAV atau MP3, dan jahit secara luaran.

Intro Podcast

Pembukaan bumper pendek dan bacaan iklan. Selaraskan kelajuan untuk tenaga, tukar format ke MP3 untuk fail yang lebih kecil.

IVR + mel suara

Sistem-telefon-meminta. Output kualiti studio tanpa tempahan, rakaman, atau NDA dengan bakat suara.

Kebolehcapaian

Tambah audio bersama kandungan tertulis untuk pembaca kurang penglihatan dan dyslexic. Drop-in pada mana-mana halaman.

Frasa contoh

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Harga

Dihostkan sendiri pada GPU kami. Generasi menarik dari kolam percuma harian anda dahulu; apabila ia habis, pakej token berbayar bermula pada $5 → 200,000 token. Kira-kira ~5 token per aksara, minimum 100 per klip.

Rujukan model penuh → · Lihat semua suara TTS → · Bandingkan 2 suara berdampingan →

Opsyen Lanjutan
Hasil
Token semakin habis. Get More Tokens
Want better results? Model premium (GPT-5, Claude, Gemini) deliver higher quality. View Plans

❤️ Love this tool? Share it!

Mendaftar untuk mendapatkan pautan rujukan dan memperoleh 25,000 token per rakan.

Nak lagi? Sign up free for 10,000 tokens
Daftar Masuk

Memproses permintaan anda...

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio …

Bagaimana untuk Guna AI Voice — Sesame CSM-1B

1
Masukkan input anda

Taip teks, muat naik fail, atau jelaskan apa yang anda mahu. Tiada akaun diperlukan.

2
Klik cipta

AI kami memproses permintaan anda dalam beberapa saat menggunakan model sumber terbuka terbaik.

3
Muat turun & kongsi

Muat turun, salin, atau kongsi hasil anda. Muat turun percuma untuk kegunaan peribadi dan komersial.

Guna alat ini melalui API

Automatikkan alat ini dari kod anda sendiri. Titik akhir REST serasi OpenAI, pengesahan token-pemegang, tiada SDK tambahan diperlukan. Kos token sepadan dengan antaramuka web.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once depleted, paid tokens start at $5 → 200,000 tokens. Roughly ~5 tokens per character, minimum 100 per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.

Yes — POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

Yes — POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). Returns WAV or MP3. See /api/ for full reference + SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Yes — Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in-scope.

Yes — failed jobs auto-refund to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

Daftar percuma untuk 10,000 token

Cipta Akaun Bebas

Tiada kad kredit diperlukan

Bagaimana anda menilai alat ini?

4.3/5 from 3 ratings

Love this tool? Share it!