AI Voice — Sesame CSM-1B

Kommerziell Benotzung OK 380 Säiten Keng Waasserzeechen Keng Umeldungsinformatioun erfuerderlech
Modell:
+ GPT-5, Claude, Gemini
TTS-Engine Selbst-Hosting Apache 2.0
Sesame CSM-1B — Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.
0 Zeichen ~0 Token
D'Kosten si mat der Zeichenzuel skaléiert
Sprooch gëtt erstallt...

Wat ass Sesame CSM-1B Wéi klingt dat?

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.

Et ass eng vun de bekanntste Figuren an der Geschicht vum Film, an dat och wéinst senger Roll als "The Cannonball".

Wann ze benotzen Sesame CSM-1B

Audiobooks

Long-form narration with consistent tone. Paste a chapter at a time, download as WAV or MP3, and stitch externally.

Websäit vum Podcast

Kuerz Eröffnungsbumper an Ad-Reads. Passt d'Geschwindegkeet fir Energie un, Format-Switch op MP3 fir kleng Dateien.

Voicemail

Telefonssystem-Prompts. Studio-Qualité-Ausgab ouni Reservéierung, Opnam oder NDAs mat Stëmmtalent.

Zougänglechkeet

Audio neben geschriwen Inhalt fir schwaach gesinn an dyslexisch Lieser. Drop-in op all Säit.

Beispiller vu Phrasen

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Präis

D'Generatioun zielt zuerst aus Ärem alldeegleche gratis Pool; wann dee verbraucht ass, fänken d'bezuelte Token-Pakete bei $5 → 200.000 Token un. Ongeféier ~5 Token pro Zeichen, mindestens 100 pro Clip.

Lëscht vu Modellen → · D'Lëscht vun de lëtzebuergesche Sproochen → · 2 Säiten 2 Säiten →

Erweitert Optiounen
Resultat
Den Haaptuert ass La Bassée. Get More Tokens
Want better results? Präis (GPT-5, Claude, Gemini) deliver higher quality. View Plans

❤️ Free.ai? Erzielt et Äre Frënn!

D'Asteroiden 1634 (1634) A. A. an 1635 (1635) A. A. sinn nom A. A. A. benannt, dem franséische Mathematiker.

Méi wëllen? Sign up free for 10,000 tokens
Gratis anmelden

Är Ufro gëtt veraarbecht...

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio …

Wéi ze benotzen AI Voice — Sesame CSM-1B

1
Gitt Är Input an

Text aginn, eng Datei erofladen oder beschreiwen wat Dir wëllt. Keng Kont néideg.

2
Klick erzeugen

D'Aarbechtszäiten an d'Aarbechtskonditiounen sinn an der Regel am beschten an der éischter Woch.

3
Download & share

Är Resultater erofzelueden, kopiéieren oder deelen. Gratis fir perséinlech a kommerziell Notzung.

Dësen Tool iwwer API benotzen

Automatiséieren dëse Tool vun Ärem eegene Code. OpenAI-kompatibel REST Endpoint, Bearer-Token Auth, keng extra SDK erfuerderlech. Token Käschte passen d'Web-Interface.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once depleted, paid tokens start at $5 → 200,000 tokens. Roughly ~5 tokens per character, minimum 100 per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.

Yes — POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

Yes — POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). Returns WAV or MP3. See /api/ for full reference + SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Yes — Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in-scope.

Yes — failed jobs auto-refund to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

Et gëtt ongeféier 10.000 Aarten.

Kont erstellen

Keng Kreditkaart erfuerderlech

Wéi géift Dir dat Tool bewäerten?

4.3/5 from 3 ratings

Free.ai? Erzielt et Äre Frënn!