AI Voice — Sesame CSM-1B

Commercial use OK · 380+ models · No watermark · No sign-up needed
TTS engine · Self-hosted · Apache 2.0
Sesame CSM-1B (Apache 2.0) is a Conversational Speech Model designed for low-latency, real-time voice. It produces 24 kHz output and sounds best with a short reference-audio context turn. It is self-hosted on Free.ai for the /voice/realtime/ tool.

What does Sesame CSM-1B sound like?

Try the box above with: "Hello, my name is Sam, and I am reading this sample to demonstrate the voice." That is the canonical TTS demo phrase.

When to use Sesame CSM-1B

Audiobooks

Long-form narration with a consistent tone. Paste one chapter at a time, download as WAV or MP3, and stitch the files together externally.

Podcast intros

Short opening bumpers and ad reads. Adjust the speed for energy, and switch the format to MP3 for smaller files.

IVR + voicemail

Phone-system prompts. Studio-quality output without bookings, recording sessions, or NDAs with voice talent.

Accessibility

Add audio alongside written content for low-vision and dyslexic readers. Drop-in on any page.

Sample sentences

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Pricing

Self-hosted on our GPUs. Generation draws from your daily free pool first; once that is used up, paid token packs start at $5 for 200,000 tokens. Roughly 5 tokens per character, with a minimum of 100 tokens per clip.
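The pricing rule above can be turned into a quick estimate. The sketch below is only an approximation under stated assumptions: it treats "roughly 5 tokens per character" as exact, counts characters with Python's `len`, and pro-rates the $5 entry pack even though packs are presumably bought whole. The function names are illustrative and not part of any Free.ai SDK.

```python
TOKENS_PER_CHAR = 5        # "roughly 5 tokens per character" (approximate)
MIN_TOKENS_PER_CLIP = 100  # minimum charge per clip
PACK_PRICE_USD = 5.0       # entry token pack price
PACK_TOKENS = 200_000      # tokens in the entry pack


def estimate_tokens(text: str) -> int:
    """Approximate token cost of one clip, applying the per-clip minimum."""
    return max(MIN_TOKENS_PER_CLIP, TOKENS_PER_CHAR * len(text))


def estimate_usd(tokens: int) -> float:
    """Pro-rated dollar value of `tokens` at the entry pack rate (assumption)."""
    return tokens * PACK_PRICE_USD / PACK_TOKENS


sample = "Press one for sales, two for support, or stay on the line for an agent."
tokens = estimate_tokens(sample)
print(f"{len(sample)} characters -> about {tokens} tokens (${estimate_usd(tokens):.4f})")
```

At this rate a 1,000-character passage comes to about 5,000 tokens, so the entry pack would cover on the order of 40,000 characters of speech.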

How to use Sesame CSM-1B

1. Enter your input

Type text, upload a file, or describe what you want. No account is needed.

2. Click Generate

Our AI processes your request in seconds using the best open-source models.

3. Download & share

Download, copy, or share your result. Free for personal and commercial use.

Use this tool via the API

Automate this tool from your own code. It is an OpenAI-compatible REST endpoint with Bearer-token auth, and no extra SDK is needed. Token costs match the web interface.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'
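For callers who prefer Python over curl, the same request can be sketched with the standard library alone. The endpoint, headers, and JSON fields are copied from the curl example above. Everything else is an assumption: the helper names `build_tts_request` and `synthesize` are hypothetical, and the code assumes the response body is the raw audio file, which this page does not spell out.

```python
import json
import urllib.request
from typing import Optional

API_URL = "https://api.free.ai/v1/tts/"  # endpoint from the curl example


def build_tts_request(text: str, api_key: str, model: str,
                      voice: Optional[str] = None) -> urllib.request.Request:
    """Build (but do not send) a POST request mirroring the curl example."""
    payload = {"text": text, "model": model}
    if voice is not None:
        payload["voice"] = voice
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text: str, api_key: str, model: str, out_path: str,
               voice: Optional[str] = None) -> None:
    """Send the request and save the response body (assumed to be audio bytes)."""
    request = build_tts_request(text, api_key, model, voice)
    with urllib.request.urlopen(request, timeout=60) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)


# Example call (needs a real key, so it is left commented out):
# synthesize("Hello from Free.ai", "sk-free-...", "kokoro", "hello.wav", voice="af_heart")
```

Keeping request construction separate from sending makes it possible to inspect or log the payload without spending tokens.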

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once that is depleted, paid token packs start at $5 for 200,000 tokens. Roughly 5 tokens per character, with a minimum of 100 tokens per clip.
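The "free pool first, then paid tokens" order can be illustrated with a small helper. This is a sketch of the accounting as described here, not Free.ai's actual billing logic. In particular, it assumes a single clip may be split across the two balances, which the page does not confirm, and the name `charge_clip` is made up for the example.

```python
def charge_clip(clip_tokens: int, free_pool: int) -> tuple[int, int]:
    """Return (tokens taken from the free pool, tokens taken from paid balance).

    Assumes a clip can be partly covered by the free pool (not confirmed).
    """
    from_free = min(clip_tokens, max(free_pool, 0))
    return from_free, clip_tokens - from_free


# Three clips of 400 tokens each against a free pool with 1,000 tokens left.
free_pool = 1_000
for clip_tokens in (400, 400, 400):
    from_free, from_paid = charge_clip(clip_tokens, free_pool)
    free_pool -= from_free
    print(f"free: {from_free}, paid: {from_paid}, pool left: {free_pool}")
```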

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.
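When calling the API in a loop, the text first has to be cut into pieces that fit the limit. The sketch below splits at sentence ends where possible and falls back to a hard cut for any single sentence that is too long. It assumes the 5,000-character web UI limit is also a sensible ceiling for API calls, which this page does not state, and `chunk_text` is an illustrative name rather than part of the service.

```python
import re

MAX_CHARS = 5000  # web UI limit quoted above; assumed to suit API calls too


def chunk_text(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into pieces of at most `limit` characters, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Hard-split any single sentence that is longer than the limit.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


# Small limit so the splitting is easy to see.
for piece in chunk_text("One. Two. Three.", limit=10):
    print(repr(piece))
```

Each chunk can then be sent as its own request and the resulting clips joined in order. Splitting at sentence boundaries keeps pauses natural at the joins.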

Batch generation is supported: POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

API access is available: POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). The call returns WAV or MP3. See /api/ for the full reference and SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Commercial use is allowed: Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page. In practice, voiceovers, ads, podcasts, and apps are all in scope.

Failed jobs are refunded automatically to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.
