AI Voice — Sesame CSM-1B

Użu kummerċjali OK 380 + mudelli L-ebda marka tal-ilma Ebda sign-up meħtieġa
Mudell:
+ GPT-5, Claude, Gemini
Il-magna TTS Self-hosted Apache 2.0
Sesame CSM-1B — Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.
0 karattri ~0 tokens
Skali tal-ispejjeż b’għadd ta’ karattri
Qed niġġenera diskors...

X’jagħmel Sesame CSM-1B Kif tħossok?

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.

Ipprova l-kaxxa hawn fuq ma: Hello, jisimni Sam, u jien qed taqra dan il-kampjun biex juru l-vuċi. — li hija l-frażi demo TTS kanoniku.

Meta tuża Sesame CSM-1B

Kotba awdjo

Inkella, tista’ tniżżel il-fajl bħala WAV jew MP3, u mbagħad tgħaqqad il-kapitlu ma’ dak ta’ barra.

Intro tal-podcast

Aġġusta l-veloċità għall-enerġija, il-format-swiċċ għall-MP3 għall-fajls iżgħar.

IVR + voicemail

Output tal-kwalità tal-istudjo mingħajr prenotazzjoni, reġistrazzjoni, jew NDAs b'talent tal-vuċi.

Aċċessibbiltà

Żid l-awdjo flimkien mal-kontenut bil-miktub għal qarrejja b’vista baxxa u dawk b’dislessija.

Frażijiet ta’ eżempju

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Prezzijiet

Awto-ospitati fuq il-GPUs tagħna. Ġenerazzjoni jiġbed mill-pool kuljum ħielsa tiegħek l-ewwel; ladarba li runs out, imħallsa token pakketti jibdew fil $ 5 → 200,000 tokens. bejn wieħed u ieħor ~ 5 tokens għal kull karattru, minimu 100 għal kull clip.

Referenza sħiħa tal-mudell → · Ara l-vuċijiet TTS kollha → · Qabbel 2 vuċijiet naħa b'naħa →

Għażliet avvanzati
Riżultat
Tokens qed jaħdem baxx. Get More Tokens
Want better results? Mudelli premium (GPT-5, Claude, Gemini) deliver higher quality. View Plans

❤️ Imħabba Free.ai? Għid lill-ħbieb tiegħek!

Irreġistra biex tikseb link ta' referenza u taqla' 25,000 tokens għal kull ħabib.

Trid aktar? Sign up free for 10,000 tokens
Irreġistra b'xejn

Ipproċessar tal-applikazzjoni tiegħek...

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio …

Kif għandek tuża AI Voice — Sesame CSM-1B

1
Daħħal l-input tiegħek

Ittajpja test, ittella' fajl, jew iddeskrivi dak li trid. M'hemmx bżonn ta' kont.

2
Ikklikkja Iġġenera

AI tagħna tipproċessa t-talba tiegħek f'sekondi billi tuża l-aħjar mudelli open-source.

3
Niżżel & jaqsmu

Niżżel, kopja, jew jaqsmu r-riżultat tiegħek. Ħieles għall-użu personali u kummerċjali.

Uża din l-għodda permezz tal-API

Awtomatizza din l-għodda mill-kodiċi tiegħek stess. OpenAI-kompatibbli REST endpoint, Bearer-token awth, l-ebda SDK żejda meħtieġa.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once depleted, paid tokens start at $5 → 200,000 tokens. Roughly ~5 tokens per character, minimum 100 per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.

Yes — POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

Yes — POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). Returns WAV or MP3. See /api/ for full reference + SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Yes — Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in-scope.

Yes — failed jobs auto-refund to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

Irreġistra b'xejn għal 10,000 tokens

Oħloq Kont Ħieles

Ebda karta ta' kreditu meħtieġa

Kif tirraporta din l-għodda?

4.3/5 from 3 ratings

Imħabba Free.ai? Għid lill-ħbieb tiegħek!