AI Voice — Sesame CSM-1B

Naudojimas komerciniais tikslais 380+ modeliai Nėra vandens ženklo Nėra reikalo pasirašyti
Modelis:
+ GPT-5, Claude, Gemini
TTS variklis Savarankiškai Apache 2.0
Sesame CSM-1B — Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.
0 simboliai ~0 simbolių
Sąnaudų skalės su ženklų skaičiumi
Generuojama kalba...

Kas yra Sesame CSM-1B Skamba kaip?

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.

Išbandyk langelį viršuje: Sveikas, mano vardas yra Semas, ir aš skaičiau šį pavyzdį, kad pademonstruotų balsą. — Tai kanoninė TTS demo frazė.

Kada vartoti Sesame CSM-1B

Garso knygos

Ilgos formos pasakojimas su nuosekliu tonu. Įdėti skyrių vienu metu, parsisiųsti kaip WAV arba MP3, ir dygsnio išoriškai.

Podcast intros

Trumpi atidarymo buferiai ir ad-reads. Reguliuokite greitį energijai, formato perjungimą į MP3 mažesniems failams.

IVR + balso paštas

Telefono sistema greitina. Studio kokybės išvestis be užsakymo, įrašymo, arba NDA su balso talentas.

Prieinamumas

Pridėti garsą kartu su rašytiniu turiniu žemo matymo ir disleksiniais skaitytuvais. Įmesti bet kuriame puslapyje.

Mėginio frazės

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

Kainos

Savarankiškai mūsų GPU. Generacija atkreipia savo kasdien nemokamai baseinas pirmą kartą; kai tai baigiasi, mokamas žetonų paketus pradėti $5 → 200,000 žetonų.

Visas modelio nuoroda → · Žr. visus TTS balsus → · Palyginti 2 balsai šalia →

Sudėtingesnės parinktys
Rezultatas
Maži žetonai. Get More Tokens
Want better results? Premium modeliai (GPT-5, Claude, Gemini) deliver higher quality. View Plans

❤️ Love Free.ai? Tell your friends!

Sign up norėdami gauti kreipimosi nuorodą ir uždirbti 25,000 žetonų vienam draugui.

Nori daugiau? Sign up free for 10,000 tokens
Užsiregistruoti nemokamai

Apdorokite savo užklausą...

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio …

Kaip vartoti AI Voice — Sesame CSM-1B

1
Įveskite įvedinį

Įveskite tekstą, įkelkite failą arba apibūdinkite ką norite. Sąskaitos nereikia.

2
Spustelėkite generavimą

Mūsų AI apdoroja Jūsų užklausą per kelias sekundes, naudodami geriausius atviro kodo modelius.

3
Atsisiųsti ir dalintis

Atsisiųskite, nukopijuokite arba pasidalinkite savo rezultatais. Nemokamas asmeniniam ir komerciniam naudojimui.

Naudoti šį įrankį per API

Automatizuoti šį įrankį iš savo kodo. OpenAI suderinama REST vertinamoji baigtis, Beaker-token auth, papildomų SDK nereikia. Token išlaidos atitinka interneto sąsają.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once depleted, paid tokens start at $5 → 200,000 tokens. Roughly ~5 tokens per character, minimum 100 per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.

Yes — POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

Yes — POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). Returns WAV or MP3. See /api/ for full reference + SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Yes — Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in-scope.

Yes — failed jobs auto-refund to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

Užsiregistruoti nemokamai 10,000 žetonų

Sukurti nemokamą paskyrą

Kredito kortelės nereikia

Kaip vertinate šį įrankį?

4.3/5 from 3 ratings

Like this tool? Share it!