AI Voice — Sesame CSM-1B

تجارتي استعمال صحيح ماڊل ڪوبه واٽر مارڪ نه ڪوبه رجسٽريشن جي ضرورت نه آهي
ماڊل:
+ GPT-5, Claude, Gemini
TTS انجن پاڻمرادو ميزبان Apache 2.0
Sesame CSM-1B — Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.
0 نشان ~0 ٽوڪنز
ڪارڪنن جي ڳاڻيٽي سان قيمت جي ماپ
ڳالھائي پيدا ڪئي وڃي ٿي...

ڇا ڪري ٿو Sesame CSM-1B ڇا؟

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.

مٿين بڪس کي هيٺين سان آزمايو: سلام، منھنجو نالو سم آھي، ۽ آءٌ آواز کي ڏيکارڻ لاءِ هي نمونو پڙهي رهيو آھيان. — اھو آھي canonical TTS demo sentence.

ڪڏھن استعمال ڪجي Sesame CSM-1B

آڊيو ڪتاب

ڊگهي شڪل واري ڪهاڻي، مسلسل آواز سان. هڪ دفعي ۾ هڪ باب کي چٽيو، WAV يا MP3 طور ڊائون لوڊ ڪريو، ۽ ٻاهران سٽڪ ڪريو.

پوڊ ڪاسٽ

مختصر کولڻ بمپرز ۽ اد-پڙھڻ. توانائي جي رفتار کي ترتيب ڏيو، ننڍن فائلن لاءِ MP3 ۾ فارميٽ-سيٽ ڪريو.

وڊيو

فون-سسٽم جي پڇاڙي. اسٽوڊيو-ڪواليٽي آؤٽپوٽ بغير رڪنيت، رڪارڊنگ، يا NDAs سان آواز جي صلاحيت سان.

رسائي

گهٽ ڏسڻ وارن ۽ dyslexic پڙهندڙن لاءِ لکيل مواد سان گڏ آڊيو شامل ڪريو. ڪنهن به صفحي تي لڪايو.

مثالي جملا

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

قيمت

اسان جي GPUs تي پاڻ کي ميزبان. نسل پنهنجي روزاني مفت پول کان پهرين نڪتو; هڪ ڀيرو اهو ختم ٿئي ٿو، ادا ڪيل ٽوڪين پيڪيجز $5 → 200,000 ٽوڪين تي شروع ٿئي ٿو. تقريبن ~5 ٽوڪين هر ڪردار، گهٽ ۾ گهٽ 100 هر ڪلپ.

پورو ماڊل حوالو → · سڀ TTS آواز ڏسو → · ٻن آوازن کي گڏيل ڀيٽ ڪريو →

وڌيڪ اختيار
نتيجو
ھيءُ ھيءُ ھيءُ Get More Tokens
Want better results? پريميئم ماڊل (GPT-5, Claude, Gemini) deliver higher quality. View Plans

❤️ Free.ai کي پيارو آهي؟ پنھنجن دوستن کي چئو!

سڀني دوستن کي 25,000 ٽوڪنز حاصل ڪرڻ لاءِ رجسٽر ڪريو.

وڌيڪ گھرو ٿا؟ Sign up free for 10,000 tokens
مفت ۾ رجسٽر ٿيو

توھان جو درخواست جو عمل...

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio …

استعمال ڪرڻ جو طريقو AI Voice — Sesame CSM-1B

1
پنھنجي داخلا داخل ڪريو

متن لکو، فائل اپ لوڊ ڪريو، يا جيڪي توهان چاهيو ٿا سو بيان ڪريو. ڪوبه اڪائونٽ نه گھرجي.

2
پيدا ڪرڻ لاءِ ڪلڪ ڪريو

اسان جو AI توهان جي درخواست کي سيڪنڊن ۾ بهترين مفت-سورس ماڊلز استعمال ڪندي پروسيس ڪندو.

3
ڊائون لوڊ ۽ ونڊو

پنھنجو نتيجو ڊائون لوڊ، ڪاپي يا ونڊ ڪريو. پاڻيءَ ۽ تجارتي استعمال لاءِ مفت.

ھي ٽولز API ذريعي استعمال ڪريو

ھن اوزار کي پنھنجي ڪوڊ مان خودڪار ڪريو. OpenAI-compatible REST endpoint, Bearer-token auth, no extra SDK required. Token costs match the web interface.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once depleted, paid tokens start at $5 → 200,000 tokens. Roughly ~5 tokens per character, minimum 100 per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.

Yes — POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

Yes — POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). Returns WAV or MP3. See /api/ for full reference + SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Yes — Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in-scope.

Yes — failed jobs auto-refund to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

10,000 ٽوڪنز لاءِ مفت ۾ رجسٽر ٿيو

اڪائونٽ ٺاهيو

ڪوبه ڪريڊٽ ڪارڊ نه گھرجي

توھان ھن اوزار کي ڪيئن تصنيف ڪريو ٿا؟

4.3/5 from 3 ratings

Free.ai کي پيارو آهي؟ پنھنجن دوستن کي چئو!