AI Voice — Sesame CSM-1B

ການນໍາໃຊ້ການຄ້າ OK 380+ ແບບ ບໍ່ມີ​ເຄື່ອງ​ໝາຍ​ນ້ຳ ບໍ່ມີ​ການ​ລົງທະບຽນ​ທີ່​ຕ້ອງການ
ແບບ:
+ GPT-5, Claude, Gemini
ເຄື່ອງຈັກ TTS ຈັດການ​ເອງ Apache 2.0
Sesame CSM-1B — Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.
0 ​តួ​អក្សរ ~0 ຕົວ​ແທນ
ຄ່າ​ໃຊ້​ຈ່າຍ​ທີ່​ມີ​ການ​ຄິດໄລ່​ຕົວ​ອັກສອນ
ສ້າງ​ການ​ເວົ້າ...

ເຮັດ​ຫຍັງ Sesame CSM-1B ສຽງຄືແນວໃດ?

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.

ພະຍາຍາມໃຊ້ກະດານຂ້າງເທິງນີ້: ຂໍຂອບໃຈ, ຊື່ຂອງຂ້ອຍແມ່ນ Sam, ແລະຂ້ອຍໄດ້ອ່ານຕົວຢ່າງນີ້ເພື່ອສະແດງສຽງ. — ນັ້ນແມ່ນຄໍາສັບທົດລອງ TTS ແບບ canonical.

ເວລາ​ໃຊ້ Sesame CSM-1B

ອ່ານ​ປື້ມ​ສຽງ

ຄໍາ​ເວົ້າ​ແບບ​ຍາວໆ​ດ້ວຍ​ສຽງ​ທີ່​ຄົບ​ຖ້ວນ. ປ້າຍ​ບົດ​ໃນ​ແຕ່ລະ​ຄັ້ງ, ດາວໂຫລດ​ເປັນ WAV ຫຼື MP3 ແລະ ຕິດ​ຕໍ່​ພາຍນອກ.

ບົດ​ແນະ​ນຳ​ຂອງ​ໂປດ​ແກຣມ

ເປີດ​ບອມເປເຣສ​ສັ້ນໆ ແລະ ອ່ານ​ການ​ໂຄສະນາ. ປັບ​ຄວາມ​ໄວ​ເພື່ອ​ໃຊ້​ພະລັງງານ, ປ່ຽນ​ຮູບແບບ​ເປັນ MP3 ສຳ​ລັບ​ໄຟລ໌​ທີ່​ນ້ອຍ​ກວ່າ.

ສຽງ

ລະບົບໂທ​ລະ​ສັບ​ແຈ້ງ​ເຕືອນ. ຜົນ​ອອກ​ມາ​ທີ່ມີ​ຄຸນນະພາບ​ຄື​ກັບ​ສະຕູດິໂອ ໂດຍບໍ່​ຕ້ອງ​ຈອງ, ບັນທຶກ, ຫຼື NDAs ດ້ວຍ​ສຽງ​ທີ່​ມີ​ພອນ​ສະຫວັນ.

ຄວາມສາມາດ​ໃນ​ການ​ເຂົ້າເຖິງ

ເພີ່ມສຽງ​ພ້ອມ​ກັບ​ເນື້ອໃນ​ທີ່​ຂຽນ​ໄວ້​ສຳລັບ​ຜູ້​ອ່ານ​ທີ່​ມີ​ການ​ເບິ່ງ​ເຫັນ​ຕ່ຳ ແລະ ​ຄົນ​ພິການ​ທາງ​ການ​ອ່ານ. ວາງ​ໃສ່​ໃນ​ໜ້າ​ໃດ​ກໍ​ໄດ້.

ຕົວຢ່າງ​ຄຳສັບ

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

ລາຄາ

ສ້າງຂື້ນຈາກສະລອຍນ້ ຳ ທີ່ບໍ່ເສຍຄ່າຂອງທ່ານທຸກໆມື້ກ່ອນ; ເມື່ອມັນ ໝົດ ໄປ, ຊຸດ tokens ຈ່າຍເລີ່ມຕົ້ນທີ່ $5 → 200,000 tokens. ປະມານ ~5 tokens ຕໍ່ຕົວອັກສອນ, ຢ່າງ ໜ້ອຍ 100 ຕໍ່ຄລິບ.

ແຫຼ່ງ​ແບບ​ເຕັມ → · ເບິ່ງ​ສຽງ TTS ທັງໝົດ → · ປຽບທຽບ2ສຽງ​ຂ້າງ​ຕໍ່​ຂ້າງ →

ຕົວເລືອກ​ລະດັບ​ສູງ
ຜົນ
ບັດ​ທອງ​ເຫຼືອ​ບໍ່​ພຽງພໍ​ແລ້ວ Get More Tokens
Want better results? ແບບ​ພິເສດ (GPT-5, Claude, Gemini) deliver higher quality. View Plans

❤️ ຮັກ Free.ai? ເວົ້າກັບເພື່ອນຂອງທ່ານ!

ລົງທະບຽນ ເພື່ອໄດ້ຮັບລິ້ງແນະນໍາແລະຫາເງິນ 25,000 ບັດຕໍ່ເພື່ອນ.

ຕ້ອງການ​ເພີ່ມ​ເຕີມ​ບໍ? Sign up free for 10,000 tokens
ລົງທະບຽນຟຣີ

ກຳລັງ​ປະມວນຜົນ​ຄໍາຮ້ອງຂໍ​ຂອງທ່ານ...

Sesame CSM-1B — Apache 2.0. Conversational Speech Model designed for low-latency, real-time voice. 24 kHz output, sounds best with a short reference-audio …

ວິທີການ​ໃຊ້ AI Voice — Sesame CSM-1B

1
បញ្ចូល​ຂໍ້ມູນ​ເຂົ້າ​ມາ​ຂອງ​ທ່ານ

ພິມຂໍ້ຄວາມ, ສົ່ງ​ໄຟລ໌​ຂຶ້ນ​ໄປ, ຫຼື ອະທິບາຍ​ສິ່ງທີ່​ທ່ານ​ຕ້ອງການ. ບໍ່ມີ​ບັນຊີ​ທີ່​ຕ້ອງການ.

2
ສ້າງ​

AI ຂອງພວກເຮົາ ຈັດການຄໍາຮ້ອງຂໍຂອງທ່ານໃນສອງສາມວິນາທີ ໂດຍໃຊ້ແບບຟອມ Open-Source ທີ່ດີທີ່ສຸດ.

3
ດາວໂຫລດ ແລະ ແບ່ງປັນ

ດາວໂຫລດ, ຖ່າຍທອດ, ຫຼື ແບ່ງປັນຜົນງານຂອງທ່ານ. ໂດຍບໍ່ເສຍຄ່າ ສຳ ລັບໃຊ້ສ່ວນຕົວ ແລະ ການຄ້າ.

ប្រើ​ເຄື່ອງມື​ນີ້​ຜ່ານ API

ເຄື່ອງມືນີ້ອັດຕະໂນມັດຈາກໂປຣແກຣມຂອງທ່ານເອງ. OpenAI-ເຂົ້າກັນໄດ້ REST endpoint, Bearer-token auth, ບໍ່ຈໍາເປັນຕ້ອງມີ SDK ເພີ່ມເຕີມ. ຄ່າໃຊ້ຈ່າຍຂອງ token ກົງກັບເວບໄຊທ໌.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "voice": "af_heart", "model": "kokoro"}'

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once depleted, paid tokens start at $5 → 200,000 tokens. Roughly ~5 tokens per character, minimum 100 per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.

Yes — POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

Yes — POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). Returns WAV or MP3. See /api/ for full reference + SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Yes — Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in-scope.

Yes — failed jobs auto-refund to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.

ລົງທະບຽນຟຣີສໍາລັບ 10,000 ຕົວແທນ

ສ້າງ​ບັນຊີ​ຟຣີ

ບໍ່ມີ​ບັດ​ເຄຣດິດ​ທີ່​ຕ້ອງການ

ທ່ານຈະໃຫ້ຄະແນນເຄື່ອງມືນີ້ແນວໃດ?

4.3/5 from 3 ratings

ຮັກ Free.ai? ເວົ້າກັບເພື່ອນຂອງທ່ານ!