AI Voice — Sesame CSM-1B

Commercial use OK · 380+ models · No watermark · No sign-up required
TTS engine · Self-hosted · Apache 2.0
Sesame CSM-1B (Apache 2.0): a Conversational Speech Model designed for low-latency, real-time voice. It outputs 24 kHz audio and sounds best when given a short reference-audio context turn. Self-hosted on Free.ai for the /voice/realtime/ tool.

What does Sesame CSM-1B sound like?


Try it in the box above: "Hello, my name is Sam, and I am reading this sample to demonstrate the voice." This is the standard TTS demo sentence.

When to use Sesame CSM-1B

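Audiobooks

Long-form narration with a consistent tone. Paste one chapter at a time, download each as WAV or MP3, and stitch the files together in an external editor.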

Podcast intros

Short opening bumpers and ad reads. Adjust the speed to control energy, and switch the format to MP3 for smaller files.

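IVR + voicemail

Phone-system prompts with studio-quality output, with no studio booking, no recording sessions, and no agreements with voice talent.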

Accessibility

Add audio alongside written content for readers with low vision or dyslexia. Drop it into any page.

Sample phrases

"Welcome to the show, today we are exploring the future of AI."
"Your package has arrived. Please retrieve it from the front desk."
"Once upon a time, in a quiet village far away, lived a curious child."
"Press one for sales, two for support, or stay on the line for an agent."
"Breaking news: scientists have discovered a new species of deep-sea fish."
"Thank you for choosing us. We appreciate your business and look forward to serving you again."

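Pricing

Self-hosted on our own GPUs. Generation draws from your daily free pool first. Once that is depleted, paid token packs start at $5 for 200,000 tokens. Roughly 5 tokens per character, with a minimum of 100 tokens per clip.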
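
The pricing rule above can be turned into a quick estimate. The following is a minimal sketch, not an official calculator: it assumes a "character" is one Python string element, that whitespace and punctuation count, and that the $5 / 200,000-token pack rate can be prorated per clip. The function names are illustrative only.

```python
TOKENS_PER_CHAR = 5        # "roughly 5 tokens per character" (approximate)
MIN_TOKENS_PER_CLIP = 100  # stated per-clip minimum
PACK_PRICE_USD = 5.0       # entry-level paid pack price
PACK_TOKENS = 200_000      # tokens included in that pack


def estimate_tokens(text: str) -> int:
    """Approximate token cost of one clip, applying the per-clip minimum."""
    return max(MIN_TOKENS_PER_CLIP, len(text) * TOKENS_PER_CHAR)


def estimate_usd(text: str) -> float:
    """Approximate paid cost in USD, prorating the $5 / 200,000-token pack."""
    return estimate_tokens(text) * PACK_PRICE_USD / PACK_TOKENS


if __name__ == "__main__":
    sample = "Press one for sales, two for support, or stay on the line for an agent."
    print(estimate_tokens(sample), estimate_usd(sample))
```

Because the rate is approximate, treat the result as a budget guide rather than an exact charge.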



How to use AI Voice — Sesame CSM-1B

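1. Enter your input

Type text, upload a file, or describe what you want. No account required.

2. Click Generate

Our AI processes your request in seconds using the best available open-source models.

3. Download or share

Download, copy, or share your result. Free for personal and commercial use.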

Use this tool via the API

Automate this tool from your own code. OpenAI-compatible REST endpoint, Bearer-token authentication, no extra SDK required. Token costs match the web interface.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"text": "Hello from Free.ai", "model": "Sesame CSM-1B"}'
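
The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: the endpoint and headers mirror the curl call, the `model` value follows the FAQ's `model="Sesame CSM-1B"`, and the response body is assumed to be the raw audio file (the page says only that the call returns WAV or MP3). `build_tts_request` and `synthesize` are illustrative names, not part of any Free.ai SDK.

```python
import json
import urllib.request

API_URL = "https://api.free.ai/v1/tts/"  # endpoint taken from the curl example


def build_tts_request(text: str, api_key: str,
                      model: str = "Sesame CSM-1B") -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl call."""
    payload = json.dumps({"text": text, "model": model}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize(text: str, api_key: str, out_path: str) -> None:
    """Send the request and save the response body to out_path.

    Assumption: the body is the audio file itself (WAV or MP3).
    """
    with urllib.request.urlopen(build_tts_request(text, api_key)) as resp:
        with open(out_path, "wb") as audio_file:
            audio_file.write(resp.read())
```

Keeping request construction separate from sending makes it possible to inspect or log the payload before any tokens are spent.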

AI Voice — Sesame CSM-1B — FAQ

Sesame CSM-1B supports a wide range of languages. The exact list depends on the engine; the form on this page accepts any text and the engine will render in its supported languages. See /voice/ for the full multi-engine picker if you need a specific language.

Most engines render neutral-American English by default and a region-appropriate accent for non-English languages. Premium engines may expose accent variants — paste a sample to compare.

SSML support varies by engine. Pause, prosody, and emphasis tags are honored on most premium engines and on a few self-hosted ones. Plain text always works — no markup required.

Streaming TTS is available on premium engines via the /v1/tts/ API endpoint with stream=true. The web UI on this page returns the full clip once rendering finishes.

Sesame CSM-1B runs on our own GPUs. Generation draws from your daily free pool first. Once that is depleted, paid token packs start at $5 for 200,000 tokens. Roughly 5 tokens per character, with a minimum of 100 tokens per clip.

Up to 5,000 characters per request on the web UI. For longer pieces (audiobooks, full chapters), use /voice/audiobook/ which chunks and stitches automatically, or call the API in a loop.
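
For the "call the API in a loop" route, the text first has to be split into pieces that fit the limit. Below is a minimal sketch of one way to do that, assuming the 5,000-character web UI limit also applies to API requests (the page does not say) and using a simple punctuation-based sentence split. `chunk_text` is an illustrative helper, not a Free.ai function.

```python
import re

MAX_CHARS = 5000  # web UI limit from the FAQ; assumed here to apply to the API too


def chunk_text(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # A single sentence longer than the limit is hard-split.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be sent as its own request and the resulting audio files joined in order. Splitting on sentence boundaries keeps pauses natural at the seams.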

Yes — POST a list of strings to /v1/tts/batch/, or use the workspace UI at /workspace/ to chain TTS into a longer pipeline (e.g., translate → speak → stitch).

Yes — POST text to /v1/tts/ with model="Sesame CSM-1B" (or the slug on this page). Returns WAV or MP3. See /api/ for full reference + SDK snippets.

This page is text-to-speech, not voice cloning — the voice is the engine's default. For voice cloning (uploading a reference audio), see /voice/clone/, which requires you to either own the voice rights or have explicit written consent.

Self-hosted engines run on Free.ai-owned GPUs; nothing leaves our servers. Premium engines pass text to upstream model providers under our DPA. We do not train on your inputs and do not sell data.

Yes — Free.ai grants commercial use of generated audio. The engine's underlying license (Apache 2.0, MIT, or vendor terms) is shown above and on the model reference page; in practice this means voiceovers, ads, podcasts, and apps are all in-scope.

Yes — failed jobs auto-refund to the source (daily pool or paid tokens). If a refund does not show up the same day, email contact@free.ai.
