OpenAI: GPT Audio Mini

OpenAI · tts · ~1147 Ii-token clip · 4.3 I- 3 Abathathi nxaxheba
~1147 Ii-token clip
Isebenza ngokukhululekileyo kwi-GPU yethu. I-Groupware OpenAI: GPT Audio Mini →

OpenAI: GPT Audio Mini yi an Umbhalo-usuka-kwilizwi eyenziwe ngu OpenAI. Ihamba ngeendlela ezingaphandle — ~1,147 ii-token Iqhosha ngalinye (50% yokuphawula ngaphezulu kwexabiso eliphezulu).

Sebenzisa nge-API

I-REST API ehambelana ne-OpenAI. Yenza iqhosha kwaye unxulumane nale modeli kwimizuzu.

curl -X POST https://api.free.ai/v1/tts/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"model":"openai/gpt-audio-mini","text":"hello world"}'
Uxwebhu lwe-API Fumana Isitshixo se API

Imibuzo ebuzwa rhoqo

OpenAI: GPT Audio Mini ixhasa uluhlu olubanzi lweelwimi. Uluhlu oluchanekileyo luxhomekeke kwi-engine; ifomu kule phepha ivuma nawuphi na umbhalo kwaye i-engine izakuveza kwiilwimi zayo ezixhaswayo. Bona /voice/ kwi-multi-engine picker epheleleyo ukuba ufuna ulwimi oluthile.

Iinjini ezininzi zibonisa isiNgesi esingenanto- saseMelika ngokumiselweyo kunye nesiqendu esifanelekileyo selizwe leelwimi ezingasazi-siNgesi. Iinjini eziphezulu zingabonisa iziguqulelo zesiqendu - dibanisa isampuli yokuthelekiswa.

Inkxaso ye SSML itshintsha ngokwenjini. Ukuphumla, iprosody, kunye ne tags yokungaqhelekanga zihlonishwa kwinjini ezininzi eziphezulu kunye nakwizinye ezinomphathi- we- self. Umbhalo ocacileyo usoloko usebenza - akukho phawulo lufunekayo.

Ukusasazwa kwe-TTS kufumaneka kwi-premium engines nge /v1/tts/ API endpoint nge stream=true. I-web UI kule phepha ibuyisela i-clip epheleleyo xa ukubonakalisa kugqityiwe.

OpenAI: GPT Audio Mini yinjini ye-TTS ephezulu. Ixabiso linyuka ngobalo lwamagama - ngokuqhelekileyo ~30 amaphawu ngamagama. $1 ithenga amaphawu angama-750,000, ngoko ke i-$5 igubungela amawaka amawaka egama.

Ukufikelela kwiimpawu ezingama-5,000 ngesicelo ngasinye kwi-web UI. Kwiinxalenye ezide (iincwadi zesandi, izihloko ezipheleleyo), sebenzisa /voice/audiobook/ equka i-chunks ne-stitches ngokuzenzekelayo, okanye thumela i-API kwi-loop.

Ewe — UTHENGA uluhlu lwamagama kwi /v1/tts/batch/, okanye sebenzisa isithuba sokusebenza UI kwi /workspace/ ukudibanisa i-TTS kwindlela ende yokuqhuba (umzekelo, guqulela → thetha → stitch).

Ewe — UTHENGA umbhalo kwi /v1/tts/ ngemodeli "OpenAI: GPT Audio Mini" (okanye i-slug kule phepha). Ibuyisela i-WAV okanye i-MP3. Bona /api/ ukubhekisa okupheleleyo + i-SDK snippets.

Eli phepha liyi-text-to-speech, hayi ukucloning kwelizwi - ilizwi limiselwe kwi-engine. Ukuklona kwelizwi (ukulayisha isandi esibhekisa kuyo), bona /voice/clone/, ekufuneka ube nayo ilungelo lelizwi okanye ube negunya elibhaliweyo elicacileyo.

Iinjini ezimkelweyo ziqhuba kwi-Free.ai- eyenziweyo ye GPUs; akukho nto ishiya iiseva zethu. Iinjini eziphezulu zidlulisa umbhalo kumaqela aphezulu abonelela ngemodeli phantsi kwe DPA yethu. Asiqeqeshi kwiingxelo zakho kwaye asithengi idata.

Ewe — Free.ai inikezela ngenkonzo yorhwebo lwesandi esiveliswe. Ilayisensi ephantsi ye-engine (i-Apache 2.0, MIT, okanye iimeko zomboneleli) ibonisiwe phezulu kunye nephepha lobhekiso lemodeli; kwimisebenzi le kuthetha ukuba i-voiceovers, i-ad, i-podcasts, kunye neenkqubo zekhompyutha zonke zikwi-ambithi.

Ewe - imisebenzi engaphumelelanga ibuyiselwa ngokuzenzekelayo kumbhali (i-pool yosuku okanye i-token ehlawulweyo). Ukuba ubuyiselo alubonakalanga ngaloo mini, thumela i-imeyile kwi contact@free.ai.

Uthanda i-Free.ai? Nceda utshele abahlobo bakho!

Iphepha elilandelayo