Wizper (Whisper v3)

Free.ai · stt · ~500 tokens per minute

Drop an audio or video file, or paste a URL below

~500 tokens per minute
Runs free on our GPUs. Upgrade for Wizper (Whisper v3) →

Wizper (Whisper v3) is a speech-to-text model. It is billed per use at about 500 tokens per minute (a 50% markup over the upstream rate).

Use via API

OpenAI-compatible REST API. Generate a key and call this model in seconds.

curl -X POST https://api.free.ai/v1/stt/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"model":"premium/wizper","audio_url":"https://..."}'
Docs · Get API key
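For readers calling the API from Python rather than curl, here is a minimal sketch that builds the same request with the standard library. The endpoint, headers, and the `model` and `audio_url` fields are copied from the curl example above. The function name `build_stt_request` is hypothetical, and the response format is not specified here, so the sketch stops before sending.

```python
import json
import urllib.request

# Endpoint taken from the curl example on this page.
API_URL = "https://api.free.ai/v1/stt/"


def build_stt_request(api_key: str, audio_url: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example.

    `build_stt_request` is an illustrative helper, not part of any SDK.
    """
    payload = {"model": "premium/wizper", "audio_url": audio_url}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

To send it, pass the returned object to `urllib.request.urlopen` and decode the body as JSON. Check the API docs for the actual response fields before relying on them.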

Frequently asked questions

Wizper (Whisper v3) transcribes spoken audio into text. Upload an MP3, WAV, M4A, or video file and Wizper (Whisper v3) returns a full transcript plus SRT/VTT subtitles with timestamps.

Wizper (Whisper v3) supports many languages - Whisper-family models cover 90+, Parakeet covers ~25, and others vary. Choose "auto-detect" or specify the language for higher accuracy.

Word error rate is 5-10% on clean English audio and 10-20% on noisy or accented audio. Larger model variants do better on difficult recordings - choose the large variant when audio quality is poor.

Yes - every segment includes start and end timestamps. Export as SRT or VTT and the timings map accurately onto your video.
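To illustrate how per-segment start and end times become subtitle cues, the sketch below renders a list of segments as SRT. The segment shape (dicts with `start` and `end` in seconds plus `text`) is an assumption made for this example, not the documented response schema.

```python
from __future__ import annotations


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list[dict]) -> str:
    """Render segments as numbered SRT cues separated by blank lines.

    Each segment is assumed (hypothetically) to look like
    {"start": 0.0, "end": 1.25, "text": "Hello"}.
    """
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(cues)
```

VTT differs mainly in its `WEBVTT` header line and in using a period instead of a comma before the milliseconds.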

Wizper (Whisper v3) is a transcription engine. It costs roughly 500-1,500 tokens per minute of audio. $1 = 750,000 tokens.
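As a worked example of the pricing above, this small sketch converts audio length into a token and dollar range. The two constants come straight from this page. The helper name `estimate_cost` is hypothetical, and actual billing may round differently.

```python
TOKENS_PER_DOLLAR = 750_000        # from this page: $1 = 750,000 tokens
TOKENS_PER_MINUTE = (500, 1_500)   # from this page: ~500-1,500 tokens per minute


def estimate_cost(minutes: float) -> dict:
    """Return the (low, high) token and USD estimate for `minutes` of audio."""
    low_tokens = minutes * TOKENS_PER_MINUTE[0]
    high_tokens = minutes * TOKENS_PER_MINUTE[1]
    return {
        "tokens": (low_tokens, high_tokens),
        "usd": (low_tokens / TOKENS_PER_DOLLAR, high_tokens / TOKENS_PER_DOLLAR),
    }
```

For a 60-minute recording this gives 30,000 to 90,000 tokens, or about $0.04 to $0.12.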

MP3, WAV, M4A, FLAC, OGG, and video (MP4, MOV, WebM) - we extract the audio. Max 500 MB per upload. Long files? Split them at /audio/cut/ or use /v1/stt/batch/.
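A client-side pre-check can catch unsupported or oversized files before uploading. The sketch below encodes the formats and size limit listed above. The helper `check_upload` is hypothetical, and it assumes "500 MB" means 500 × 1024 × 1024 bytes, which the page does not state.

```python
from __future__ import annotations

from pathlib import Path

# Formats listed on this page (audio plus video containers).
SUPPORTED_EXTENSIONS = {
    ".mp3", ".wav", ".m4a", ".flac", ".ogg", ".mp4", ".mov", ".webm",
}
# Assumption: "500 MB" is interpreted as 500 MiB.
MAX_UPLOAD_BYTES = 500 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems = []
    extension = Path(filename).suffix.lower()
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {extension or '(none)'}")
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append("file exceeds 500 MB; split it or use the batch endpoint")
    return problems
```

The check looks only at the file extension and size, so a mislabeled file would still pass and be rejected server-side.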

Speaker diarization is a separate setting - turn on "diarize" at /transcribe/. Wizper (Whisper v3) handles the transcription; diarization labels each segment as Speaker 1 / Speaker 2 / etc.

Yes - /batch/ accepts a folder of audio files. Each result is saved to /account/?tab=history under its original filename. To preserve a folder tree, use the API.

Yes - POST your audio to /v1/stt/transcribe/ with model="Wizper (Whisper v3)". It returns JSON with the text, segments, and word-level timestamps. /api/ has the full documentation.

Self-hosted models process audio on our GPUs; anything else is covered by a DPA. Data is deleted after a retention window (24h anonymous, 7d signed-in). We do not train on your input.

Yes - Free.ai grants commercial-use rights to the transcripts. You still need rights to the source recording (your own recording, a license, or permission).

Real-time factor is roughly 0.05-0.2×, so a 60-minute podcast transcribes in 3-12 minutes. Premium models sometimes finish faster. Use the background button if you want to close the tab.
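The estimate above is just audio length multiplied by the real-time factor. A minimal sketch of that arithmetic follows, using the 0.05-0.2 range quoted on this page as defaults; the function name `processing_minutes` is hypothetical.

```python
from __future__ import annotations


def processing_minutes(
    audio_minutes: float,
    rtf_low: float = 0.05,   # fastest real-time factor quoted on this page
    rtf_high: float = 0.2,   # slowest real-time factor quoted on this page
) -> tuple[float, float]:
    """Return the (best, worst) expected processing time in minutes."""
    return audio_minutes * rtf_low, audio_minutes * rtf_high
```

With the defaults, `processing_minutes(60)` reproduces the 3 to 12 minute range for a one-hour podcast.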
