Fal Speech-to-Text

Free.ai · stt · ~500 tokens per minute


Free models run on our GPUs; Fal Speech-to-Text requires an upgrade.

Fal Speech-to-Text is a speech-to-text model. It is served through an external provider at ~500 tokens per minute (a 50% markup over the upstream cost).

Use via the API

OpenAI-compatible REST API. Generate a key and call this model in seconds.

curl -X POST https://api.free.ai/v1/stt/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"model":"premium/speech-to-text","audio_url":"https://..."}'
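
For readers who prefer Python, the same call can be sketched with the standard library. This is a minimal sketch, not an official client: the endpoint, headers, and payload fields are copied from the curl command above, while the function names `build_stt_request` and `transcribe` are hypothetical, and the shape of the JSON reply is an assumption.

```python
import json
import urllib.request

API_URL = "https://api.free.ai/v1/stt/"  # endpoint copied from the curl example


def build_stt_request(api_key: str, audio_url: str) -> urllib.request.Request:
    """Build (but do not send) the same POST request as the curl example."""
    payload = {"model": "premium/speech-to-text", "audio_url": audio_url}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def transcribe(api_key: str, audio_url: str) -> dict:
    """Send the request and decode the reply, assuming the body is JSON."""
    with urllib.request.urlopen(build_stt_request(api_key, audio_url)) as resp:
        return json.load(resp)
```

Keeping request construction separate from sending makes it easy to inspect the URL, headers, and body before any tokens are spent.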

Frequently asked questions

Fal Speech-to-Text transcribes spoken audio into text. Upload an MP3, WAV, M4A, or video file, and Fal Speech-to-Text returns the full transcript, with optional SRT/VTT captions and timestamps.

Fal Speech-to-Text supports many languages: Whisper-family models cover 90+, Parakeet covers ~25, and others vary. Select "auto-detect", or specify the language for the highest accuracy.

Word error rate is typically 5–10% on clear English audio and 10–20% on more difficult audio, such as noisy recordings.
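
Word error rate (WER) is conventionally the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below shows that standard definition so the percentages can be checked against your own audio; the function name is hypothetical, and the normalization (lowercasing, whitespace splitting) is a simplifying assumption, not necessarily how the figures on this page were measured.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

At a 5–10% rate, a 20-word sentence would contain roughly one or two wrong, missing, or extra words.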

Yes, timestamps are included: each segment carries start/end times. Export as SRT or VTT, with the timing mapped onto your video.
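
To illustrate how per-segment start/end times map onto the SRT format, here is a minimal sketch. The segment keys `start`, `end` (in seconds), and `text` are assumptions about the response shape, and both function names are hypothetical; only the SRT layout itself (index, `HH:MM:SS,mmm --> HH:MM:SS,mmm`, text, blank line) is standard.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list[dict]) -> str:
    """Render segments (assumed keys: start, end, text) as an SRT document."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        timing = f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}"
        blocks.append(f"{index}\n{timing}\n{seg['text'].strip()}")
    return "\n\n".join(blocks) + "\n"
```

VTT differs mainly in its `WEBVTT` header line and in using a period instead of a comma before the milliseconds.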

Fal Speech-to-Text is a paid transcription engine. It costs roughly 500–1,500 tokens per minute of audio. $1 = 750,000 tokens.
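
The pricing arithmetic can be worked through directly. This sketch uses only the two figures quoted above (500–1,500 tokens per audio minute, 750,000 tokens per dollar); the function name is hypothetical and actual billing may round differently.

```python
TOKENS_PER_DOLLAR = 750_000  # from "$1 = 750,000 tokens"


def estimate_cost_usd(audio_minutes: float, tokens_per_minute: float) -> float:
    """Estimated dollar cost for a given audio length and token rate."""
    return audio_minutes * tokens_per_minute / TOKENS_PER_DOLLAR


# A 60-minute recording at the low and high ends of the quoted range.
low = estimate_cost_usd(60, 500)     # 30,000 tokens
high = estimate_cost_usd(60, 1_500)  # 90,000 tokens
print(f"60 minutes: ${low:.2f} to ${high:.2f}")
```

Under these figures, an hour of audio costs between about 4 and 12 cents.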

Supported formats are MP3, WAV, M4A, FLAC, and OGG, plus video (MP4, MOV, WebM), from which we extract the audio. The maximum upload size is 500 MB. Larger files? Split or shrink them with the site's audio tools, or send them to /v1/stt/batch/.
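
A client can check a file against these limits before uploading. The following is a minimal sketch: the extension list and the 500 MB cap come from the paragraph above, but treating "MB" as 10^6 bytes and judging the format by file extension alone are assumptions, and `upload_problems` is a hypothetical helper.

```python
from pathlib import PurePath

AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg"}
VIDEO_EXTENSIONS = {".mp4", ".mov", ".webm"}
MAX_UPLOAD_BYTES = 500 * 1000 * 1000  # assumption: "MB" means 10**6 bytes


def upload_problems(filename: str, size_bytes: int) -> list[str]:
    """Return a list of reasons the file would be rejected (empty if none)."""
    problems = []
    suffix = PurePath(filename).suffix.lower()
    if suffix not in AUDIO_EXTENSIONS | VIDEO_EXTENSIONS:
        problems.append(f"unsupported extension: {suffix or '(none)'}")
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append("file exceeds 500 MB; split it or use batch")
    return problems
```

For a real file, `os.path.getsize(path)` supplies the `size_bytes` argument.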

Speaker diarization is a separate toggle: switch on speaker labeling on the transcribe page. Fal Speech-to-Text handles the transcription; diarization labels each segment with Speaker 1 / Speaker 2 / and so on.

Yes, batch transcription is supported: /batch/ accepts a folder of audio files. Each transcript lands in /account/?tab=history under its original filename. To preserve the folder tree structure, use the API.

Yes, there is an API: POST your audio to /v1/stt/transcribe/ with the model "Fal Speech-to-Text". The response is JSON containing the text, segments, and timestamps. The full reference is at /api/.

Models hosted in-house process audio on our GPUs; premium models are routed to the provider under a DPA. Audio is deleted after the retention window (24h for anonymous users, 7d for signed-in users). We do not train on your inputs.

Yes, Free.ai permits commercial use of transcripts. You need the rights to the audio you upload (your own recording, licensed material, or content used with consent).

The real-time factor is about 0.05–0.2×: a 60-minute podcast is transcribed in 3–12 minutes. Premium models finish faster. Use the toggle to lock your selection.
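
The real-time factor is simply processing time divided by audio duration, so the quoted range converts to a wait-time estimate as sketched below. The function name is hypothetical, the defaults are the 0.05 and 0.2 figures from the text, and queueing or upload time is not modeled.

```python
def estimated_processing_minutes(
    audio_minutes: float, rtf_low: float = 0.05, rtf_high: float = 0.2
) -> tuple[float, float]:
    """Return (fastest, slowest) processing time for the given audio length."""
    return audio_minutes * rtf_low, audio_minutes * rtf_high


fastest, slowest = estimated_processing_minutes(60)
print(f"60-minute podcast: about {fastest:.0f} to {slowest:.0f} minutes")
```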
