Sespain e mahala

Ngola Sespain audio le video ho ea ho mongolo ka AI. E potlakile, e nepahetse, e mahala.

Ho sebetsa joang

  1. E-ba le Free.ai Transcriber
  2. Kopitsa faele ea hau ea Sespain ea audio kapa video
  3. AI ea rona e tla fumana Sespain ka ho toba'me ngole
  4. Kopitsa transcript ea hau e le tekanyo kapa lihlooho tsa SRT

Sespain Likarolo tsa ho ngola

  • E sebetsa ka faster-whisper (MIT licensed)
  • Ho fumana puo Sespain ka ho toba
  • E tšehetsa MP3, WAV, MP4, M4A, FLAC, le tse ling
  • Timestamps le ho romelloa ha lihlooho (SRT)
  • Ha ho na liphelelo tsa boholo ba faele ka likhetho tse lefelloeng
  • Secha le se sireletsehileng -- lifaele li tlosoa ka mor'a ho sebetsana

Lintlha tsa puo

SenyesemaneSespain
ISO Codees
Mofuta oa AIfaster-whisper
Theko_Tsela

Li-languages tse ling

Tseba Li-Languages Tseo

Lipotso tse botsoang khafetsa

Whisper e kholo-v3-turbo e lula ka holimo ho Sespain - ka tlase ho 7% ea palo ea liphoso tsa mantsoe ka litlhahlobo tse tloaelehileng. Ka ho etsa hore ho bolela hore audio ea studio e hloekileng e khutlela morao e le haufi le ho phethahala,'me audio ea puisano e ka sebelisoa le ho hloekisa ho fokolang.(Tier A, under 7% word error rate ka lihlopha tsa benchmark - re phatlalatsa li-tiers tsa WER tse tšepahalang ho feta li-claims tsa thekiso.)

E-na — Sespain transcription e nka ho tloha ho pholletsa le letsatsi token pool ea hao mahala pele. Audio theko ka 50 tokens ka metsotsoana, ka hona pholletsa le letsatsi pool Anonymous e ka fihlela lihora tse'maloa tsa audio ka letsatsi. Signed-ka akhaonteng fumana pool e kholo le 10,000 signup tokens. Past hore, $ 1 reka 750,000 tokens (~ 250 lihora tsa audio).

Spanish e koahela Castilian (Spain), Mexican, Argentine (rioplatense), Caribbean, le Andean mefuta. Whisper e ne e sebediswa ka ho kopanya le ho sebetsana le tsohle tse tharo ka mohlala o tšoanang — feela ho feta puo = es le transcript tla bontša leha e le efe dialect ke ka audio (ho akarelletsa le voseo le seseo).

MP3, WAV, M4A, FLAC, OGG, OPUS, le WEBM li amoheloa ka kotloloho. Bakeng sa video (MP4, MOV, MKV) re tlosa li-track tsa audio ka lehlakoreng la mosebelisi pele re li romella ho Whisper — ha o hloka ho fetola ntho efe kapa efe ka boeona. Pipeline e ts'oanang ntle le puo ea mohloli, ho kenyeletsoa Sespain.

Li-account tse kentsoeng ka har'a li-account li fihla ho 2 GB. Nako e telele ha e lekanyetsoe - li-file tse telele li kopanngoa ka ho toba (lifensetere tsa metsotsoana e 30 le ho koaloa)'me li kopanngoa ho ea ho transcript e le' ngoe le li-timestamps tse tsoelang pele. Li-recording tsa Sespain tse fetang lihora tse ngata (podcasts, lihlooho tse felletseng, liboka) li sebetsa hantle.

Ha ho joalo — ho arolelana lingoliloeng tsa moqoqo ho hokahantsoe ka ho sa feleng bakeng sa Sespain transcript e ngoe le e ngoe. Liphetho li arotsoe e le Moqoqo 1 / Moqoqo 2 / Moqoqo 3 le li-timestamps, ka hona lipotso tsa puisano, li-panel discussions, le liketsahalo tse ngata li khutlela ka letšoao. Ho arolelana lingoliloeng tsa moqoqo ho sebetsa ka mokhoa o fapaneng'me ho sebetsa ka tsela e ts'oanang ka lipuo tsohle tseo re li tšehetsang.

E-ea — kopanya URL ho /transcribe/youtube/ bakeng sa YouTube kapa /transcribe/podcast/ bakeng sa podcast feeds (Apple, Spotify, RSS). Re kenya audio, e tsamaisa ka Whisper le language=es,'me re khutlisetse transcript le timestamps le li-labels tsa moqolotsi oa litaba. Lintlha tsa Sespain: podcasts, lihlooho, lipotso, le ho feta-bophara-bophahamo YouTube litaba ka Sespain ke tse tloaelehileng haholo workloads re bona.

Whisper e theko e boima ka li-token tsa 50 ka metsotsoana ea audio, ka hona ho ngolisoa ha hora e le 'ngoe ke li-token tsa ~ 3,000. $ 1 e reka li-token tsa 750,000, tse sebetsang ho fihlela lihora tse 250 tsa audio ka dollar. Basebelisi ba bangata ha ba sebelise ntho e ngoe le e ngoe - pool ea letsatsi le letsatsi e nang le li-clip tse khuts'oane, li-notes tsa puisano le podcasts tse le' ngoe.

Ee — li-timestamps tsa lefatše la sehlooho (ka ~10-30 sekontiri) le tsa lefatše la mantsoe li fumaneha. Lefatše la mantsoe ke la morao-rao bakeng sa ho romelloa ha li-subtitles tsa VTT/SRT ka hona li-captions li hokahana line-by-line. Ka API beha timestamps="word" ka'mele oa kopo. Li-transcripts tsa Sespain li khutlisoa ka UTF-8 e tloaelehileng le ho ngola ka mokhoa o tloaelehileng oa puo.

Ee. POST audio (multipart/form-data, field name "file") ho /v1/transcribe/ le language=es — kapa u se ke ua beha parameter ea puo ho etsa hore Whisper e fumane ka ho toba. E khutlisa JSON le transcript, segments, timestamps, le labels tsa motsamaisi. Litlhahlobo tse felletseng le li-snippets tsa SDK li ka fumaneha ho /api/.

E-ea — ha transcription e felile, tobetsa ho fetolela kapa ho kopanya tekanyo ho /translate/. Sespain e kopantsoe le lipuo tsohle tse ling tseo re li tšepang (200+). Bakeng sa lihora tsa kopano, transcript e fetisoa ka /summarize/; bakeng sa ho ngola ka letsoho, e romelle ho /voice/tts/ ho etsa molumo ka puo e loketseng.

Whisper e koetlisitsoe ka lihora tse 680K tsa molumo o moholo oa lefatše la 'nete, ka hona ho ngola ka Sespain ho thata ho mohala o moholo oa morao, li-beds tsa molumo, le ho ngola ka boleng ba mohala. Ho qeta ka thata kapa ho qeta li-speakers tse ngata ho tla senya botsitso.Haeba transcript e sa khone ho sebelisoa, romella lengolo-tsoibila contact@free.ai le faele — re tla lefa tokens'me re sheba hore na e mong oa li-engine tse fapaneng o sebetsana joang le audio ea hau ka ho fetisisa.

U rata Free.ai? Reka le metsoalle ea hau!

Ratela leqepheng lena