I-isi-Arabic yokudlulisa

Bhala isi-Arabic umsindo nevidiyo ube ngumbhalo nge-AI. Ishesha, ilungile, futhi imahhala.

Indlela isebenza ngayo

  1. Iya ku- Umshicileli we-Free.ai
  2. Layisha phezulu ihele lakho le-isi-Arabic lomsindo noma levidiyo
  3. I-AI yethu ithola ngokuzenzakalela i-isi-Arabic futhi ibhalela
  4. Layisha phezulu isingeniso sakho njengesihloko noma isingeniso se-SRT

isi-Arabic Izici zokudlulisa

  • Isebenza nge-faster-whisper (MIT licensed)
  • Ukuthola ulwimi isi-Arabic ngokuzenzakalela
  • Insiza MP3, WAV, MP4, M4A, FLAC, nezinye eziningi
  • Isikhathi sokufaka kanye nokukhishwa kwesihloko esingezansi (SRT)
  • Akukho mkhawulo wobukhulu befayela kuma-plans akhokhelwayo
  • Imfihlo nokuphepha -- amafayela asuswa ngemuva kokusebenza

Iminingwane yesilimi

I-Languageisi-Arabic
Ikhowudi ye-ISOar
Imodeli ye-AIi-faster-whisper
IntengoIkhululekile

Izilimi Eziningi

Bona zonke izilimi

Imibuzo ebuzwa kaningi

Whisper large-v3-turbo iphatha isi-Arabic ngokuqinile — 7-15% isilinganiso sephutha legama ku-benchmark audio. Lindela ukushintshwa okuthile kwezinhlamvu ezibizwa ngegama, ama-amanani, kanye ne-glossary eqinile yezobuchwepheshe; i-bulk ye-transcript izolungile.(I-Tier B, 7-15% word error rate kusethingi se-benchmark — sishicilela ama-tiers we-WER athembekile ngaphezu kokuphikisana nokumaketha.)

Yebo — isi-Arabic ukudluliswa kuqala kusuka ku-token pool yakho yamahhala yansuku zonke. Umsindo ubiza ama-token angama-50 ngomzuzu, ngakho-ke i-pool yansuku zonke engaziwa ifaka amahora ambalwa we-audio ngosuku. Ama-akhawunti abhalisiwe athola i-pool enkulu kanye nama-token angama-10,000 wokubhalisa. Phakathi kwalokhu, i-$1 ithenga ama-token angama-750,000 (amahora angama-250 e-audio).

Isi-Arabic siphathwa ku-Modern Standard Arabic (MSA) level ngokuzenzakalela. Isi-Egyptian, Levantine, Gulf, ne-Maghrebi colloquial speech ziyaziwayo kodwa zibhalwe nge-MSA orthography — i-Whisper ayibonisi noma igcina ukubhalwa kwe-dialect-specific. Ulwazi/ukufundisa kwe-MSA okucacile lindele ukunemba kwe-tier-B; i-heavy Maghrebi noma i-Egyptian colloquial icindezela okuphansi.

MP3, WAV, M4A, FLAC, OGG, OPUS, ne WEBM zivunyelwe ngokuqondile. Ngevidiyo (MP4, MOV, MKV) sikhipha umsindo we-server-side ngaphambi kokuthunyelwa ku-Whisper — awudingi ukushintsha noma yini ngokwakho. Ipayipi elifanayo ngaphandle komthombo we-language, kufaka phakathi i-isi-Arabic.

Ukufaka okungenagama kufinyelela kuma-500 MB ngefayela ngalinye. Ama-akhawunti abhalisiwe afinyelela ku-2 GB. Ukuphela kwesikhathi akuyona umkhawulo onzima - amafayela ade ahlukaniswa ngokuzenzakalela (amafasitela emizuzu engu-30 ahlukaniswe) futhi aphinde ahlukaniswe ibe yi-transcript eyodwa nesikhathi esiqhubekayo. Ukurekhodwa kwehora eliningi isi-Arabic (amapodcasts, izifundo ezigcwele, izinhlanganiso) kusebenza kahle.

Yebo — ukudweba umsindo komsindo kusetshenzisiwe ngokuzenzakalela kuwo wonke ama-isi-Arabic transcript. I-output ihlukaniswe njenge-Speaker 1 / Speaker 2 / Speaker 3 nge-timestamps, ngakho-ke izingqungquthela, izingqungquthela zepaneli, nezingqungquthela zeqembu eliningi zibuyela emuva zinikezwe i-label. Ukudweba umsindo kusebenza ngemodeli ehlukile futhi kusebenza ngokufanayo kuwo wonke ama-languages esiwaxhasayo.

Yebo — chofoza i-URL ku /transcribe/youtube/ ye-YouTube noma /transcribe/podcast/ ye-podcast feeds (Apple, Spotify, RSS). Silanda umsindo, siwuqhube nge-Whisper nge-language=ar, futhi sibuyisele i-transcript nge-timestamps ne-speaker labels. I-isi-Arabic ejwayelekile: amavidiyo ezindaba, izifundo, izifundo, kanye nezingqungquthela zepolitiki ku isi-Arabic yizinto ezivame kakhulu; chofoza i-YouTube URL ku /transcribe/youtube/ noma ulayishe ifayela.

I-Whisper ibiza cishe ama-token angama-50 ngomzuzu we-audio, ngakho-ke ukurekhodwa kwehora elinye kubiza ama-token angama-3,000. I-$1 ithenga ama-token angama-750,000, okusebenza cishe amahora angama-250 we-audio ngedola. Abaningi abasebenzisayo abachithanga lutho — i-pool yamahhala yosuku lonke ifaka ama-clip aphansi, ama-notes omsindo, nama-podcasts afanayo.

Yebo — zombili isigaba-sezinga (noma yikuphi ~10-30 imizuzwana) kanye negama-level timestamps zikhona. Igama-level yiphutha le-VTT/SRT subtitle export ngakho ama-captions asynchronize line-by-line. Kwi-API hlela timestamps="word" kwi-body yesicelo. isi-Arabic izixhumanisi zibuyiselwa ku-script yabo yasekhaya ekunene-kuya-kwesobunxele futhi zibonise ngokulungile kuwo wonke umbukeli owaziyo i-RTL (iziphequluli, i-Word, ama-Google Docs).

Yebo. POST umsindo (ingxenye/ifomu-data, igama lendawo "ihele") ku /v1/transcribe/ nge lingu=ar — noma ushiye i parameter yesilimi ukuze i Whisper ikwazi ukukhomba ngokuzenzakalela. Ibuyisela i JSON nge lingu, amasegmenti, ama-timestamps, nama-speaker labels. Umbiko ophelele kanye ne-SDK snippets ku /api/.

Yebo — uma ukuguqulelwa kuqediwe, chofoza guqula noma chofoza umbhalo ku /guqula/. isi-Arabic ixhumana nanoma iyiphi enye ulwimi esixhasayo (200+). Usuku lwengxoxo lidlulisa ukuguqulelwa /summarize/; ukuguqulelwa lithunyelwe ku /voice/tts/ ukuze kunikezwe umsindo kulimi oluzosetshenziswa.

I-Whisper iqeqeshwa ngamahora angama-100,000 we-audio yezwe langempela, ngakho-ke ithobela umsindo wesizinda kanye ne-phone-quality recordings ku-isi-Arabic. Ukuthola izimpendulo ezinhle, nikeza umsindo ohlanzekile (i-headset mic, akukho mbhede womculo) — kulolu hlobo lomsindo uhlanganisa isilinganiso sephutha le-baseline.Uma i-transcript ibuyela ingasebenzi, thumela i-imeyili ku contact@free.ai ngefayela — sizobuyisela imali ye-token futhi sibheke ukuthi ngabe i-engine eyahlukileyo iphatha umsindo wakho kahle.

Uthanda i-Free.ai? Ngisho nabahlobo bakho!

Linganisa lelikhasi