Question 1

How accurate is Whisper transcription for Arabic?

Accepted Answer

Whisper large-v3-turbo handles Arabic solidly — 7-15% word error rate on benchmark audio. Expect occasional substitutions on named entities, numbers, and dense technical vocabulary; the bulk of the transcript will be correct. (Tier B, 7-15% word error rate on benchmark sets — we publish honest WER tiers rather than marketing claims.)

Question 2

Is Arabic audio-to-text transcription free?

Accepted Answer

Yes — Arabic transcription draws from your daily free token pool first. Audio costs about 50 tokens per minute, so the anonymous daily pool covers a few hours of audio per day. Signed-in accounts get a larger pool plus 10,000 signup tokens. Past that, $1 buys 750,000 tokens (~250 hours of audio).

Question 3

Does the transcriber handle Arabic dialects?

Accepted Answer

Arabic is handled at the Modern Standard Arabic (MSA) level by default. Egyptian, Levantine, Gulf, and Maghrebi colloquial speech are recognized but transcribed in MSA orthography — Whisper does not romanize or preserve dialect-specific spellings. For pure MSA news/lecture audio expect tier-B accuracy; heavy Maghrebi or Egyptian colloquial pushes that lower.

Question 4

What audio formats are supported for Arabic transcription?

Accepted Answer

MP3, WAV, M4A, FLAC, OGG, OPUS, and WEBM are accepted directly. For video (MP4, MOV, MKV) we extract the audio track server-side before sending it to Whisper — you do not need to convert anything yourself. Same pipeline regardless of source language, including Arabic.

Question 5

How long can a Arabic audio file be?

Accepted Answer

Anonymous uploads cap at roughly 500 MB per file. Signed-in accounts go up to 2 GB. Duration is not a hard limit — long files are chunked automatically (30-second windows with overlap) and stitched back into a single transcript with continuous timestamps. Multi-hour Arabic recordings (podcasts, full lectures, meetings) work fine.

Question 6

Does the Arabic transcript identify different speakers?

Accepted Answer

Yes — speaker diarization is on by default for every Arabic transcript. The output is segmented as Speaker 1 / Speaker 2 / Speaker 3 with timestamps, so interviews, panel discussions, and multi-party meetings come back labeled. Diarization runs on a separate model and works the same across all languages we support.

Question 7

Can I transcribe a Arabic YouTube video or podcast?

Accepted Answer

Yes — paste the URL into /transcribe/youtube/ for YouTube or /transcribe/podcast/ for podcast feeds (Apple, Spotify, RSS). We download the audio, run it through Whisper with language=ar, and return the transcript with timestamps and speaker labels. Typical Arabic content: news clips, sermons, lectures, and political interviews in Arabic are the most common workloads; paste a YouTube URL into /transcribe/youtube/ or upload the file.

Question 8

How much does an hour of Arabic audio cost in tokens?

Accepted Answer

Whisper costs about 50 tokens per minute of audio, so a one-hour recording is ~3,000 tokens. $1 buys 750,000 tokens, which works out to roughly 250 hours of audio per dollar. Most users never spend anything — the free daily pool covers short clips, voice notes, and one-off podcasts.

Question 9

Can I get word-level timestamps for Arabic audio?

Accepted Answer

Yes — both segment-level (every ~10-30 seconds) and word-level timestamps are available. Word-level is the default for VTT/SRT subtitle export so the captions sync line-by-line. On the API set timestamps="word" in the request body. Arabic transcripts are returned in their native right-to-left script and render correctly in any RTL-aware viewer (browsers, Word, Google Docs).

Question 10

Is there an API for Arabic transcription?

Accepted Answer

Yes. POST audio (multipart/form-data, field name "file") to /v1/transcribe/ with language=ar — or omit the language parameter to let Whisper auto-detect. Returns JSON with the transcript, segments, timestamps, and speaker labels. Full reference and SDK snippets at /api/.

Question 11

Can I translate the Arabic transcript into another language?

Accepted Answer

Yes — once transcription finishes, click Translate or paste the text into /translate/. Arabic pairs with every other language we support (200+). For meeting minutes pipe the transcript through /summarize/; for dubbing send it to /voice/tts/ to render audio in the target language.

Question 12

What if the Arabic audio is noisy or low-quality?

Accepted Answer

Whisper is trained on hundreds of thousands of hours of real-world audio, so it tolerates background noise and phone-quality recordings on Arabic. For best results, supply clean audio (headset mic, no music bed) — at this tier noise compounds the baseline error rate. If a transcript comes back unusable, email contact@free.ai with the file — we will refund the tokens and look at whether a different engine handles your audio better.

Language	Arabic
ISO Code	`ar`
AI Model	faster-whisper
Price	Free

Free Arabic Transcription

How It Works

Arabic Transcription Features

Language Details

More Languages

FAQ