Question 1

How accurate is Whisper transcription for Kikantoni?

Accepted Answer

Kikantoni is a less-resourced language for Whisper: large-v3-turbo sits above 25% word error rate (WER) on benchmark sets, sometimes well above, which places it in Tier D. The transcript is useful for search and gist but should not be treated as publication-ready. If a higher-accuracy engine becomes available for Kikantoni, we will wire it in automatically. We publish honest WER tiers rather than marketing claims.
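For readers unfamiliar with the metric, here is a minimal sketch of how word error rate is usually computed: the word-level edit distance between a reference transcript and the engine's output, divided by the number of reference words. The function names are illustrative and this is not the benchmark code behind the tiers. It assumes whitespace-separated words; text written without spaces would first need a tokenizer, or a character-level variant of the same calculation.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


def is_tier_d(wer: float) -> bool:
    """Tier D as described above: over 25% WER."""
    return wer > 0.25
```

One wrong word in a four-word reference gives a WER of 0.25, which sits exactly on the boundary and is not yet Tier D under this reading of "over 25%".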

Question 2

Is Kikantoni audio-to-text transcription free?

Accepted Answer

Yes — Kikantoni transcription draws from your daily free token pool first. Audio costs about 50 tokens per minute, so the anonymous daily pool covers a few hours of audio per day. Signed-in accounts get a larger pool plus 10,000 signup tokens. Past that, $1 buys 750,000 tokens (~250 hours of audio).

Question 3

What script and orthography does the Kikantoni transcript use?

Accepted Answer

Kikantoni transcripts are returned in standard UTF-8 with the language's normal orthography.

Question 4

What audio formats are supported for Kikantoni transcription?

Accepted Answer

MP3, WAV, M4A, FLAC, OGG, OPUS, and WEBM are accepted directly. For video (MP4, MOV, MKV) we extract the audio track server-side before sending it to Whisper — you do not need to convert anything yourself. Same pipeline regardless of source language, including Kikantoni.

Question 5

How long can a Kikantoni audio file be?

Accepted Answer

Anonymous uploads cap at roughly 500 MB per file. Signed-in accounts go up to 2 GB. Duration is not a hard limit — long files are chunked automatically (30-second windows with overlap) and stitched back into a single transcript with continuous timestamps. Multi-hour Kikantoni recordings (podcasts, full lectures, meetings) work fine.
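To illustrate the chunking described above, here is a rough sketch of how a long recording can be split into 30-second windows with overlap and how chunk-local timestamps map back onto one continuous timeline. The 5-second overlap and the function names are assumptions for illustration; the actual overlap and stitching logic used by the service are not documented here.

```python
def chunk_windows(duration_s, window_s=30.0, overlap_s=5.0):
    """Return (start, end) pairs in seconds covering the whole recording."""
    if window_s <= 0 or not 0 <= overlap_s < window_s:
        raise ValueError("need window_s > 0 and 0 <= overlap_s < window_s")
    step = window_s - overlap_s  # how far each new window advances
    windows = []
    start = 0.0
    while start < duration_s:
        end = min(start + window_s, duration_s)
        windows.append((start, end))
        if end >= duration_s:
            break  # the last window reached the end of the audio
        start += step
    return windows


def to_global_time(window_start_s, local_time_s):
    """Shift a timestamp measured inside a chunk onto the full timeline."""
    return window_start_s + local_time_s
```

A 70-second file yields three windows under these assumptions: 0 to 30, 25 to 55, and 50 to 70 seconds. A word heard 2 seconds into the second window lands at 27 seconds in the stitched transcript.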

Question 6

Does the Kikantoni transcript identify different speakers?

Accepted Answer

Yes — speaker diarization is on by default for every Kikantoni transcript. The output is segmented as Speaker 1 / Speaker 2 / Speaker 3 with timestamps, so interviews, panel discussions, and multi-party meetings come back labeled. Diarization runs on a separate model and works the same across all languages we support.

Question 7

Can I transcribe a Kikantoni YouTube video or podcast?

Accepted Answer

Yes. Paste the URL into /transcribe/youtube/ for YouTube or /transcribe/podcast/ for podcast feeds (Apple, Spotify, RSS). We download the audio, run it through Whisper with language=yue, and return the transcript with timestamps and speaker labels. Lectures, interviews, voice notes, and other Kikantoni content all work; you can also upload the file directly.

Question 8

How much does an hour of Kikantoni audio cost in tokens?

Accepted Answer

Audio costs about 50 tokens per minute, so a one-hour recording comes to roughly 3,000 tokens. $1 buys 750,000 tokens, which works out to about 250 hours of audio per dollar. Most users never pay anything: the free daily pool covers short clips, voice notes, and single recordings.
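The arithmetic can be checked with a small sketch that uses the two figures quoted in this FAQ (about 50 tokens per minute, 750,000 tokens per dollar). Rounding up to a whole token is an assumption, and actual metering may differ slightly because the per-minute rate is approximate.

```python
import math

TOKENS_PER_MINUTE = 50       # approximate rate quoted in this FAQ
TOKENS_PER_DOLLAR = 750_000  # paid rate quoted in this FAQ


def tokens_for_audio(minutes):
    """Estimated tokens for a recording, rounded up (rounding is assumed)."""
    return math.ceil(minutes * TOKENS_PER_MINUTE)


def dollars_for_audio(minutes):
    """Estimated cost in dollars once the free pool is used up."""
    return tokens_for_audio(minutes) / TOKENS_PER_DOLLAR


def hours_per_dollar():
    """Hours of audio that one dollar buys at the quoted rates."""
    return TOKENS_PER_DOLLAR / TOKENS_PER_MINUTE / 60
```

An hour of audio comes to 3,000 tokens, or $0.004 at the paid rate, and one dollar covers 250 hours.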

Question 9

Can I get word-level timestamps for Kikantoni audio?

Accepted Answer

Yes. Both segment-level (roughly every 10 to 30 seconds) and word-level timestamps are available. Word-level is the default for VTT/SRT subtitle export so the captions sync line by line. On the API, set timestamps="word" in the request body.
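If you want to build your own captions from word-level timestamps, the sketch below groups words into SRT cues. The input shape (a list of dicts with "word", "start", and "end" keys, times in seconds) is an assumption for illustration; check /api/ for the actual response fields. The cue format itself (index line, "HH:MM:SS,mmm --> HH:MM:SS,mmm", text, blank line) is standard SRT.

```python
def srt_timestamp(seconds):
    """Format seconds as the HH:MM:SS,mmm form used by SRT."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words, max_words=7, joiner=" "):
    """Group timed words into numbered SRT cues of up to max_words each."""
    cues = []
    starts = range(0, len(words), max_words)
    for index, i in enumerate(starts, start=1):
        group = words[i:i + max_words]
        start = srt_timestamp(group[0]["start"])   # first word opens the cue
        end = srt_timestamp(group[-1]["end"])      # last word closes it
        text = joiner.join(w["word"] for w in group)
        cues.append(f"{index}\n{start} --> {end}\n{text}\n")
    return "\n".join(cues)
```

Pass joiner="" when the transcript is in a script that does not separate words with spaces.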

Question 10

Is there an API for Kikantoni transcription?

Accepted Answer

Yes. POST audio (multipart/form-data, field name "file") to /v1/transcribe/ with language=yue — or omit the language parameter to let Whisper auto-detect. Returns JSON with the transcript, segments, timestamps, and speaker labels. Full reference and SDK snippets at /api/.
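As a rough sketch of the request described above, the following builds a multipart/form-data POST using only the Python standard library. The host name is a placeholder, sending language as a form field is an assumption, and authentication is left out because it is not described here; the reference at /api/ is the authority on all three.

```python
import uuid
import urllib.request
from typing import Optional

BASE_URL = "https://api.example.invalid"  # placeholder host, not the real one


def build_transcribe_request(audio: bytes, filename: str,
                             language: Optional[str] = "yue",
                             base_url: str = BASE_URL):
    """Build (but do not send) a POST to /v1/transcribe/."""
    boundary = uuid.uuid4().hex
    parts = []
    if language is not None:
        # Assumed to be a plain form field; omit it to let Whisper auto-detect.
        parts.append((
            f"--{boundary}\r\n"
            'Content-Disposition: form-data; name="language"\r\n\r\n'
            f"{language}\r\n"
        ).encode())
    # The audio goes in the field named "file", as stated in the answer above.
    parts.append((
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode() + audio + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode())
    return urllib.request.Request(
        base_url + "/v1/transcribe/",
        data=b"".join(parts),
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )
```

To send it, pass the returned object to urllib.request.urlopen and parse the response body as JSON. Passing language=None leaves the field out so that the language is auto-detected.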

Question 11

Can I translate the Kikantoni transcript into another language?

Accepted Answer

Yes — once transcription finishes, click Translate or paste the text into /translate/. Kikantoni pairs with every other language we support (200+). For meeting minutes pipe the transcript through /summarize/; for dubbing send it to /voice/tts/ to render audio in the target language.

Question 12

What if the Kikantoni audio is noisy or low-quality?

Accepted Answer

Whisper's noise training helps less at this tier: the bottleneck is the amount of Kikantoni audio Whisper saw during training, not noise. Clean studio audio still beats noisy audio, but neither will reach the accuracy you would get on a high-resource language. If a transcript comes back unusable, email contact@free.ai with the file; we will refund the tokens and check whether a different engine handles your audio better.

Language	Kikantoni
Language code	`yue`
AI model	Whisper turbo
Price	Free
