Podcast transcription

Ho sebelisoa ka khoebo 380+ li-models Ha ho letšoao la metsi Ha ho hlokahale ho ngolisoa
Mofuta:
+ GPT-5, Claude, Gemini
Kopitsa lihlooho tsa podcast'me u fumane transcript e hloekileng, e nang le letšoao la mohlophisi le nang le li-markers tsa lihlooho tse fumanehang ka ho toba ho tloha lipakeng tsa li-silence. Lifaele tsa li-form tse telele ho fihlela ho 2GB, lipuo tse 99, ho nepahala ha Whisper-large-v3. Eba le SRT / VTT bakeng sa podcast ea hau ea video, TXT e tloaelehileng bakeng sa litlhaku tsa ho bonts'a, kapa JSON bakeng sa ho hlophisa ka ho sebetsa ka mokhoa oa Descript.

Tlosa le ho beha lihlooho tsa hau tsa podcast, kapa tobetsa ho sheba

MP3, WAV, M4A, OGG, MP4 — lihlooho tse telele ho fihlela ho 2GB

Li-markers tsa lihlooho li hlophisitsoe ka lehlakoreng la mosebelisi ho tloha lipakeng tsa li-segments'me li hokahantsoe le transcript. Li kopanya ho YouTube kapa ho Spotify ka tsela eo li leng ka eona.
Token e tlalehiloeng bakeng sa sehlooho sena
Podcast transcript
Lihlooho tse fumanehang ka ho toba

Ho ngola podcast ea hau...

Liketsahalo tse telele li nka metsotso e mengata. U ka koala sena tab haeba lengolo-tsoibila-ha-e-na-e

E entsoe bakeng sa batsamaisi ba podcast + ba bonts'ang

Bontša litlhaku ka ho kenya e le 'ngoe

Ho kenya lihlooho tsa sehlooho, ho kenya TXT. Li-labels tsa sehlooho li hokahantsoe, li-timestamps tsa sehlooho li loketse ho hlalosa Spotify / YouTube, poso ea blog e ngotsoe metsotso e 10 eseng lihora tse 4.

Lihlooho tsa podcast ea video

Eba le SRT kapa WebVTT le li-labels tsa mohlophisi. Eba le Premiere, Final Cut, kapa DaVinci Resolve — kapa u arolelane le video ea hau ea YouTube bakeng sa li-caption tse hloekileng.

Ho hlophisa lihlooho tse nang le lihlooho tse nang le lihlooho

JSON ho tsoa ho u fa mantsoe ohle le ho qala / ho fela ha li-timestamp. Pipe ho Descript, Reaper, kapa ho tsamaea ha mosebetsi oa hau — ho hlophisa ka ho bonts'a mongolo ho fapana le ho hloekisa.

Ho ngola podcast ho sebetsa joang

  1. E-ea ho sebaka sa ho tlosa - MP3, WAV, M4A, MP4, ho fihlela ho 2GB.
  2. E-ba le li-labels tsa moqoqi le li-markers tsa karolo li hokahane (li difaele tsa difaele). Khetha mofuta oa hau oa tlhahiso.
  3. Re hlahloba nako ea ho sebetsa + theko ea eona pele o sebelisa li-token. Tobetsa ho ngola.
  4. Tlosa TXT, SRT, VTT, kapa JSON e nang le letšoao la mohlophisi. Li-markers tsa lihlooho li romelloa ka lehlakoreng le leng, li loketse ho kenya.

Free.ai podcast transcription vs Descript, Riverside, Otter

Sebopeho sa leqephe Free.ai Descript Riverside Otter.ai
ThekoPay-per-use ($0.003/min)$15-30/mo$19/mo$16.99/mo
Boholo ba faele bo boholo2 GB5 GBTied to record session500 MB (varies)
Ho ngola lingoloa tsa moqoqi
Li-markers tsa lihlooho ka ho toba (e thehiloeng ho khutso)ManualPaid tier
SRT/VTT ho tsoaPaid
Li-languages9922100+English-focused
API ea sechabaLimited
Theko ea bafani e bonts'a li-tiers tse boletsoeng ka bongata ka 2026. Check provider ka mong bakeng sa liqeto tsa hona joale.
Likhetho tse tsoetseng pele
Bo_lemo
Tokens e tlase. Fumana Token e eketsehileng
U batla liphetho tse ntle? Li-models tsa Premium (GPT-5, Claude, Gemini) fana ka boleng bo phahameng. Bona Litlhophiso

❤️ U rata Free.ai? Reka le metsoalle ea hau!

Register ho fumana sehokela sa ho u joetsa le ho fumana li-token tse 25 000 ka motsoalle.

U batla ho feta? Ngola mahala bakeng sa 30K tokens / letsatsi + 10K bonus
Ngola mahala

Ho sebetsana le kopo ea hau...

Ho ngola podcasts ho ea ho mongolo ka AI ka mahala. Li-labels tsa motsamaisi, li-markers tsa karolo, ho romelloa ha SRT.

Mokhoa oa ho sebelisa Podcast transcription

1
Ke eng eo u e kentseng?

Tlatsa mongolo, kenya faele, kapa hlalosa seo u se batlang. Ha ho hlokahale ak'haonte.

2
Tobetsa ho theha

AI ea rona e sebetsana le kopo ea hau ka metsotsoana ka ho sebelisa li-models tse ntlehali tsa open-source.

3
Tlosa & & arolelana

Kopitsa, kenya kapa arolelana litlamorao tsa hau. Haholo-holo bakeng sa ho sebelisana le batho ba bang le ho rekisa.

Senya sesebelisoa ka API

E-ba le sesebelisoa sena ka ho iketsetsa ho tloha ho kotloloho. OpenAI-compatible REST endpoint, Bearer-token auth, ha ho hlokahale SDK e eketsehileng. Litheko tsa token li lumellana le interface ea webosaeteng.

curl -X POST https://api.free.ai/v1/stt/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"file": "@audio.mp3", "language": "auto"}'

Podcast transcription — FAQ

Mochini oa podcast o tloaetse ho bua ka li-diaries le li-markers tsa lihlooho (ho fumana li-gap tsa ho khutla> 2s),'me o fana ka li-file tsa li-form tse telele ho fihlela ho 2GB. Li-formats tsa tlhahiso li kenyelletsa SRT + VTT bakeng sa li-video tsa li-notes, TXT e tloaelehileng bakeng sa li-blog posts, le JSON e hlophisitsoeng le li-timestamps tsa li-turn + li-labels tsa moqoqo bakeng sa ho hlophisa li-workflows tsa Descript-style.

Ho fihlela ho 2GB ka faele - ka lihora tse 14 tsa podcast ea audio ho 128 kbps MP3. Lifaele tse telele li kopanngoa ka lehlakoreng la mosebelisi bakeng sa ho hlaphoheloa; u tla fumana transcript e le 'ngoe e kopantsoeng morao.

E.

Li-gaps tsa ho se utloe li telele ho feta li-seconds tse 2 — li-podcasters li sebelisa li-pauses tsa tlhaho lipakeng tsa li-segments. Sehlooho se seng le se seng se fumana timestamp eo u ka e beha kapele ho li-notes tsa hau tsa ho bonts'a ka "Sehlooho:" block bakeng sa YouTube + Spotify.

Descript e lefa $ 15-$ 30 ka khoeli bakeng sa lihora tsa 10 tsa ho ngola, tse hokahaneng le moqapi oa bona. Re lefa ka ho sebelisa li-token tsa ~ 500 / min ho Whisper ($ 5 = li-token tsa 200K = ~ metsotso e 400), ha ho na ho ingolisa, ho rekisa kantle ho naha u ka li kenya kae kapa kae.

Riverside ke studio ea ho ngola e ngolang liketsahalo tsa hau tsa hau ka ntle ho app ea bona, empa feela ka mor'a ho ngola le bona. Re ngola MP3 / WAV / MP4 efe kapa efe ntle le hore na e ngotsoe kae.

Otter e kopanya metsotso e 300 / khoeli ka tekanyo e mahala'me e khethehile ka Senyesemane. Re fana ka lingoliloeng tsa 99 ka bongata bo tšoanang ba Whisper-large-v3 ntle le tefo ea khoeli le khoeli - u lefa ka metsotso e ngotsoeng.

Ee — khetha SRT kapa WebVTT e le mofuta oa tlhahiso. Li-labels tsa moqoqi li kenyelelitsoe ka har'a (SRT) kapa e le <v Moqoqi N> tags (VTT) tseo libapali tse ngata tsa morao-rao li li bonts'ang ka nepo.

Whisper-large-v3 e sebetsana le li-beds tsa moqoqo le ho khutlisa moputso o moholo (li-rate tsa phoso ea mantsoe tse tloaelehileng 3-7%). Moqoqo o moholo haholo kapa ho koaloa ha boima ho fokotsa botsitso - nahana ka ho kenya /music/vocal-remover/ pele ho kopitsa, kapa ho arolelana li-cold-open.

Whisper e sebetsana le mabitso a tloaelehileng ka ho fetisisa, empa jargon e ikhethang ea k'hamphani e ka hloka ho fetisoa ha ho hlophisoa. Episode ea ~ 30-minute e na le 5-10 brand / lebitso la ho hlophisoa ho sebelisoa ka letsoho.

E-ba le tsona ka nako e le 'ngoe mona, kapa sebelisa / batch / e sebetsang ha u ntse u kene ho qala nako. API ho / api / e amohela POST / v1 / stt / bakeng sa ho kenya ka nako e le 'ngoe.

Ha ho joalo. Lifaele tse kenang li tlosoa ha ho phetheloa ho ngola. Ho ngola ha hau ho lula ho /account/history ea hau bakeng sa ho kenya ha u kentse ka har'a tšebeletso; basebelisi ba sa tsejoeng ba fumana sehokela sa ho arolelana sa lihora tse 24.

Ngolisa mahala bakeng sa 30,000 tokens

E-ba le ak'haonte

Ha ho hlokahale karete ea mokitlane

U tla lekola eng ka sesebelisoa sena?

U rata Free.ai? Reka le metsoalle ea hau!