Fal Speech-to-Text

Free.ai · stt · ~500 אסימונים לכל minute

העלה שמע

זרוק קובץ שמע או וידאו, או הדבק כתובת למטה

~500 אסימונים לכל minute

פועל בחינם על GPUs שלנו. שדרוג עבור Fal Speech-to-Text →

Fal Speech-to-Text is a דגם דיבור אל טקסט. Routed through external models — ~500 tokens לדקה (50% markup over upstream cost).

השתמש באמצעות API

OpenAI-compatible REST API. Generate a key and call this model in seconds.

curl -X POST https://api.free.ai/v1/stt/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"model":"premium/speech-to-text","audio_url":"https://..."}'

תיעוד API קבל מפתח API

Similar models

ElevenLabs STT

Wizper (Whisper v3)

Browse all models →

שאלות ששואלים לעתים קרובות

Fal Speech-to-Text transcribes spoken audio into text. Upload an MP3, WAV, M4A, or video file and Fal Speech-to-Text returns the full transcript plus optional SRT/VTT subtitles with timestamps.

Fal Speech-to-Text handles dozens of languages — Whisper-family models cover 90+, Parakeet covers ~25, others vary. Pick "auto-detect" or specify the language for highest accuracy.

שיעור המילים הוא 5°10% על אודיו אנגלי נקי, 10^20% על אודיו רועש או מבטא. גרסאות גדולות של אותה ארכיטקטורה משתפרות באופן משמעותי במקרים קשים □ לבחור גדול יותר כאשר האודיו הוא מחוספס.

כן, כל קטע כולל חותמת זמן התחלה/ סוף. יצוא כ- SRT או VTT ו- Timemap ישירות על גבי הווידאו שלך.

הוא מנוע תמליל פרמיה, בערך 500,500 אסימונים לדקה של אודיו. 1 = 750,000 אסימונים.

MP3, WAV, M4A, FLAC, OGG, בתוספת וידאו (MP4, MOV, WebM) □ אנו לחלץ את האודיו. מקס 500 MB לטעינה. קבצים ארוכים יותר? פצל עם / audio/cut / או להשתמש /v1/st/batch/.

Diarization רמקולים (באנגלית: Speaker diarization) הוא מעבר נפרד, המייצג תוויות diarization כל קטע עם דובר 1 / יו"ר 2 / וכו'.

כל תמליל נוחת ב־ eccount/? tab=history with the original name. עבור שימור עץ התיקייה להשתמש ב־ API.

כן □ POST הקול שלך אל / v1/st/tranchive/ with model="Fal Speech-to-Text." Returns JSON with text + pages + word-late timestamps. / Api / has the full reference.

מודלים מארחים עצמיים שומרים אודיו על GPUs שלנו; פרמיה עוברת עם DPA. שמע נמחק לאחר חלון שיתוף (24h Anon, 7d חתם ב). אנחנו לא מתאמנים על הקלטים שלך.

כן, Free.ai מענקים שימוש מסחרי בתעתיקים. אתה צריך זכויות על השמע שהעלית (הקלטה משלך, חומר מורשה, או תוכן בהסכמה).

גורם בזמן אמת הוא בערך 0.050. 2× podcast 60 דקות trancripts in 3×12 minutes. דגמי Premium לעתים קרובות לסיים מהר יותר. השתמש בכפתור התור כדי לסגור את הלשונית.

Fal Speech-to-Text

השתמש באמצעות API

Similar models

שאלות ששואלים לעתים קרובות

What does Fal Speech-to-Text do?

כמה שפות תומכות?

How accurate is Fal Speech-to-Text?

Does Fal Speech-to-Text include timestamps?

כמה עולה כל דקה?

What audio formats can I upload to Fal Speech-to-Text?

Can Fal Speech-to-Text identify different speakers?

Can I batch transcribe with Fal Speech-to-Text?

Is there an API for Fal Speech-to-Text?

What about privacy when I transcribe with Fal Speech-to-Text?

Is Fal Speech-to-Text output safe for commercial use?

כמה זמן לוקח ל"פריי-איי-פי-אן"?

קבל 10,000 טוקנים חופשיים

חכו, קחו 10 אלף טוקנים חופשיים!

רוצה עוד?