GOT-OCR v2
Free.ai
·
ocr
·
~500 tokens per page
GOT-OCR v2 is an OCR model. Routed through external models — ~500 tokens per page (50% markup over upstream cost).
Use via API
OpenAI-compatible REST API. Generate a key and call this model in seconds.
curl -X POST https://api.free.ai/v1/ocr/ \
-H "Authorization: Bearer sk-free-..." \
-H "Content-Type: application/json" \
-d '{"model":"premium/got-ocr/v2","prompt":"your prompt here"}'
API Documentation
Get API Key
Frequently Asked Questions
GOT-OCR v2 extracts text from images and PDFs. Upload a scanned page, a screenshot, or a photo of a document and GOT-OCR v2 returns the text — preserving line breaks and (on capable engines) tables, formulas, and structure.
GOT-OCR v2 handles dozens of scripts — Latin, Cyrillic, CJK, Arabic, Devanagari, and more. The /ocr/ landing has the full language list per engine.
Yes for clean printed-style handwriting; cursive and shorthand are harder. TrOCR and modern transformer-based engines are noticeably better than legacy Tesseract on handwriting.
Layout-aware engines (GOT-OCR, Marker, Florence-2) return Markdown/HTML tables and LaTeX for formulas. Plain-text engines return text without structure — check this model's output mode.
GOT-OCR v2 is a premium OCR engine. About ~500–1,500 tokens per page. $1 = 750,000 tokens.
PNG, JPG, WebP, HEIC, BMP, plus single- and multi-page PDFs up to 50 MB. Skewed or rotated pages are auto-corrected before extraction.
Character accuracy is typically >98% on clean printed pages, dropping on low-res photos, severely skewed scans, or unusual fonts. Compare engines on /ocr/compare/ when accuracy matters.
Yes — /batch/ accepts a folder of images/PDFs. Each extraction lands in /account/?tab=history. For folder-tree preservation use the API.
Yes — POST your file to /v1/ocr/ with model="GOT-OCR v2". Returns JSON with text + (when supported) layout + word-level bounding boxes. /api/ has the full reference.
Self-hosted models keep your file on our GPUs; premium pass through with a DPA. Documents are deleted after the share-window. We do not train on your inputs.
Yes — Free.ai grants commercial use of extracted text. You need rights to the source document.
1–5 seconds per page typically. Layout-aware engines (tables, math) take longer. Multi-page PDFs scale roughly linearly.