arXiv PDF Extractor

ប្រើ​ពាណិជ្ជកម្ម​បាន​ហើយ ម៉ូដែល 380+ គ្មាន​សញ្ញា​ទឹក គ្មាន​ការ​ចុះឈ្មោះ​ដែល​ត្រូវការ
ម៉ូដែល & # 160; ៖
+ GPT-5, Claude, Gemini
Drop an arXiv preprint, journal paper, or thesis chapter — AI converts it into clean LaTeX-flavored text. Math equations stay as equations, multi-column layouts get unwound, citations preserved. Powered by Meta Nougat-base.

Drop a research paper PDF here or click to upload

PDF up to 50 MB. ~300 tokens per page (math-aware).

Reading equations + unwinding columns… ~10 sec/page
ជម្រើស​កម្រិត​ខ្ពស់
លទ្ធផល
កំពុង​រត់​ថូខឹន​ទាប & # 160; ។ Get More Tokens
Want better results? ម៉ូដែល​ពិសេស (GPT-5, Claude, Gemini) deliver higher quality. View Plans

❤️ ស្រឡាញ់ Free.ai? ប្រាប់មិត្តភក្តិរបស់អ្នក!

ចុះឈ្មោះ ដើម្បីទទួលបានតំណភ្ជាប់យោងនិងរកប្រាក់ចំណេញ 25,000 រូបិយប័ណ្ណក្នុងមួយមិត្តភក្តិ.

ចង់​បាន​បន្ថែម​ទៀត​ឬ & # 160;? ចុះឈ្មោះដោយឥតគិតថ្លៃសម្រាប់ 5K រូបិយប័ណ្ណ / ថ្ងៃ + ប្រាក់រង្វាន់ 10K
ចុះឈ្មោះដោយឥតគិតថ្លៃ

កំពុង​ដំណើរការ​សំណើ​របស់​អ្នក...

Drop an arXiv preprint, get clean LaTeX-flavored text with every equation rendered inline. Multi-column layouts handled, references kept intact. Free, AI-powered.

របៀប​ប្រើ arXiv PDF Extractor

1
បញ្ចូល​ព័ត៌មាន​បញ្ចូល​របស់​អ្នក

វាយ​អត្ថបទ ផ្ទុក​ឯកសារ​ឡើង ឬ​ពិពណ៌នា​អំពី​អ្វី​ដែល​អ្នក​ចង់​បាន & # 160; ។ គ្មាន​គណនី​ដែល​ត្រូវការ & # 160; ។

2
ចុច​បង្កើត

AI របស់យើងដំណើរការសំណើរបស់អ្នកក្នុងរយៈពេលពីរបីវិនាទីដោយប្រើម៉ូដែលប្រភពបើកចំហល្អបំផុត។

3
ទាញយក និង​ចែករំលែក

ទាញយក ចម្លង ឬ ចែករំលែក​លទ្ធផល​របស់​អ្នក ។ ឥតគិតថ្លៃ​សម្រាប់​ការ​ប្រើ​ផ្ទាល់ខ្លួន និង​ពាណិជ្ជកម្ម ។

ប្រើ​ឧបករណ៍​នេះ​តាម​រយៈ API

ឧបករណ៍នេះដោយស្វ័យប្រវត្តិពីកូដផ្ទាល់ខ្លួនរបស់អ្នក. OpenAI-ឆបគ្នា REST ចំណុចបញ្ចប់, Bearer-token auth, មិនចាំបាច់បន្ថែម SDK. តម្លៃ Token ផ្គូផ្គងចំណុចប្រទាក់បណ្ដាញ.

curl -X POST https://api.free.ai/v1/chat/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"model": "qwen7b", "messages": [{"role": "user", "content": "Use the arXiv PDF Extractor tool on: ..."}]}'

ឧបករណ៍ AI ឥតគិតថ្លៃ​ដែល​ទាក់ទង

arXiv PDF Extractor — FAQ

Drop in an arXiv preprint and the AI converts the entire paper into clean LaTeX-flavored text. Equations come back as proper LaTeX, multi-column layouts unwound, references intact. Built on Meta Nougat, trained specifically on millions of arXiv pages.

Nougat's training corpus was arXiv preprints — so it absolutely shines on the IEEE / ACM / NeurIPS / ICML / arXiv layout family. Other PDF extractors choke on multi-column math; this one was designed for it.

Download the PDF from arXiv (e.g. arxiv.org/pdf/2401.12345), upload it here, get back a single .txt file with the full paper as LaTeX-flavored text. No arXiv API key needed; we just need the PDF.

Yes — that's the headline feature. Inline math is `$...$`, displayed math `$$...$$`. Even raster-rendered equations in older papers come through correctly because the model treats each page as an image.

Auto-handled. Two-column IEEE-style is the most common arXiv layout and Nougat unwinds it into proper reading order without a config flag.

Yes — inline `[12]` / `[Smith2020]` markers stay where they belong, and the full reference list at the end is extracted intact for downstream BibTeX / Zotero use.

~8-15 sec/page. A 12-page conference paper takes ~2-3 min. NeurIPS-style 30+ page papers with appendices: 8-12 min. Submit and walk away.

300 tokens/page, floor 600. Most arXiv conference papers (8-15 pages) are 2,400-4,500 tokens. Daily free pool covers ~1-2 papers/day for signed-in users; paid plans get unlimited.

Feed it to ChatGPT / Claude for "explain this paper", build personal RAG over your saved papers, semantic-search your reading list, copy equations into your own LaTeX project, or read the paper as plain text on your phone.

Yes — Nougat OCRs internally. arXiv has been LaTeX-rendered for 25+ years so most preprints are clean digital. Older scanned papers work but math fidelity drops slightly; rescan at 300+ DPI for best results.

PDFs deleted right after extraction. LaTeX output is kept 24h (anonymous) / 7 days (paid share link). Never used for training. arXiv PDFs are public CC-BY anyway, but we don't store them either way.

Yes — POST multipart `file` to /v1/document/academic-pdf/. JSON response with `text_url`, `pages`, `preview`, `tokens`, `share_url`. Bearer auth (sk-free-…) gives 10K free tokens/month. /api/ for curl example.

ចុះឈ្មោះដោយឥតគិតថ្លៃសម្រាប់ 10,000 រូបិយប័ណ្ណ

បង្កើត​គណនី​ឥតគិតថ្លៃ

គ្មាន​កាត​ឥណទាន​ដែល​ត្រូវការ

តើ​អ្នក​វាយតម្លៃ​ឧបករណ៍​នេះ​យ៉ាង​ដូចម្តេច & # 160;?

ស្រឡាញ់ Free.ai? ប្រាប់មិត្តភក្តិរបស់អ្នក!