Moondream 2

Free.ai (self-hosted) · vision · ~500 tokens per image

Drop a file here or click to browse

~500 tokens per image

Moondream 2 is a vision model built by M87 Labs. Strongest at Low-VRAM image captioning, fast OCR fallback, edge VLM deployments.. Self-hosted on Free.ai GPUs — runs free against your daily token pool (500 tokens per image). Released under Apache 2.0 — commercial use permitted on Free.ai.

Use via API

OpenAI-compatible REST API. Generate a key and call this model in seconds.

curl -X POST https://api.free.ai/v1/image/describe/ \
  -H "Authorization: Bearer sk-free-..." \
  -H "Content-Type: application/json" \
  -d '{"model":"moondream2","prompt":"your prompt here"}'
API Documentation Get API Key

Frequently Asked Questions

Moondream 2 is a vision model built by M87 Labs. Strongest at Low-VRAM image captioning, fast OCR fallback, edge VLM deployments.. Self-hosted on Free.ai GPUs — runs free against your daily token pool (500 tokens per image). Released under Apache 2.0 — commercial use permitted on

Moondream 2 works well for Low-VRAM image captioning, fast OCR fallback, edge VLM deployments.. Try the sample prompts above to see its style.

About 500 tokens per average message. $1 buys 750,000 tokens, so even paid models cost cents per chat. Free accounts get a 30,000-token daily pool.

It depends on the task. /chat/compare/ lets you send the same prompt to Moondream 2 and any other model side-by-side — comparison is the fastest way to decide.

Yes. Outputs are yours — Free.ai does not claim rights to anything you generate. The underlying model is Apache 2.0-licensed.

32,768 tokens.

Replies stream token-by-token within ~1 second. Total response time depends on length and model size — small models stream faster, frontier models trade speed for depth.

Yes. Signed-in users see every chat in /account/?tab=history. You can also share a one-link copy of any conversation via the Share button.

Free.ai does not train models on your conversations. Self-hosted models stay on our GPUs. Premium models route to the upstream provider for inference.

Yes. POST to /v1/chat/ with model="moondream2" and a messages array. Streaming SSE is supported. Full reference: /api/.

Moondream 2 is Apache 2.0-licensed with 2B parameters. See /models/moondream2/ for setup notes and our open-source repos at github.com/freeaigit.

Free accounts get a 30,000-token daily pool. When that runs out, top up starting at $1 (750K tokens) — no subscription required.

Love Free.ai? Tell your friends!

Rate this page