Chats

No previous chats

NVIDIA ~500 tokens/msg
NVIDIA: Nemotron Nano 9B V2

Hi! I'm NVIDIA: Nemotron Nano 9B V2. Ask me anything.

NVIDIA: Nemotron Nano 9B V2 requires purchased tokens. Get Tokens | Sign Up — 10K Free | Use Free Model Instead
All models with one subscription — see plans →
~500 tokens/msg Enter to send
Model Details

Model Details

Hosted on NVIDIA
Category Chat
Context 128000 tokens
Cost ~500 tokens/msg
4.3 from 10 users of this category

About

NVIDIA: Nemotron Nano 9B V2 is a chat model built by NVIDIA. It accepts up to 128K tokens of context per request. Routed through external models — ~500 tokens per message (50% markup over upstream cost).

Use via API

curl https://api.free.ai/v1/chat/ \
  -H "Authorization: Bearer YOUR_KEY" \
  -d '{"model":"nvidia/nemotron-nano-9b-v2:free"}'
API Docs

FAQ

NVIDIA: Nemotron Nano 9B V2 is a chat model built by NVIDIA. It accepts up to 128K tokens of context per request. Routed through external models — ~500 tokens per message (50% markup over upstream cost).

NVIDIA: Nemotron Nano 9B V2 works well for general conversation, writing assistance, brainstorming, code help, and analysis. Try the sample prompts above to see its style.

About 500 tokens per average message. $1 buys 750,000 tokens, so even paid models cost cents per chat. Free accounts get 10,000 signup tokens plus a daily pool.

It depends on the task. /chat/compare/ lets you send the same prompt to NVIDIA: Nemotron Nano 9B V2 and any other model side-by-side — comparison is the fastest way to decide.

Yes. Outputs are yours — Free.ai does not claim rights to anything you generate.

128,000 tokens.

Replies stream token-by-token within ~1 second. Total response time depends on length and model size — small models stream faster, frontier models trade speed for depth.

Yes. Signed-in users see every chat in /account/?tab=history. You can also share a one-link copy of any conversation via the Share button.

Free.ai does not train models on your conversations. Self-hosted models stay on our GPUs. Premium models route to the upstream provider for inference.

Yes. POST to /v1/chat/ with model="nvidia/nemotron-nano-9b-v2:free" and a messages array. Streaming SSE is supported. Full reference: /api/.

NVIDIA: Nemotron Nano 9B V2 is a premium model served by an external provider, so self-hosting is not available. Free.ai exposes it through token-based pricing.

Free accounts get 10,000 signup tokens plus a daily pool. When that runs out, top up starting at $1 (750K tokens) — no subscription required.

Love Free.ai? Tell your friends!

Rate this page