Балачки

Немає попередніх балачок

NVIDIA ~191 contains/ msg
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Привіт! NVIDIA: Llama 3.3 Nemotron Super 49B V1.5. Спроси меня о чем угодно.

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 потребує придбаних жетонів. Отримати позначки | Sigup ⇩ 10K Free | Замість цього скористайтеся вільною моделлю
All models with one subscription — see plans →
~191 contains/ msg Введіть, щоб надіслати
Подробиці моделі

Подробиці моделі

Увімкнено NVIDIA
Категорія Chat
Контекст 131072 tokens
Вартість ~191 contains/ msg
4.3 from 10 users of this category

Про програму

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 is a chat model built by NVIDIA. It accepts up to 131K tokens of context per request. Routed through external models — ~191 tokens per message (50% markup over upstream cost).

Використовувати через API

curl https://api.free.ai/v1/chat/ \
  -H "Authorization: Bearer YOUR_KEY" \
  -d '{"model":"nvidia/llama-3.3-nemotron-super-49b-v1.5"}'
Документи API

ЧаП

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 is a chat model built by NVIDIA. It accepts up to 131K tokens of context per request. Routed through external models — ~191 tokens per message (50% markup over upstream cost).

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 works well for general conversation, writing assistance, brainstorming, code help, and analysis. Try the sample prompts above to see its style.

About 191 tokens per average message. $1 buys 750,000 tokens, so even paid models cost cents per chat. Free accounts get 10,000 signup tokens plus a daily pool.

It depends on the task. /chat/compare/ lets you send the same prompt to NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 and any other model side-by-side — comparison is the fastest way to decide.

Yes. Outputs are yours — Free.ai does not claim rights to anything you generate.

131,072 tokens.

Replies stream token-by-token within ~1 second. Total response time depends on length and model size — small models stream faster, frontier models trade speed for depth.

Yes. Signed-in users see every chat in /account/?tab=history. You can also share a one-link copy of any conversation via the Share button.

Free.ai does not train models on your conversations. Self-hosted models stay on our GPUs. Premium models route to the upstream provider for inference.

Yes. POST to /v1/chat/ with model="nvidia/llama-3.3-nemotron-super-49b-v1.5" and a messages array. Streaming SSE is supported. Full reference: /api/.

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 is a premium model served by an external provider, so self-hosting is not available. Free.ai exposes it through token-based pricing.

Free accounts get 10,000 signup tokens plus a daily pool. When that runs out, top up starting at $1 (750K tokens) — no subscription required.

Love this tool? Share it!

Оцінити цю сторінку