Chats

No previous chats

Meta ~360 tokens/msg
Meta: Llama 3.1 70B Instruct

Hi! I'm Meta: Llama 3.1 70B Instruct. Ask me anything.

Meta: Llama 3.1 70B Instruct requires purchased tokens. Get Tokens | Sign Up — 10K Free | Use Free Model Instead
All models with one subscription — see plans →
~360 tokens/msg Enter to send
Model Details

Model Details

Hosted on Meta
Category Chat
Context 131072 tokens
Cost ~360 tokens/msg
4.3 from 10 users of this category

About

Meta: Llama 3.1 70B Instruct is a chat model built by Meta. It accepts up to 131K tokens of context per request. Routed through external models — ~360 tokens per message (50% markup over upstream cost).

Use via API

curl https://api.free.ai/v1/chat/ \
  -H "Authorization: Bearer YOUR_KEY" \
  -d '{"model":"meta-llama/llama-3.1-70b-instruct"}'
API Docs

FAQ

Meta: Llama 3.1 70B Instruct is a chat model built by Meta. It accepts up to 131K tokens of context per request. Routed through external models — ~360 tokens per message (50% markup over upstream cost).

Meta: Llama 3.1 70B Instruct works well for general conversation, writing assistance, brainstorming, code help, and analysis. Try the sample prompts above to see its style.

About 360 tokens per average message. $1 buys 750,000 tokens, so even paid models cost cents per chat. Free accounts get 10,000 signup tokens plus a daily pool.

It depends on the task. /chat/compare/ lets you send the same prompt to Meta: Llama 3.1 70B Instruct and any other model side-by-side — comparison is the fastest way to decide.

Yes. Outputs are yours — Free.ai does not claim rights to anything you generate.

131,072 tokens.

Replies stream token-by-token within ~1 second. Total response time depends on length and model size — small models stream faster, frontier models trade speed for depth.

Yes. Signed-in users see every chat in /account/?tab=history. You can also share a one-link copy of any conversation via the Share button.

Free.ai does not train models on your conversations. Self-hosted models stay on our GPUs. Premium models route to the upstream provider for inference.

Yes. POST to /v1/chat/ with model="meta-llama/llama-3.1-70b-instruct" and a messages array. Streaming SSE is supported. Full reference: /api/.

Meta: Llama 3.1 70B Instruct is a premium model served by an external provider, so self-hosting is not available. Free.ai exposes it through token-based pricing.

Free accounts get 10,000 signup tokens plus a daily pool. When that runs out, top up starting at $1 (750K tokens) — no subscription required.

Love Free.ai? Tell your friends!

Rate this page