Chats

No previous chats

Sao10K ~2700 tokens/msg
Sao10K: Llama 3.1 70B Hanami x1

Hi! I'm Sao10K: Llama 3.1 70B Hanami x1. Ask me anything.

Sao10K: Llama 3.1 70B Hanami x1 requires purchased tokens. Get Tokens | Sign Up — 10K Free | Use Free Model Instead
All models with one subscription — see plans →
~2700 tokens/msg Enter to send
Model Details

Model Details

Hosted on Sao10K
Category Chat
Context 16000 tokens
Cost ~2700 tokens/msg
4.3 from 10 users of this category

About

Sao10K: Llama 3.1 70B Hanami x1 is a chat model built by Sao10K. It accepts up to 16K tokens of context per request. Routed through external models — ~2,700 tokens per message (50% markup over upstream cost).

Use via API

curl https://api.free.ai/v1/chat/ \
  -H "Authorization: Bearer YOUR_KEY" \
  -d '{"model":"sao10k/l3.1-70b-hanami-x1"}'
API Docs

FAQ

Sao10K: Llama 3.1 70B Hanami x1 is a chat model built by Sao10K. It accepts up to 16K tokens of context per request. Routed through external models — ~2,700 tokens per message (50% markup over upstream cost).

Sao10K: Llama 3.1 70B Hanami x1 works well for general conversation, writing assistance, brainstorming, code help, and analysis. Try the sample prompts above to see its style.

About 2,700 tokens per average message. $1 buys 750,000 tokens, so even paid models cost cents per chat. Free accounts get 10,000 signup tokens plus a daily pool.

It depends on the task. /chat/compare/ lets you send the same prompt to Sao10K: Llama 3.1 70B Hanami x1 and any other model side-by-side — comparison is the fastest way to decide.

Yes. Outputs are yours — Free.ai does not claim rights to anything you generate.

16,000 tokens.

Replies stream token-by-token within ~1 second. Total response time depends on length and model size — small models stream faster, frontier models trade speed for depth.

Yes. Signed-in users see every chat in /account/?tab=history. You can also share a one-link copy of any conversation via the Share button.

Free.ai does not train models on your conversations. Self-hosted models stay on our GPUs. Premium models route to the upstream provider for inference.

Yes. POST to /v1/chat/ with model="sao10k/l3.1-70b-hanami-x1" and a messages array. Streaming SSE is supported. Full reference: /api/.

Sao10K: Llama 3.1 70B Hanami x1 is a premium model served by an external provider, so self-hosting is not available. Free.ai exposes it through token-based pricing.

Free accounts get 10,000 signup tokens plus a daily pool. When that runs out, top up starting at $1 (750K tokens) — no subscription required.

Love Free.ai? Tell your friends!

Rate this page