Free AI Hosting | Free.ai

Host AI models for free. GPU access, API hosting, and cloud deployment.

Được lưu trên đám mây

Dùng cơ sở hạ tầng Free.ai. Không cài đặt, không bảo trì. Tất cả các mô hình được tải sẵn và sẵn sàng sử dụng qua API hoặc giao diện người dùng web.

Có sẵn

Docker tự lưu

Chạy mô hình AI mã nguồn mở của chúng tôi trên phần cứng của bạn. Docker images with GPU support, optimized for inference.

Dịch vụ tự phục vụ

Được quản lý riêng

Máy chủ GPU chuyên dụng do chúng tôi quản lý, được bố trí ở vùng đám mây của bạn, tách dữ liệu hoàn toàn và SLA tùy chỉnh.

Enterprise

Tự quản lý triển khai

Tất cả các mô hình của chúng tôi là mã nguồn mở (Apache 2. 0 / MIT). Bạn có thể chạy chúng trên cơ sở hạ tầng GPU của riêng mình:

# Pull and run a model with Docker
docker pull ghcr.io/free-ai/inference:latest
docker run --gpus all -p 8000:8000 ghcr.io/free-ai/inference:latest \
  --model qwen2.5-72b --quantization awq

Yêu cầu tối thiểu

CPU NVIDIA với 24GB+ VRAM (RTX 4090, A5000, A100)
CUDA 12.0+ và Docker với bộ công cụ chứa NVIDIA
16GB+ RAM hệ thống, 100GB+ lưu trữ mỗi mẫu
Đối với các mẫu tham số 72B: 80GB VRAM (A100) hoặc cài đặt đa GPU

Tại sao lại là Self-Host?

Bảo mật dữ liệu — Your data never leaves your servers
Không giới hạn tốc độ — Unlimited inference on your hardware
Chấp hành — Meet data residency requirements

Tự chọn — Fine-tune models on your data
Kiểm soát chi phí — Fixed hardware costs, no per-token fees
Khoảng cách không khí — Runs fully offline

View Pricing API Docs

FAQ

Three options: Cloud Hosted (use our infrastructure, zero setup), Docker Self-Hosted (run models on your own GPU hardware), and Managed Private (dedicated GPU servers managed by us in your preferred region).

You need an NVIDIA GPU with 24GB+ VRAM (RTX 4090, A5000, A100), CUDA 12.0+, Docker with NVIDIA Container Toolkit, 16GB+ system RAM, and 100GB+ storage per model. For 72B parameter models, you need 80GB VRAM or a multi-GPU setup.

Yes. Self-hosted deployments run fully offline once the Docker images and model weights are downloaded. This is ideal for air-gapped environments and sensitive data processing.

Pull our Docker image and run it with GPU support. The command is: docker run --gpus all -p 8000:8000 ghcr.io/free-ai/inference:latest --model qwen2.5-72b --quantization awq. The container handles model loading and serves an API endpoint.

All self-hosted models use permissive open-source licenses -- Apache 2.0, MIT, or BSD. You can use them commercially without restrictions. We deliberately exclude models with restrictive licenses like Meta's Llama license.

Managed private hosting gives you dedicated GPU servers in your preferred cloud region, fully managed by our team. We handle setup, patching, model updates, and monitoring. You get full data isolation with an enterprise SLA.

Yes. Since all models are open-source, you can fine-tune them on your own data using standard training frameworks like Hugging Face Transformers. Our Docker images are compatible with popular fine-tuning tools.

Contact our sales team to discuss a trial period. We typically offer a short evaluation period for enterprise prospects to test managed private hosting before committing to a long-term plan.

Cloud hosting uses the standard token-based pricing. Self-hosted is free -- you only pay for your own hardware and electricity. Managed private hosting is priced based on GPU allocation, region, and SLA level.

Yes. You can self-host specific models for high-volume or sensitive workloads while using the Free.ai cloud for everything else. The API format is identical, making it easy to route requests between your infrastructure and ours.

We provide documentation, Docker images, and community support for self-hosted deployments. Managed private hosting includes full technical support, monitoring, and a dedicated account manager.

Cloud hosted is best for teams that want zero maintenance. Self-hosted is ideal for data privacy, compliance, or unlimited usage on your own hardware. Managed private is the best of both worlds -- full data isolation with no operational burden.

Free AI Hosting | Free.ai

Được lưu trên đám mây

Docker tự lưu

Được quản lý riêng

Tự quản lý triển khai

Yêu cầu tối thiểu

Tại sao lại là Self-Host?

FAQ

Free.ai có những lựa chọn gì?

Điều kiện phần cứng tối thiểu cho tự lưu trữ là gì?

Tôi có thể chạy Free.ai mẫu mà không có kết nối internet không?

Làm thế nào để tôi triển khai một thực thể tự lưu trữ?

Giấy phép nào áp dụng cho các mẫu tự lưu trữ?

Điều gì là lựa chọn lưu trữ riêng?

Tôi có thể điều chỉnh các mô hình trong một thiết lập tự lưu trữ không?

Có một thử nghiệm miễn phí cho quản lý lưu trữ không?

Giá cả làm việc như thế nào cho tự chủ vs đám mây?

Tôi có thể kết hợp sử dụng đám mây và tự lưu trữ không?

Hỗ trợ gì có sẵn cho việc triển khai tự chủ?

Làm sao tôi chọn được nơi ở?

Lấy 10.000 token miễn phí

Chờ đã — Cầm 10K token miễn phí!

Muốn thêm nữa không?