Skip to main content
Let your team focus on building great AI products.
FriendliAI will make sure your AI runs fast, affordable, and reliable at scale.

Start building

For teams requiring production-scale AI without infra worries:

Friendli Dedicated Endpoints QuickStart

Reliable, high-performance inference with dedicated GPU resources.
Predictable, efficient scaling with full observability at scale.
For teams seeking instant access to popular models:

Friendli Serverless Endpoints QuickStart

Instant API access to popular open-source models.
Fast, affordable inference with simple pay-as-you-go pricing.
For teams prioritizing security and compliance:

Friendli Container QuickStart

On-premise, containerized solutions with data protection and governance controls.
Kubernetes-native, designed for enhanced privacy, security, and governance.

Resources

Friendli SDK Guide

Learn how to interact with Friendli products programmatically via the official Python SDK.

Friendli Suite Guide

Learn how to use Friendli Suite, our all-in-one platform with a feature-rich web console.

Model Library

Browse 520K+ models supported by Friendli.

API Reference

API references for all endpoints.

Tutorial

Build AI agents with Friendli products.

Blog

Check technical insights from the Friendli team.