Skip to main content
Get started with Friendli Dedicated Endpoints. This quickstart walks you through launching your first endpoint and sending your first request.

1. Sign Up or Log In

Create an account or log in at Friendli Suite.

2. Navigate to the Dedicated Endpoints Page

Go to Dedicated Endpoints to see your endpoint list.

3. Prepare Your Model

Choose a model to serve from Hugging Face, or upload your own. Hugging Face

4. Deploy Your Endpoint

Deploy the model from step 3 with your chosen GPU. You can also configure replicas and optimization options.

5. Generate Responses

Generate responses in two ways: the playground or the endpoint URL. Use the playground tab to test your model in a chat-style interface. For programmatic use, send requests to your endpoint address—shown on the endpoint information tab—through our API. See this guide for how to create a Personal API key.
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.getenv("API_KEY"),
    base_url="https://api.friendli.ai/dedicated/v1",
)

chat_completion = client.chat.completions.create(
    model="YOUR_ENDPOINT_ID",
    messages=[
        {
            "role": "user",
            "content": "Tell me how to make a delicious pancake"
        }
    ]
)
print(chat_completion.choices[0].message.content)
For a more detailed tutorial for your usage, please refer to our tutorial for using Hugging Face models.
Last modified on June 24, 2026