> ## Documentation Index
> Fetch the complete documentation index at: https://friendli.ai/docs/llms.txt
> Use this file to discover all available pages before exploring further.

# Serving MoE Models

> Serve Mixture of Experts (MoE) models like Mixtral 8x7B with Friendli Container. Covers policy search setup and multi-GPU Docker configuration.

## Introduction

This guide explores the steps to serve Mixture of Experts (MoE) models such as Mixtral 8x7B using Friendli Container.

## Search Optimal Policy and Run Friendli Container

To serve MoE models efficiently, you need to run a policy search to find the optimal execution policy.
Learn how to run the policy search at [Running Policy Search](/guides/container/optimizing-inference-with-policy-search#running-policy-search).
Once the search finds an optimal policy, it compiles the policy into a file that you can use to create serving endpoints.
The engine then serves the endpoint using the optimal policy.
