LiteLLM
LiteLLM is an open-source LLM gateway that provides a unified OpenAI-compatible API to many model providers. Route Fased through LiteLLM to centralize usage tracking, logging, virtual keys, and backend switching behind one proxy.Why use LiteLLM with Fased?
- Usage tracking — Review Fased traffic across the models your proxy exposes
- Model routing — Switch between Claude, GPT, Gemini, Bedrock, local models, and other upstreams behind one proxy
- Virtual keys — Create keys with spend limits and model access limits for Fased
- Logging — Full request/response logs for debugging
- Fallbacks — Automatic failover if your primary provider is down
Quick start
For normal browser setup, open Agents, select the Agent, then use Agent > Models > LiteLLM to add the proxy URL/key and choose the Agent’s model roles. Use the CLI examples below when provisioning repeatable local or hosted proxies.Via onboarding
Manual setup
- Start LiteLLM Proxy:
- Point Fased to LiteLLM:
litellm/default as the default initial ref, then refreshes or
accepts the model IDs exposed by your proxy.
Configuration
Environment variables
Config file
Virtual keys
Create a dedicated key for Fased with spend limits:LITELLM_API_KEY.
Model routing
LiteLLM can route model requests to different backends. Configure in your LiteLLMconfig.yaml:
default or gpt-5.5.
LiteLLM handles the upstream routing.
Model discovery
LiteLLM exposes the models available on your proxy through/models when configured for model discovery. That model list is local to your LiteLLM instance, not a global LiteLLM catalog.
fased providers refresh does not fetch a central LiteLLM model list. In the normal UI, LiteLLM is treated as a dynamic provider: litellm/<model-id> is allowed when your proxy exposes that model.
Viewing usage
Fased Usage shows the local model usage history for calls made by Fased, grouped by provider, model, Agent, task, channel, and session. LiteLLM’s dashboard or API shows proxy-level spend and quota from LiteLLM’s perspective:Notes
- LiteLLM runs on
http://localhost:4000by default - Fased connects via the OpenAI-compatible chat completions endpoint
- Capabilities depend on the upstream model and how your LiteLLM proxy exposes it