Skip to main content

Ollama

Ollama is a first-class Agent > Models provider in Fased. It uses Ollama’s native API, not the OpenAI-compatible /v1 endpoint. Use a base URL like:
http://127.0.0.1:11434
Do not use:
http://127.0.0.1:11434/v1
The native API keeps tool calls and streaming behavior aligned with Ollama.

Quick start

  1. Install and start Ollama.
  2. Pull a model:
ollama pull llama3.3
  1. Open Agents > selected Agent > Models > Ollama.
  2. Enter the native base URL, model ID, and optional API key.
  3. Assign the model to the Agent’s primary/fallback/task roles.
For local-only Ollama, the API key can be blank in the UI. Fased stores a local non-secret marker so the provider can be registered without exposing a real credential to the Agent runtime.

Discovery

When Ollama is explicitly configured, Fased can discover local models from:
GET http://127.0.0.1:11434/api/tags
The discovered model IDs appear as ollama/<model-id> refs. Costs are recorded as 0 because local Ollama does not bill through a hosted provider.

Config shape

{
  models: {
    providers: {
      ollama: {
        baseUrl: "http://127.0.0.1:11434",
        api: "ollama",
        apiKey: "ollama-local",
        request: { allowPrivateNetwork: true },
        models: [
          {
            id: "llama3.3",
            name: "llama3.3",
            reasoning: false,
            input: ["text"],
            cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
            contextWindow: 128000,
            maxTokens: 8192,
          },
        ],
      },
    },
  },
}
request.allowPrivateNetwork: true is required because Ollama usually runs on loopback, LAN, or a private tailnet host.

Cloud and hybrid

If you use Ollama Cloud or a remote Ollama host that requires auth, provide a real OLLAMA_API_KEY or save the key in Agent > Models. For a private LAN or tailnet host, keep the native base URL and explicitly approve private-network access.