Quick Start¶

Use the gateway with curl, Python, or JavaScript once DeltaLLM is running and at least one model is available.

This page is the main "how do I call the gateway?" guide for first-time users.

1. Start the Gateway¶

After completing installation or Docker, make sure the backend is running:

# Optional: start Redis for distributed caching and rate limiting
redis-server --daemonize yes

python -m uvicorn src.main:app --host 0.0.0.0 --port 8000 --reload

2. Verify It's Running¶

curl http://localhost:8000/health/liveliness

Expected response:

{
  "status": "ok"
}

3. List Available Models¶

curl http://localhost:8000/v1/models \
  -H "Authorization: Bearer YOUR_MASTER_KEY"

Note

If the response contains no models, either: 1. enable general_settings.model_deployment_bootstrap_from_config: true in config.yaml and restart once, or 2. create a model deployment from the Admin UI first.

The starter config.example.yaml includes a sample gpt-4o-mini deployment for this bootstrap flow.

4. Make a Chat Request¶

Use the standard OpenAI chat completions format:

curl -X POST http://localhost:8000/v1/chat/completions \
  -H "Authorization: Bearer YOUR_MASTER_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o-mini",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

5. Use with the OpenAI SDK¶

Point any OpenAI SDK client at DeltaLLM:

PythonJavaScript

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="YOUR_MASTER_KEY",
)

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "http://localhost:8000/v1",
  apiKey: "YOUR_MASTER_KEY",
});

const response = await client.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "Hello!" }],
});
console.log(response.choices[0].message.content);

6. Create a Virtual API Key¶

Instead of sharing the master key, create scoped virtual keys:

curl -X POST http://localhost:8000/ui/api/keys \
  -H "Authorization: Bearer YOUR_MASTER_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "key_name": "my-app-key",
    "max_budget": 10.00
  }'

The response includes a raw_key — use this as the API key for your application.

7. Next Admin Flows¶

The current getting-started build also supports:

invitation-based onboarding from People & Access
password reset from the browser login page
SSO login with MFA-aware session enforcement when SSO is configured
opt-in governance notifications for budget thresholds and key lifecycle events when email delivery is enabled