Skip to main content

Base URL

https://api.openinference.xyz/v1

Authentication

Every request must include your API key in the Authorization header.
Authorization: Bearer $OPENINFERENCE_API_KEY
You can create and manage API keys from the Open Inference console.

Endpoints

Chat completions

Generate a chat response from a model given a list of messages.

Completions

Generate a text completion given a prompt.

OpenAI compatibility

The Open Inference API is compatible with the OpenAI SDK. Point base_url to https://api.openinference.xyz/v1 and use your Open Inference API key.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.openinference.xyz/v1",
    api_key="your-api-key",
)

Rate limits

Rate limits depend on your plan. Contact sales for details on dedicated capacity.

Errors

The API returns standard HTTP status codes.
CodeDescription
401Invalid or missing API key
429Rate limit exceeded
500Internal server error