https://api.tokios.com with your sk-tokios-... key as a Bearer token. Tokios exposes three endpoints covering the OpenAI chat completions, OpenAI responses, and Anthropic messages surfaces.
Base URL
Authentication
Every request requires the following headers:POST /v1/chat/completions
OpenAI-compatible chat completions endpoint. Send a list of messages and receive a completion from your registered model.Request body
The registered deployment name (e.g.
"gemma-tunnel").Array of message objects. Each object must have:
role— one of"system","user", or"assistant"content— the message text (string)
Stream the response using server-sent events.
Sampling temperature between 0 and 2. Higher values produce more random output.
Maximum number of tokens to generate.
Example request
Example response
POST /v1/messages
Anthropic-compatible messages endpoint. Send a messages array and receive a response in Anthropic’s format.Request body
The registered deployment name.
Array of message objects. Each object must have:
role— one of"user"or"assistant"content— the message text (string)
Maximum number of tokens to generate.
System prompt to prepend to the conversation.
Stream the response using server-sent events.
Example request
Example response
POST /v1/responses
OpenAI responses API endpoint. Compatible with the newer OpenAI responses format.Request body
The registered deployment name.
The input text or array of input items to generate a response for.
Stream the response using server-sent events.
Example request
Example response
Error codes
| HTTP Status | Meaning |
|---|---|
| 401 | Invalid or missing API key |
| 403 | API key doesn’t have access to the requested model |
| 404 | Model deployment not found |
| 503 | Connector offline — the tunnel is not active |
| 429 | Rate limit exceeded |