Create your account
Sign up at tokios.com — it’s free and no credit card is required. After signing up you’ll land in the Tokios dashboard, where you manage connectors, registered models, and API keys.
Download and run the connector
Grab the
Create a minimal Then start the connector:You should see a confirmation line like
tokios-connector binary for your platform from the dashboard’s Downloads page.| Platform | Download |
|---|---|
| Windows x64 | tokios-connector.exe |
| macOS arm64 | tokios-connector |
| Linux x64 | tokios-connector |
tokios.json config file next to the binary, replacing <your-tunnel-token> with the tunnel token shown in the dashboard:tokios.json
11434 is Ollama’s default port. If you’re running llama.cpp, vLLM, or LM Studio, update upstream to match their listening address (e.g. http://localhost:8080).[tokios] tunnel connected in your terminal. The connector is now holding an outbound WebSocket to api.tokios.com — no inbound ports were opened on your machine.Register your model
In the Tokios dashboard, navigate to Connectors and click Pair Connector. Select the connector that just came online and give your model a name — for example,
gemma-tunnel.Tokios registers the model and assigns it a stable, cloud-reachable deployment identifier tied to your account. You’ll use this name as the model field in every API request.Mint an API key
Go to API Keys in the dashboard and click Generate Key. Tokios issues a tenant-scoped key in the form
sk-tokios-....Copy it now and store it somewhere safe (a password manager or secrets vault). The dashboard will not show it again after you close the dialog.What’s next?
Use with Claude Code
Point Claude Code’s API base URL at Tokios and run your AI coding agent entirely against your own local models.
API Endpoints Reference
Explore the full list of supported paths —
/v1/chat/completions, /v1/messages, /v1/responses — and their request/response schemas.