model field in your API request. No client reconfiguration needed.
How routing works
Each registered model has a unique name. When Tokios receives a request, it reads themodel field, looks up the deployment with that name, and routes the request down that deployment’s tunnel. The API endpoint and API key stay the same regardless of which model you are targeting — only the model field in the request body needs to change.
Registering multiple models
Each model requires its own connector instance pointing at a different upstream. For example:- Run one connector for Ollama (port
11434) and name the deploymentgemma-tunnel - Run another connector for llama.cpp (port
8080) and name the deploymentllama3-local
tokios.json configuration file for each connector:
tokios-gemma.json
tokios-llama.json
Each connector instance needs its own
tunnel_token. You can generate additional tunnel tokens for each model from the Tokios dashboard.Switching models in requests
With both connectors running, switch between models by changing only themodel field in your request:
https://api.tokios.com/v1/chat/completions), the authentication header, and all other request parameters remain identical. Tokios handles the dispatch.