TokenHub uses the sameDocumentation Index
Fetch the complete documentation index at: https://docs.inferoute.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
model parameter you already know from the OpenAI API. You pass a model name in your request, and TokenHub resolves it to the appropriate provider and model version. You can be as specific as you want — pinning a request to a particular provider and model — or as general as you want, delegating the selection to TokenHub entirely.
Model naming formats
TokenHub supports three model naming formats.Provider-prefixed names
Fully qualified names that pin a request to a specific provider. Use this format when you need to guarantee which provider handles the request.Short aliases
Shorthand names that map to a canonical model across providers. TokenHub resolves the alias to the best available endpoint for that model.| Alias | Resolves to |
|---|---|
gpt-4o | OpenAI GPT-4o |
gpt-4 | OpenAI GPT-4 |
gpt-3.5-turbo | OpenAI GPT-3.5 Turbo |
claude-3-5-sonnet | Anthropic Claude 3.5 Sonnet |
claude-3-opus | Anthropic Claude 3 Opus |
claude-3-haiku | Anthropic Claude 3 Haiku |
gemini-1.5-pro | Google Gemini 1.5 Pro |
gemini-1.5-flash | Google Gemini 1.5 Flash |
mistral-large | Mistral Large |
Auto selection
Settingmodel to "auto" tells TokenHub to pick the most suitable model for your request based on its content, your active routing strategy, and current provider availability.
Code examples
The following examples show how to use each naming format in a standard chat completion call.Model capabilities
Different models support different features. The table below summarizes key capabilities for the models available on TokenHub.| Model | Context window | Multimodal input | Max output tokens |
|---|---|---|---|
| GPT-4o | 128K tokens | Images, audio | 16K tokens |
| GPT-4 | 128K tokens | Images | 8K tokens |
| GPT-3.5 Turbo | 16K tokens | — | 4K tokens |
| Claude 3.5 Sonnet | 200K tokens | Images, documents | 8K tokens |
| Claude 3 Opus | 200K tokens | Images, documents | 4K tokens |
| Claude 3 Haiku | 200K tokens | Images | 4K tokens |
| Gemini 1.5 Pro | 1M tokens | Images, video, audio | 8K tokens |
| Gemini 1.5 Flash | 1M tokens | Images, video, audio | 8K tokens |
| Mistral Large | 128K tokens | — | 4K tokens |
| Llama 3.1 70B | 128K tokens | — | 4K tokens |
GET /v1/models endpoint.