This page covers the errors and issues most commonly reported by TokenHub customers, along with the steps to resolve each one. If you cannot resolve an issue using the guidance below, collect the X-Inferoute-Request-Id from the failing response and contact support@tokenhub.ai.
Common errors
401 Unauthorized — invalid or missing API key
429 Too Many Requests — rate limit exceeded
Your API key has exceeded its requests-per-minute (RPM) or tokens-per-minute (TPM) limit.
Steps to resolve:
- Read the Retry-After header in the response. It tells you how many seconds to wait before retrying.
- Implement exponential backoff in your client code. See the Rate Limits page for Python and Node.js examples.
- Check your current usage against your limits in the Usage section of the dashboard.
- If you consistently hit limits under normal load, contact support@tokenhub.ai to request a limit increase for your plan.
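The first two steps can be combined in a small client-side helper. The following is a minimal sketch, not TokenHub's official client code: `send` is a hypothetical callable that performs one request and returns `(status_code, headers, body)`, and the `base` and `cap` values are illustrative defaults. It waits for the number of seconds given in Retry-After when that header is present, and otherwise falls back to exponential backoff with jitter.

```python
import random
import time


def retry_delay(headers, attempt, base=1.0, cap=60.0):
    """Return the seconds to wait before retry number `attempt` (0-based)."""
    # Header names are case-insensitive, so normalize before the lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    retry_after = lowered.get("retry-after")
    if retry_after is not None:
        try:
            return max(0.0, float(retry_after))
        except ValueError:
            pass  # Not a plain number of seconds; fall back to backoff.
    # Exponential backoff with full jitter, capped at `cap` seconds.
    return random.uniform(0, min(cap, base * 2 ** attempt))


def call_with_backoff(send, max_attempts=5, sleep=time.sleep):
    """Call `send()` until it returns a non-429 status or attempts run out.

    `send` is a hypothetical callable returning (status_code, headers, body).
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, headers, body = send()
        if status != 429:
            break
        if attempt < max_attempts - 1:
            sleep(retry_delay(headers, attempt))
    return status, headers, body
```

Passing `sleep` as a parameter keeps the helper easy to test without real delays.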
402 Payment Required — usage limit reached
The API key used for this request has reached its monthly token or spend limit.
Steps to resolve:
- Go to Settings → API Keys, find the key, and click Edit → Usage Limits.
- Increase the monthly token limit or monthly spend limit, or remove the limit if it is no longer needed.
- If you need to add funds to your account, go to Settings → Billing.
404 Model not found — invalid model name
The model name you specified does not exist or is not available on your plan.
Steps to resolve:
- Retrieve the current list of supported models.
- Check that the model name in your request exactly matches an id value from the response. Model names are case-sensitive.
- If you are using a provider-prefixed name (for example, openai/gpt-4o), confirm that the prefix is correct.
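Once you have the list of id values, the exact-match check can be automated. This is a minimal sketch that assumes you have already extracted the ids into a Python list; `check_model_name` is a hypothetical helper, and it does not call any TokenHub endpoint. Because model names are case-sensitive, it compares exactly and only suggests an id that differs by letter case.

```python
def check_model_name(requested, available_ids):
    """Return (ok, suggestion) for a requested model name.

    The comparison is exact because model names are case-sensitive.
    `suggestion` is an id that differs only by letter case, if one exists.
    """
    if requested in available_ids:
        return True, None
    for model_id in available_ids:
        if model_id.lower() == requested.lower():
            return False, model_id
    return False, None


# Illustrative list; use the id values from your own model list response.
ids = ["openai/gpt-4o"]
ok, suggestion = check_model_name("OpenAI/GPT-4o", ids)
if not ok and suggestion:
    print(f"Did you mean {suggestion}?")
```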
500 / Provider error — provider outage or internal error
The request failed due to an error on the provider’s side or an unexpected internal error.
Steps to resolve:
- Check the TokenHub status page at status.tokenhub.ai for active incidents.
- If the error is provider-specific, check the affected provider’s own status page.
- Enable fallback routing in the dashboard (Routing → Fallback) so that TokenHub automatically retries failed requests on a backup provider.
- If the error is intermittent and not tied to a known incident, retry the request. If it persists, contact support with the X-Inferoute-Request-Id from the failing response.
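The last step (retry, then escalate with the request ID) might look like the sketch below. It makes several assumptions: `send` is a hypothetical callable returning `(status_code, headers, body)`, the attempt count and fixed delay are arbitrary, and `PersistentServerError` is an illustrative exception type that is not part of any TokenHub SDK.

```python
import time


class PersistentServerError(Exception):
    """Raised when a 5xx error survives all retries (illustrative type)."""

    def __init__(self, status, request_id):
        super().__init__(f"request failed with {status}; request id: {request_id}")
        self.status = status
        self.request_id = request_id


def retry_server_errors(send, attempts=3, delay=1.0, sleep=time.sleep):
    """Retry `send()` on 5xx responses; raise with the request ID if they persist."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        status, headers, body = send()
        if status < 500:
            return status, headers, body
        if attempt < attempts - 1:
            sleep(delay)
    # Keep the ID of the last failing response so it can go into a support ticket.
    lowered = {name.lower(): value for name, value in headers.items()}
    raise PersistentServerError(status, lowered.get("x-inferoute-request-id"))
```

Catching `PersistentServerError` in your application gives you one place to log the request ID before contacting support.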
Slow responses — higher than expected latency
Response times are significantly higher than your baseline.
Steps to resolve:
- In the dashboard, go to Routing → Strategy and switch to Latency-optimized mode. This selects the provider with the lowest observed response time for each request.
- Check the X-Inferoute-Provider response header to see which provider served the request. If a normally fast provider is slow, check its status page.
- Avoid unnecessarily large max_tokens values. Providers begin generating before knowing the full output length, but very high limits can affect scheduling.
- If the issue is specific to one model or provider, try an equivalent model on a different provider using a provider-prefixed name.
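To tell whether slowness is tied to one provider, it can help to group your own timing measurements by the value of the X-Inferoute-Provider header. The sketch below assumes you have recorded `(provider, seconds)` pairs yourself; the helper name and the sample numbers are made up for illustration.

```python
from collections import defaultdict
from statistics import median


def median_latency_by_provider(samples):
    """Group (provider, seconds) pairs and return the median latency per provider."""
    grouped = defaultdict(list)
    for provider, seconds in samples:
        grouped[provider].append(seconds)
    return {provider: median(values) for provider, values in grouped.items()}


# Made-up measurements: provider names taken from the response header,
# durations measured by the client.
samples = [("provider-a", 1.0), ("provider-a", 3.0), ("provider-b", 2.0)]
print(median_latency_by_provider(samples))
```

The median is used rather than the mean so that a single slow request does not dominate the comparison.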
Wrong provider being used — unexpected provider in response
TokenHub is routing your request to a provider you did not intend to use.
Steps to resolve:
- Use a provider-prefixed model name (for example, openai/gpt-4o) to explicitly target the provider you want.
- Check the X-Inferoute-Provider header in the response to confirm which provider served the request.
- Review your routing strategy in Routing → Strategy. Cost-optimized or latency-optimized strategies may select a different provider than you expect based on current conditions.
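The first two steps can be sketched as a pair of helpers. Both function names are hypothetical. The comparison also assumes that the X-Inferoute-Provider header value uses the same identifier as the model-name prefix (for example, openai), which this page does not confirm.

```python
def prefixed_model(provider, model):
    """Build a provider-prefixed model name such as 'openai/gpt-4o'."""
    return f"{provider}/{model}"


def served_by_expected(headers, expected_provider):
    """Return True if the response headers name the expected provider.

    Assumes the header value matches the provider prefix (unconfirmed).
    """
    lowered = {name.lower(): value for name, value in headers.items()}
    actual = lowered.get("x-inferoute-provider")
    return actual is not None and actual.lower() == expected_provider.lower()


model = prefixed_model("openai", "gpt-4o")  # "openai/gpt-4o"
```

If `served_by_expected` returns False for a request that used a prefixed name, review the routing strategy as described in the last step.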
Debugging tips
Identify the serving provider
Every TokenHub response includes an X-Inferoute-Provider header that tells you which provider handled the request.
Get a request ID for support tickets
Every response includes an X-Inferoute-Request-Id header with a unique identifier for the request.
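Both debugging headers can be pulled out of a response in one place and logged with each request. This is a minimal sketch that assumes your HTTP client exposes response headers as a mapping of names to values. `debug_info` is a hypothetical helper, and the header values in the usage lines are placeholders.

```python
def debug_info(headers):
    """Extract the provider and request ID from response headers.

    Header names are matched case-insensitively; missing headers yield None.
    """
    lowered = {name.lower(): value for name, value in headers.items()}
    return {
        "provider": lowered.get("x-inferoute-provider"),
        "request_id": lowered.get("x-inferoute-request-id"),
    }


# Placeholder values for illustration only.
info = debug_info({
    "X-Inferoute-Provider": "example-provider",
    "X-Inferoute-Request-Id": "example-request-id",
})
print(info["provider"], info["request_id"])
```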