Troubleshooting
Common issues and solutions.Known Limitations
These are expected behaviors, not bugs. See Introduction for the full list.
| Issue | Expected? | Solution |
|---|---|---|
| Cold start delay (10-20s) | Yes | First request after idle is slow. Subsequent requests are fast. |
| 529 quota errors | Yes | Auto-retry usually fixes it. If persistent, wait 30-60 seconds. |
| Web search not working | Yes | Google grounding protocol differs. Use without web search. |
| High latency (South America, etc.) | Yes | Server is in Singapore. Farther = slower. |
Connection Issues
Connection refused or Failed to connect
Connection refused or Failed to connect
Cause: Cannot reach the proxy server.Solutions:
- Check your internet connection
- Test the endpoint:
- If that fails, try again in a few minutes
Timeout errors
Timeout errors
Cause: Request taking too long.Solutions:
- Try once again — could be cold start
- If first request, wait 10-20 seconds (cold start is normal)
- Check your internet speed
Cold start delay
Cold start delay
Cause: First request after idle spins up the backend.Solutions:
- This is normal — wait 10-20 seconds
- Subsequent requests will be fast
- If every request is slow, check your internet connection
Authentication Issues
Invalid API key
Invalid API key
Cause: API key not recognized.Solutions:
- Double-check your API key for typos
- Ensure no extra spaces before/after the key
- Contact admin on Discord: presumptiousbastard
401 Unauthorized
401 Unauthorized
Quota & Rate Limit Issues
529 Resource Exhausted
529 Resource Exhausted
Cause: Temporary quota limit hit.Solutions:
- Simply retry — auto-retry usually works
- If persistent, wait 30-60 seconds
- Switch to
gemini-3-flashtemporarily to preserve Claude quota
Rate limited or Quota exceeded
Rate limited or Quota exceeded
Cause: Heavy usage in short time.Solutions:
- Wait a few minutes
- Use
gemini-3-flashfor simple tasks - If you consistently hit limits, contact admin about usage patterns
Model Issues
Model not found
Model not found
Cause: Invalid model name.Solutions:
- Use exact model names:
claude-opus-4-5-thinkingclaude-sonnet-4-5-thinkingclaude-sonnet-4-5gemini-3-flash
- See Available Models for full list
Extended thinking not working
Extended thinking not working
Cause: Using model without
-thinking suffix.Solutions:- Use
claude-opus-4-5-thinkingorclaude-sonnet-4-5-thinking - Models without
-thinkingdon’t use extended thinking mode
Tool-Specific Issues
Claude Code: command not found
Claude Code: command not found
Install Claude Code first:
Claude Code: Login prompt appears
Claude Code: Login prompt appears
Your config isn’t loading. Check:
~/.claude-proxy/settings.jsonexists- You’re using
claude-proxyalias (not justclaude) - Run
source ~/.zshrcto reload
Cline: Settings not saving
Cline: Settings not saving
- Close VS Code completely
- Reopen and reconfigure
- Check VS Code has write permissions
Cursor: Model not appearing
Cursor: Model not appearing
- Restart Cursor after adding model
- Check model is toggled “on” in settings
- Try removing and re-adding the model
OpenCode: Images not working
OpenCode: Images not working
Add
modalities config to your opencode.json: