Production-ready AI gateway

OpenAI-compatible models via Claudible

Enterprise-grade reliability with 99.9% uptime. Token-based pricing with cache hit/miss aware billing. Drop-in OpenAI Chat Completions format.

99.9% Uptime SLA
Cache-aware Billing
OpenAI Compatible
0% Available

Service Status

0% 7-day availability
100% 50% 0%

Send your first request

Endpoint POST https://cn.claudible.io/v1/chat/completions
curl https://cn.claudible.io/v1/chat/completions \
  -H "Authorization: Bearer YOUR_CLAUDIBLE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"deepseek-v4-flash","messages":[{"role":"user","content":"Hello"}]}'

Models & pricing

Model Input miss Input hit Output
deepseek-v4-flash80.1616
deepseek-v4-pro801160
glm-5.13204640
kimi-k2.62403480
mimo-v2.580.1616
mimo-v2.5-pro801160
qwen3.6-plus841.05168
qwen3.7-max11523345
qwen3.7-plus181.8474

credits / 1M tokens

Provider docs: api-docs.deepseek.com ↗