Production-ready AI gateway

OpenAI-compatible models via Claudible

Enterprise-grade reliability with a 99.9% uptime SLA. Token-based pricing with cache-aware billing: cached (hit) and uncached (miss) input tokens are billed at different rates. Drop-in compatible with the OpenAI Chat Completions format.

99.9% Uptime SLA

Cache-aware Billing

OpenAI Compatible

Send your first request

Endpoint: POST https://cn.claudible.io/v1/chat/completions

curl https://cn.claudible.io/v1/chat/completions \
  -H "Authorization: Bearer YOUR_CLAUDIBLE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"deepseek-v4-flash","messages":[{"role":"user","content":"Hello"}]}'
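The same request can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the helper names `build_chat_request` and `send_chat` are hypothetical, and the response parsing assumes the gateway returns the usual OpenAI Chat Completions body (`choices[0].message.content`), which the page implies but does not show.

```python
import json
import urllib.request

# Endpoint copied from the curl example above.
ENDPOINT = "https://cn.claudible.io/v1/chat/completions"


def build_chat_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) the same POST request as the curl example."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat(api_key: str, model: str, prompt: str, timeout: float = 30.0) -> str:
    """Send the request and return the text of the first reply.

    Assumption: the response follows the OpenAI Chat Completions shape,
    i.e. the reply text lives at choices[0].message.content.
    """
    request = build_chat_request(api_key, model, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]


# Example call (performs a real network request, so it is left commented out):
# print(send_chat("YOUR_CLAUDIBLE_API_KEY", "deepseek-v4-flash", "Hello"))
```

Keeping request construction separate from sending makes it easy to inspect the headers and payload before any credits are spent.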

Models & pricing

Model	Input (cache miss)	Input (cache hit)	Output
`deepseek-v4-flash`	8	0.16	16
`deepseek-v4-pro`	80	1	160
`glm-5.1`	320	4	640
`kimi-k2.6`	240	3	480
`mimo-v2.5`	8	0.16	16
`mimo-v2.5-pro`	80	1	160
`qwen3.6-plus`	84	1.05	168
`qwen3.7-max`	115	23	345
`qwen3.7-plus`	18	1.84	74

credits / 1M tokens
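To make the cache-aware rates concrete, here is a rough cost estimator in Python. It is a sketch under stated assumptions: the function name `estimate_credits` is hypothetical, the rates are copied from the table above, and billing is assumed to be strictly linear in token counts with no rounding or minimum charge, which the page does not confirm.

```python
# Rates copied from the pricing table, in credits per 1M tokens:
# (input cache miss, input cache hit, output)
RATES = {
    "deepseek-v4-flash": (8, 0.16, 16),
    "deepseek-v4-pro": (80, 1, 160),
    "glm-5.1": (320, 4, 640),
    "kimi-k2.6": (240, 3, 480),
    "mimo-v2.5": (8, 0.16, 16),
    "mimo-v2.5-pro": (80, 1, 160),
    "qwen3.6-plus": (84, 1.05, 168),
    "qwen3.7-max": (115, 23, 345),
    "qwen3.7-plus": (18, 1.84, 74),
}


def estimate_credits(model: str, miss_tokens: int, hit_tokens: int, output_tokens: int) -> float:
    """Estimate the credits for one request.

    Assumption: cost is linear in tokens, with each token class billed
    at its own per-1M rate and no rounding applied.
    """
    miss_rate, hit_rate, output_rate = RATES[model]
    total = (
        miss_tokens * miss_rate
        + hit_tokens * hit_rate
        + output_tokens * output_rate
    )
    return total / 1_000_000
```

For example, `estimate_credits("deepseek-v4-flash", 200_000, 800_000, 100_000)` comes to about 3.328 credits, whereas the same one million input tokens with no cache hits would cost 8 credits for input alone.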

Provider docs: api-docs.deepseek.com