OpenAI-compatible models via Claudible
Enterprise-grade reliability with 99.9% uptime. Token-based pricing with cache hit/miss aware billing. Drop-in OpenAI Chat Completions format.
99.9% Uptime
SLA
Cache-aware
Billing
OpenAI
Compatible
0%
Available
Service Status
0%
7-day availability
Send your first request
Endpoint
POST
https://cn.claudible.io/v1/chat/completions
curl https://cn.claudible.io/v1/chat/completions \
-H "Authorization: Bearer YOUR_CLAUDIBLE_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"deepseek-v4-flash","messages":[{"role":"user","content":"Hello"}]}'
Models & pricing
| Model | Input miss | Input hit | Output |
|---|---|---|---|
deepseek-v4-flash | 8 | 0.16 | 16 |
deepseek-v4-pro | 80 | 1 | 160 |
glm-5.1 | 320 | 4 | 640 |
kimi-k2.6 | 240 | 3 | 480 |
mimo-v2.5 | 8 | 0.16 | 16 |
mimo-v2.5-pro | 80 | 1 | 160 |
qwen3.6-plus | 84 | 1.05 | 168 |
qwen3.7-max | 115 | 23 | 345 |
qwen3.7-plus | 18 | 1.84 | 74 |
credits / 1M tokens
Provider docs: api-docs.deepseek.com ↗