OpenAI-compatible models via Claudible
Pricing is token-based, with separate input rates for cache hits and cache misses. The API is a drop-in for the OpenAI Chat Completions format.
Send your first request
Endpoint
POST https://cn.claudible.io/v1/chat/completions
curl https://cn.claudible.io/v1/chat/completions \
-H "Authorization: Bearer YOUR_CLAUDIBLE_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"deepseek-v4-flash","messages":[{"role":"user","content":"Hello"}]}'
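The same request can be sketched in Python using only the standard library. The helper names `build_request` and `send` are illustrative and not part of any Claudible SDK, and the response is assumed (not confirmed here) to follow the OpenAI Chat Completions shape.

```python
import json
import urllib.request

ENDPOINT = "https://cn.claudible.io/v1/chat/completions"


def build_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Build the same POST request as the curl command (hypothetical helper)."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON response body (hypothetical helper)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage (performs a real network call, so it is left commented out):
# reply = send(build_request("YOUR_CLAUDIBLE_API_KEY", "deepseek-v4-flash", "Hello"))
# print(reply["choices"][0]["message"]["content"])  # assumes an OpenAI-style response
```

Building the request separately from sending it makes it easy to inspect the URL, headers, and body before any credits are spent.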
Models & pricing
| Model | Input (cache miss) | Input (cache hit) | Output |
|---|---|---|---|
| deepseek-v4-flash | 8 | 0.16 | 16 |
| deepseek-v4-pro | 100 | 1 | 200 |
| glm-5.1 | 400 | 4 | 800 |
| kimi-k2.6 | 300 | 3 | 600 |
| qwen3.6-plus | 105 | 1.05 | 210 |

All prices are in credits per 1M tokens.
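As a rough illustration of how the three rates combine, the sketch below estimates the cost of a request in credits. It assumes billing is a simple sum of cache-miss input, cache-hit input, and output tokens, each charged at its per-1M rate; the function name `estimate_credits` and the way token counts are split are assumptions for illustration, not a documented billing formula.

```python
# Prices copied from the pricing table, in credits per 1M tokens:
# (input cache miss, input cache hit, output)
PRICES = {
    "deepseek-v4-flash": (8, 0.16, 16),
    "deepseek-v4-pro": (100, 1, 200),
    "glm-5.1": (400, 4, 800),
    "kimi-k2.6": (300, 3, 600),
    "qwen3.6-plus": (105, 1.05, 210),
}


def estimate_credits(model: str, miss_tokens: int, hit_tokens: int, output_tokens: int) -> float:
    """Estimate cost in credits, assuming each token class is billed linearly."""
    miss_price, hit_price, output_price = PRICES[model]
    total = (
        miss_tokens * miss_price
        + hit_tokens * hit_price
        + output_tokens * output_price
    )
    return total / 1_000_000  # rates are quoted per 1M tokens


# Example: 200k uncached input, 800k cached input, 100k output on deepseek-v4-pro
# (200_000 * 100 + 800_000 * 1 + 100_000 * 200) / 1_000_000 = 40.8 credits
print(estimate_credits("deepseek-v4-pro", 200_000, 800_000, 100_000))
```

Under this assumption, most of the input in the example is cached, so the 800k cache-hit tokens add only 0.8 credits to the total.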
Provider docs: api-docs.deepseek.com