OfoxAI
OfoxAI is a unified LLM API gateway that routes requests to GPT, Claude, Gemini, DeepSeek, Qwen, and 100+ other models through a single API key. Developers point existing OpenAI, Anthropic, or Gemini SDKs at OfoxAI endpoints and swap models without rewriting client code. It is built for teams shipping production apps and solo devs who want one bill instead of juggling multiple provider accounts.
The platform bills at official model-provider prices with zero platform markup, with per-token rates listed in a searchable catalog of 110+ models. Multi-region nodes in Tokyo, Singapore, and Frankfurt target roughly 300ms latency, and the gateway exposes OpenAI-compatible, Anthropic Messages, and Gemini native protocols from one account.
Backend engineers, AI agent builders, and indie developers use OfoxAI when they need reliable multi-model access for coding assistants, chat apps, and research pipelines. Enterprise teams get budget caps per API key, zero content retention on prompts, and SLA-backed uptime without contacting sales to unlock team controls.
Route 110+ models from OpenAI, Anthropic, Google, DeepSeek, and Qwen through one API key
OpenAI, Anthropic Messages, and Gemini native endpoints with no client rewrites
Per-token billing at provider list prices with 0% platform fee
Global nodes in Tokyo, Singapore, and Frankfurt for roughly 300ms latency
Works with Claude Code, Cursor, Cline, LangChain, and 20+ dev tool integrations
One API key unlocks 100+ models across OpenAI, Anthropic, Google, DeepSeek, and Chinese providers.
0% platform fee with per-token billing at official provider rates.
Native Anthropic, OpenAI, and Gemini protocol support for zero-migration setup.
Set daily, weekly, or monthly budget caps per API key with automatic rate limiting.
Prompts and responses are not logged or used for model training.
Usage-based billing only; costs scale directly with token consumption.
Enterprise SLA covers Ofox platform availability, not upstream provider outages.
LLM observability integrations with Langfuse and Datadog are listed as coming soon.
How does OfoxAI pricing work?
OfoxAI uses pure pay-as-you-go billing. You top up a balance in the OfoxAI Console, and each API call is charged in real time based on input tokens, output tokens, and any cache or web-search fees for that model. Balance never expires, and there are no subscriptions or monthly minimums.
Which API protocols does OfoxAI support?
OfoxAI supports three protocols from one API key: OpenAI-compatible endpoints at api.ofox.ai/v1, Anthropic Messages at api.ofox.ai/anthropic, and Gemini native at api.ofox.ai/gemini. Existing SDKs can point at these base URLs without rewriting application code.
Does OfoxAI charge a platform fee on top of model prices?
No. OfoxAI bills at official model-provider prices with 0% platform markup. The enterprise page states you pay provider list rates with no surcharge, unlike some gateways that add a percentage on top.
Can I use OfoxAI with Claude Code?
Yes. OfoxAI documents a Claude Code setup where you set ANTHROPIC_BASE_URL to https://api.ofox.ai/anthropic and ANTHROPIC_AUTH_TOKEN to your OfoxAI API key. The vibe-coding page also lists OpenCode and Cline as supported tools.
How many models does OfoxAI offer?
OfoxAI's model catalog lists 110 models across providers including OpenAI, Anthropic, Google, DeepSeek, Qwen, Kimi, Doubao, Zhipu GLM, Mistral, and xAI Grok. Each model page shows context window, max output, and per-million-token input and output rates.
Does OfoxAI store or train on my prompts?
No. OfoxAI enterprise documentation states prompts and responses are never logged or used for training. Only request metadata and token usage are retained for billing purposes.

