Skip to main content
All models share the same API endpoint. Simply change the model parameter to switch between them:
POST https://toapis.com/v1/chat/completions
For the complete request parameters and response format, see Chat Completions API.

Anthropic / Claude

Model IDDescriptionVisionThinking
claude-haiku-4-5Fastest and most cost-effective Claude model, ideal for high-throughput lightweight tasks
claude-sonnet-4-6Best balance of performance and cost, suitable for most business use cases
claude-opus-4-6Claude flagship model, ideal for complex reasoning and high-quality generation

OpenAI / GPT-5

Model IDDescriptionVisionThinking
gpt-5OpenAI flagship model with powerful general reasoning and multimodal capabilities
gpt-5-codexGPT-5 optimized for code — excels at code generation and debugging
gpt-5-codex-miniLightweight code model with ultra-fast response and low cost
gpt-5.1GPT-5 enhanced, with improved instruction following and reasoning
gpt-5.1-codexGPT-5.1 code-specialized variant
gpt-5.1-codex-maxGPT-5.1 flagship code model, maximum coding capability
gpt-5.1-codex-miniGPT-5.1 lightweight code variant, fast and cost-efficient
gpt-5.2GPT-5 second generation with enhanced multimodal capabilities
gpt-5.2-codexGPT-5.2 code-specialized variant
gpt-5.3-codexLatest GPT-5 code model with continuously improving capabilities
gpt-5.3-codex-sparkGPT-5.3 Codex Spark variant, focused on high-speed code generation
gpt-5.4GPT-5 generation 4, enhanced general reasoning and multimodal capabilities

Google / Gemini

Model IDDescriptionVisionThinking
gemini-3.1-fastGoogle Gemini fast variant, high-value multimodal model
gemini-3.1-thinkingGoogle Gemini thinking variant with built-in reasoning chain, ideal for complex problem solving

DeepSeek

Model IDDescriptionVisionThinking
deepseek-v3.2DeepSeek flagship model with extended thinking support, excels at code and complex reasoning

Alibaba / Qwen

Model IDDescriptionVisionThinking
qwen3-maxQwen flagship model with deep thinking support, excels at complex reasoning and code generation
qwen3.5-plusQwen enhanced model, balanced performance and cost
qwen3.5-flashQwen ultra-fast model, ideal for low-latency and high-throughput scenarios

Zhipu / GLM

Model IDDescriptionVisionThinking
glm-5Zhipu flagship model with outstanding Chinese language understanding and generation

Moonshot / Kimi

Model IDDescriptionVisionThinking
kimi-k2.5Moonshot flagship model with ultra-long context and strong Chinese language capabilities

MiniMax

Model IDDescriptionVisionThinking
MiniMax-M2.5MiniMax flagship model with excellent cost-effectiveness