Models

claude-opus-4-6LLM

Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective for large codebases, complex refactors, and multi-step debugging that unfolds over time. The model shows deeper contextual understanding, stronger problem decomposition, and greater reliability on hard engineering tasks than prior generations. Beyond coding, Opus 4.6 excels at sustained knowledge work. It produces near-production-ready documents, plans, and analyses in a single pass, and maintains coherence across very long outputs and extended sessions. This makes it a strong default for tasks that require persistence, judgment, and follow-through, such as technical design, migration planning, and end-to-end project execution.

Context
1M
Input
$5.00 / 1M tokens
Output
$25.00 / 1M tokens
claude-opus-4-7LLM

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on complex, multi-step tasks and more reliable agentic execution across extended workflows. It is especially effective for asynchronous agent pipelines where tasks unfold over time - large codebases, multi-stage debugging, and end-to-end project orchestration. Beyond coding, Opus 4.7 brings improved knowledge work capabilities - from drafting documents and building presentations to analyzing data. It maintains coherence across very long outputs and extended sessions, making it a strong default for tasks that require persistence, judgment, and follow-through. Beyond coding, Opus 4.6 excels at sustained knowledge work. It produces near-production-ready documents, plans, and analyses in a single pass, and maintains coherence across very long outputs and extended sessions. This makes it a strong default for tasks that require persistence, judgment, and follow-through, such as technical design, migration planning, and end-to-end project execution.

Context
1M
Input
$5.00 / 1M tokens
Output
$25.00 / 1M tokens
claude-sonnet-4-6LLM

Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with memory, polished document creation, and confident computer use for web QA and workflow automation.

Context
1M
Input
$3.00 / 1M tokens
Output
$15.00 / 1M tokens
deepseek-v4-flashLLM
Context
1M
Input
$0.15 / 1M tokens
Output
$0.30 / 1M tokens
deepseek-v4-proLLM
Context
1M
Input
$1.70 / 1M tokens
Output
$3.50 / 1M tokens
glm-5LLM
Context
128K
Input
$0.89 / 1M tokens
Output
$3.20 / 1M tokens
gpt-5.4-miniLLM

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding, and tool use, while reducing latency and cost for large-scale deployments. The model is designed for production environments that require a balance of capability and efficiency, making it well suited for chat applications, coding assistants, and agent workflows that operate at scale. GPT-5.4 mini delivers reliable instruction following, solid multi-step reasoning, and consistent performance across diverse tasks with improved cost efficiency.

Context
400K
Input
$0.75 / 1M tokens
Output
$4.50 / 1M tokens
happyhorse-1.0-t2vVideo

HappyHorse-1.0-T2V supports text-to-video generation, featuring highly realistic dynamic rendering. It accurately comprehends text semantics to produce high-quality videos that are fluid, natural, and rich in detail.

1080P
$0.19 / s
720P
$0.11 / s
kimi-k2.5LLM
Context
256K
Input
$0.59 / 1M tokens
Output
$3.00 / 1M tokens
minimax-m2.5LLM
Context
128K
Input
$0.30 / 1M tokens
Output
$1.20 / 1M tokens
qwen-image-plusImage

The Qwen series of image-generation models boasts exceptional text-rendering capabilities and excels in complex text rendering as well as a wide range of generation and editing tasks. This version, a snapshot taken on January 9, 2026, is a distilled and accelerated variant of Qwen-Image-Max, enabling faster generation of high-quality images.

Per image
$0.0300
qwen3.6-plusLLM

The Qwen3.6 native vision-language Plus series models demonstrate exceptional performance on par with the current state-of-the-art models, with a significant improvement in overall results compared to the 3.5 series. The models have been markedly enhanced in code-related capabilities such as agentic coding, front-end programming, and Vibe coding, as well as in multi-modal general object recognition, OCR, and object localization

Context
1M
Input
$2.00 / 1M tokens
Output
$6.00 / 1M tokens
Cache read
$0.20 / 1M tokens