Qwen: Qwen3 Max

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It delivers higher accuracy in math, coding, logic, and science tasks, follows complex instructions in Chinese and English more reliably, reduces hallucinations, and produces higher-quality responses for open-ended Q&A, writing, and conversation. The model supports over 100 languages with stronger translation and commonsense reasoning, and is optimized for retrieval-augmented generation (RAG) and tool calling, though it does not include a dedicated “thinking” mode.

Context Length:256K tokens

Pricing:$1.20M

Created:September 23, 2025

Model Information

Context Length

256K

Created

September 23, 2025

TokenizerQwen3

Modalitytext->text

Input Modalities

text

Output Modalities

text

Pricing Information

Prompt

$1.20M

per 1M tokens

Completion

$6.00M

per 1M tokens

Supported Parameters

max_tokenspresence_penaltyresponse_formatseedtemperaturetool_choicetoolstop_p

Common Use Cases

Text Generation

• Content writing and editing
• Code generation and debugging
• Creative writing and storytelling
• Translation and summarization

General Applications

• Chatbots and virtual assistants
• Educational content creation
• Research and analysis
• Automation and workflow

Frequently Asked Questions

What is the context length of this model?

This model has a context length of 256K tokens, which means it can process and remember up to 256K tokens of text in a single conversation or request.

How much does it cost to use this model?

Prompt tokens cost $1.20M/1M tokens and completion tokens cost $6.00M/1M tokens.

What modalities does this model support?

This model supports text->text modality, accepting textas input and producing text as output.

When was this model created?

This model was created on September 23, 2025.