Qwen: Qwen3 Max
Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It delivers higher accuracy in math, coding, logic, and science tasks, follows complex instructions in Chinese and English more reliably, reduces hallucinations, and produces higher-quality responses for open-ended Q&A, writing, and conversation. The model supports over 100 languages with stronger translation and commonsense reasoning, and is optimized for retrieval-augmented generation (RAG) and tool calling, though it does not include a dedicated “thinking” mode.
Model Information
Pricing Information
Supported Parameters
Common Use Cases
Text Generation
- • Content writing and editing
- • Code generation and debugging
- • Creative writing and storytelling
- • Translation and summarization
General Applications
- • Chatbots and virtual assistants
- • Educational content creation
- • Research and analysis
- • Automation and workflow
Frequently Asked Questions
What is the context length of this model?
This model has a context length of 256K tokens, which means it can process and remember up to 256K tokens of text in a single conversation or request.
How much does it cost to use this model?
Prompt tokens cost $1.20M/1M tokens and completion tokens cost $6.00M/1M tokens.
What modalities does this model support?
This model supports text->text modality, accepting textas input and producing text as output.
When was this model created?
This model was created on September 23, 2025.