QwQ-32B

模型描述

QwQ-32B is a medium-sized reasoning model from the Qwen series, optimized for enhanced performance in downstream tasks, particularly challenging problems requiring deep reasoning. Unlike conventional instruction-tuned models, QwQ-32B integrates advanced architectural components such as RoPE, SwiGLU, RMSNorm, and Attention QKV bias. With 64 layers, 40 query heads, and 8 key-value heads (GQA), it supports a full 131,072-token context length, though YaRN must be enabled for prompts exceeding 8,192 tokens. Pretrained and post-trained via supervised finetuning and reinforcement learning, it achieves competitive results against leading models like DeepSeek-R1 and o1-mini. Users can explore its capabilities via QwenChat or refer to official resources for deployment guidelines.

全文结束

推荐模型

gpt-4.1-nano-2025-04-14

GPT-4.1 nano 是最快、最具性价比的 GPT-4.1 模型。

o3

Our most powerful reasoning model with leading performance on coding, math, science, and vision

gpt-4o-mini-rev

使用逆向工程在官方应用程序中调用模型并将其转换为 API。