QwQ-32B
QwQ-32B is an open-weights reasoning model released by Alibaba's Qwen team in March 2025. It is a 32.5-billion-parameter dense transformer built on Qwen2.5-32B and post-trained with supervised fine-tuning and reinforcement learning, positioned against DeepSeek-R1 and OpenAI o1-mini on hard reasoning problems.
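
To give the 32.5-billion-parameter figure some practical meaning, the sketch below estimates how much memory the weights alone would occupy in a few common storage formats. It is a back-of-the-envelope calculation, not a measurement. The bytes-per-parameter values are assumptions for illustration, and the estimate ignores the KV cache, activations, and quantization metadata. The names `PARAMS`, `BYTES_PER_PARAM`, and `weight_memory_gb` are hypothetical helpers, not part of any Qwen tooling.

```python
# Parameter count as stated in the description above.
PARAMS = 32.5e9

# Assumed bytes per parameter for common storage formats. Real quantized
# checkpoints carry extra metadata (scales, zero points) that is ignored here.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Weights-only footprint in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    for name, nbytes in BYTES_PER_PARAM.items():
        print(f"{name}: {weight_memory_gb(PARAMS, nbytes):.2f} GB")
```

Under these assumptions the bf16 weights come to about 65 GB, so running the model at that precision would typically take more than one consumer GPU, while a 4-bit quantization brings the weights down to roughly 16 GB before overhead.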