QwQ-32B

QwQ-32B

Matching R1 Reasoning, Yet 20x Smaller

147 followers

QwQ-32B, from Alibaba Qwen team, is a new open-source 32B LLM achieving DeepSeek-R1 level reasoning via scaled Reinforcement Learning. Features a "thinking mode" for complex tasks.

QwQ-32B makers

Here are the founders, developers, designers and product people who worked on QwQ-32B