QwQ-32B

QwQ-32B

Matching R1 Reasoning, Yet 20x Smaller

1 shoutout

147 followers

QwQ-32B, from Alibaba Qwen team, is a new open-source 32B LLM achieving DeepSeek-R1 level reasoning via scaled Reinforcement Learning. Features a "thinking mode" for complex tasks.

QwQ-32B: Matching R1 Reasoning, Yet 20x Smaller | Product Hunt