9 followers
2mo ago
Qwen2.5-VL-32B is the open-source 32B vision-language model. Combines strong language understanding with image/video analysis. Optimized with RL.
1
3