Zac Zuo

1mo ago

Qwen2.5-Omni - The end-to-end model powering multimodal chat

Qwen2.5-Omni is an end-to-end multimodal model from the Qwen team at Alibaba Cloud. It understands text, images, audio, and video, and generates text and natural streaming speech.