Fuyu-8B is a multimodal model capable of... 🖼️ Visual Question Answering 🖼️ Image Captioning 🖼️ Text Localization and more!