Ankit Sharma

3mo ago

PaliGemma 2 mix - A vision-language model for multiple tasks

PaliGemma 2 mix, Google’s new vision-language model, solves tasks like image captioning, OCR, object detection, and segmentation.