We used Cohere multi-modal embeddings to select the images for the articles. We found that this works better than GPT-Vision for selecting high-quality images that are contextually relevant to the surrounding text. This was a hard problem to solve, and the quality of Cohere's multi-modal embeddings improved the images we selected.
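
As a rough illustration of how embedding-based image selection can work, the sketch below ranks candidate images by cosine similarity to the surrounding text. It assumes the text and the images have already been embedded into the same vector space by a multi-modal embedding model; the embedding call itself is left out. The function names (`cosine_similarity`, `rank_images`), the file names, and the toy vectors are hypothetical and are not taken from our pipeline, which may score images differently.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_images(
    text_embedding: Sequence[float],
    image_embeddings: Dict[str, Sequence[float]],
) -> List[Tuple[str, float]]:
    """Return (image name, score) pairs, most similar to the text first."""
    scored = [
        (name, cosine_similarity(text_embedding, embedding))
        for name, embedding in image_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy 2-dimensional vectors standing in for real embeddings (assumption:
# real embeddings would come from the multi-modal model and be much larger).
paragraph_embedding = [1.0, 0.0]
candidates = {
    "chart.png": [0.9, 0.1],
    "logo.png": [0.0, 1.0],
    "photo.png": [0.5, 0.5],
}

ranking = rank_images(paragraph_embedding, candidates)
best_image = ranking[0][0]
```

With these toy vectors, `best_image` is `"chart.png"`, because its vector points in nearly the same direction as the paragraph's. In practice a minimum score threshold may also be useful, so that no image is chosen when none of the candidates is close to the text.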