The generated video is using images as background and putting text on the images. The generated images don't fit well with the video's context.