You Do the Math: Fine-Tuning Multimodal Models (CLIP) to Match Cartoon Images to Joke Captions

:pray: Please leave your feedback and questions about this blog post here!