Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
bhalajinΒ 
posted an update 4 days ago
Post
1594
###### CVPR2025 Workshop Challenge Alert ######

🫠 Between deadlines, rebuttals, and existential crises??? "We got you!!!!"

πŸ“’ Our new CVPR25 multi-modal challenge is online !!!

🍽️ Dishcovery: VLM MetaFood Challenge!!!! 🍽️


πŸ˜‹πŸ§« Can your groundbreaking VLM understand the difference between sushi styles, pasta types, or cooking methods from just image + caption pairs?

🌐 Our Task: Match fine-grained images to food descriptions


Challenge Highlights:

πŸ“¦ 400K food image-caption pairs, a little taste to get you started !!!

πŸ”¬ Got a SoTA VLM? Come test it on our challenging test sets !!!

🎯 Challenge for everyone! Easy to use SigLIP baseline is provided !!!

πŸ” Real, synthetic, noisy data – just like real life - Will your VLM redefine how people track their diets??? ( πŸ—£οΈ We believe so!!! )


πŸ”— Join the challenge: https://www.kaggle.com/competitions/dishcovery-vlm-mtf-cvpr-2025

πŸ—“οΈ Deadline: Phase I: 4th of May, 2025 - Phase II: 10th of May, 2025

πŸ‘‰ Workshop website: https://sites.google.com/view/cvpr-metafood-2025


#CVPR25 #ComputerVision #CV #Deeplearning #DL #VisionLanguage #VLM #multimodal #FoundationModels
In this post