laion/CLIP-ViT-L-14-CommonPool.XL-s13B-b90K Zero-Shot Image Classification • Updated Nov 12, 2024 • 3.21k • 3
Running on Zero 306 306 Joy Caption Beta One 🖼 Generate captions for images based on various styles and formats
view post Post 2133 Wan2.1-FLF2V🎥 a 14B start-end frame video generation model just released by Alibaba_Wan🔥 Wan-AI/Wan2.1-FLF2V-14B-720P✨ Give it two images (start & end), it generates a smooth, high-quality video in between.✨ Apache 2.0 licensed ✨ Built on DiT + Flow Matching See translation 1 reply · 🚀 5 5 🤗 1 1 ❤️ 1 1 + Reply
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.26k
view post Post 3397 🚀AraClip is now fully integrated with Hugging Face 🤗AraClip is a specialized CLIP model that was created by @pain and optimized for Arabic text-image retrieval tasks🔥🔗 Try it out 🔗🤖 model: Arabic-Clip/araclip🧩 Gradio demo: Arabic-Clip/Araclip-Simplified🌐 website: https://arabic-clip.github.io/Arabic-CLIP/ See translation 2 replies · 🔥 5 5 ❤️ 3 3 🚀 1 1 + Reply
view post Post 2836 SkyReels-A2 🚀 an open framework for controllable video generation from text + images, released by Skywork, KunLun ✨Model: Skywork/SkyReels-A2✨Paper: SkyReels-A2: Compose Anything in Video Diffusion Transformers (2504.02436) See translation 1 reply · 🔥 7 7 ❤️ 4 4 🤗 1 1 + Reply