๐ฌ VEO3 Directors - All-in-One AI Video Creation Suite
๐ What is VEO3 Directors? VEO3 Directors is a revolutionary end-to-end AI video creation platform that transforms your ideas into cinematic reality. From story conception to final video with synchronized audio - all in one seamless workflow!
๐ฒ Instantly generate creative story ideas across multiple genres ๐ Bilingual support (English/Korean) ๐ญ Rich categories: Genre, Setting, Characters, and more
๐ฅ AI Script & Prompt Crafting
๐ฌ Powered by Friendli API for Hollywood-quality prompts ๐ค AI Director writes detailed cinematography instructions ๐ฌ Professional elements: camera movements, lighting, VFX
๐ฌ Video + Audio Generation
๐จ Wan2.1-T2V-14B for stunning visual quality โก NAG 4-step inference - 10x faster generation ๐ต MMAudio auto-generates matching soundscapes ๐๏ธ Full control over resolution, duration, and style ๐ฌLLM(API): VIDraft/Gemma-3-R1984-27B
๐ก How It Works
Generate Story โ "The Time Traveler's Final Choice" ๐ฐ๏ธ Create Script โ AI writes cinematic scene descriptions ๐ Produce Video โ 4-8 second clip with synchronized audio ๐๏ธ
๐ฏ What Makes It Special
Unified Workflow: From idea to video in one interface Director-Level Prompts: Professional cinematography language Lightning Fast: Minutes, not hours Smart Audio: Context-aware sound generation
๐ Use Cases
๐ฑ Social Media Content ๐ Educational Videos ๐บ Marketing & Ads ๐ฎ Game Cutscene Prototyping ๐จ Digital Art Creation
Upload Image - Select your starting image Enter Prompt - Describe desired motion and style Adjust Settings - 8 steps, 2-5 seconds recommended Generate - Complete in just minutes!
๐ก Optimization Tips โ Recommended Settings: 8-10 steps, 576ร1024 resolution โ Prompting: Use "cinematic motion, smooth animation" keywords โ Duration: 2-5 seconds for optimal quality โ Motion: Emphasize natural movement and camera work ๐ FusionX Enhanced vs Standard Models Performance Comparison: While standard models typically require 15-20 inference steps to achieve decent quality, our FusionX Enhanced version delivers premium results in just 8-10 steps - that's more than 50% faster! The rendering speed has been dramatically improved through optimized LoRA fusion, allowing creators to iterate quickly without sacrificing quality. Motion quality has been significantly enhanced with advanced causal modeling, producing smoother, more realistic animations compared to base implementations. Detail preservation is substantially better thanks to MPS Rewards training, maintaining crisp textures and consistent temporal coherence throughout the generated sequences.
๐ Whisper-OCR Multilingual Translation Space ๐
Welcome! This Space takes English audio, video, images, and PDFs and instantly converts them into Chinese (ZH), Thai (TH), and Russian (RU)โno other source language required.
๐ค I'm leading 'Openfree AI', Korea's most prominent AI open-source community. First and foremost, I'd like to express my deepest gratitude for Hugging Face's continuous support and efforts. ๐ Our Openfree AI collaborates with various AI communities across Korea, contributing to knowledge sharing and ecosystem development. ๐ค I've been actively promoting the critical importance of Hugging Face as Korea's AI infrastructure backbone, engaging with senior government officials, National Assembly members, university leaders, and media executives to emphasize how Hugging Face represents Korea's AI future at a national policy level. I consider myself a 'voluntary Korean ambassador for Hugging Face'. ๐ฐ๐ทโจ Let me share our community's achievements on the Hugging Face platform over the past year: ๐ฏ
๐ Published hundreds of models and spaces ๐ฅ Surpassed 10 million cumulative visitors ๐ Achieved 1.7 million Monthly Active Users (MAU) ๐จ Generated over 1 million images/videos per month
These achievements were possible thanks to Hugging Face's generous support, including H200 resources. Thank you sincerely. ๐ ๐ I'm thrilled to share exciting news! This July, we'll host the "Hugging Face Forever" seminar at the Korean National Assembly, sponsored by AI policy lawmakers. ๐๏ธ Our community will organize this groundbreaking event focusing on 'Hugging Face and Community Contributions and Roles' - a truly meaningful and revolutionary milestone for Korea's AI ecosystem. ๐ซ We'll continue working hard for Korea's AI ecosystem development and... oh, if you ever need a Korean branch manager for Hugging Face, please let me know! ๐ (Just kidding... or am I? ๐ค) Thank you. ๐ค Openfree AI Representative ๐
๐จ ChartGPT: AI that Draws Diagrams and Designs from Natural Language
Hello! We're the VIDraft team ๐ Introducing ChartGPT - an AI that automatically creates professional diagrams and visual designs when you describe them in text!
๐ง Optimal AI Implementation Based on Gemma-3-R1984-27B ensuring exceptional factuality and accuracy Perfectly understands and visualizes complex structures FLUX.1-schnell for high-quality image generation ๐จ
๐ Perfect Support for Korean & English Just say "Create a flowchart for the machine learning process" and you're done! ๐ฏ Korean prompts are automatically translated to English for design generation โจ ๐ 5 Diagram Types ๐บ๏ธ Concept Map - Connect ideas ๐ Synoptic Chart - See the whole structure at a glance โ๏ธ Radial Diagram - Structure expanding from center ๐ Process Flow - Visualize workflows ๐ WBS - Project hierarchy structure ๐จ 6 Visual Design Types (NEW!) ๐ญ Product Design - Industrial design concept sketches ๐ง Mindmap - Colorful thought maps ๐ฑ Mockup - UI/UX wireframes ๐ Infographic - Data visualization ๐ Diagram - Business workflows ๐ Flowchart - Decision flow charts ๐ Brave Search Integration Need the latest information? Generate more accurate diagrams with real-time web search! ๐ ๐ MCP Protocol Support Perfect integration with other AIs like Claude and ChatGPT! ๐ค ๐ก Usage Examples Diagram Generation Prompt: "Create a concept map showing AI classification system" Result: Beautiful diagram with deep learning, machine learning, and NLP systematically connected โจ Design Generation Prompt: "smartphone banking app design" Result: Professional-level UI/UX mockup design ๐จ ๐ฏ Recommended For ๐ Educators: Visually explain complex concepts ๐ผ Planners: Organize project structures at a glance ๐ง Developers: Document system architecture ๐ Students ๐จ Designers ๐ Marketers
๐ Just Found an Interesting New Leaderboard for Medical AI Evaluation!
I recently stumbled upon a medical domain-specific FACTS Grounding leaderboard on Hugging Face, and the approach to evaluating AI accuracy in medical contexts is quite impressive, so I thought I'd share.
๐ What is FACTS Grounding? It's originally a benchmark developed by Google DeepMind that measures how well LLMs generate answers based solely on provided documents. What's cool about this medical-focused version is that it's designed to test even small open-source models.
๐ฅ Medical Domain Version Features
236 medical examples: Extracted from the original 860 examples Tests small models like Qwen 3 1.7B: Great for resource-constrained environments Uses Gemini 1.5 Flash for evaluation: Simplified to a single judge model
๐ The Evaluation Method is Pretty Neat
Grounding Score: Are all claims in the response supported by the provided document? Quality Score: Does it properly answer the user's question? Combined Score: Did it pass both checks?
Since medical information requires extreme accuracy, this thorough verification approach makes a lot of sense. ๐ Check It Out Yourself
๐ญ My thoughts: As medical AI continues to evolve, evaluation tools like this are becoming increasingly important. The fact that it can test smaller models is particularly helpful for the open-source community!
๐จ FLUX VIDEO Generation - All-in-One AI Image/Video/Audio Generator
๐ Introduction FLUX VIDEO Generation is an all-in-one AI creative tool that generates images, videos, and audio from text prompts, powered by NVIDIA H100 GPU for lightning-fast processing!
โจ Key Features 1๏ธโฃ Text โ Image โ Video ๐ผ๏ธโก๏ธ๐ฌ
Generate high-quality images from Korean/English prompts Transform still images into natural motion videos Multiple size presets (Instagram, YouTube, Facebook, etc.) Demo: 1-4 seconds / Full version: up to 60 seconds
๐จ AI Hairstyle Changer - Transform with 93 Styles! ๐โโ๏ธโจ
๐ Introduction Experience 93 different hairstyles and 29 hair colors in real-time with your uploaded photo! Transform your look instantly with this AI-powered Gradio web app.
โจ Key Features
๐ธ Simple 3 Steps Upload Photo - Upload a front-facing photo Select Style - Choose from 93 hairstyles Pick Color - Click your desired color from 29 color palette options
๐ซ Diverse Hairstyles (93 types)
๐ฏ Short Cuts: Pixie Cut, Bob, Lob, Crew Cut, Undercut ๐ Waves: Soft Waves, Hollywood Waves, Finger Waves ๐ Braids: French Braid, Box Braids, Fishtail Braid, Cornrows ๐ Updos: Chignon, Messy Bun, Top Knot, French Twist ๐ Special Styles: Space Buns, Dreadlocks, Mohawk, Beehive
โก Fast Processing: Get results in just 10-30 seconds ๐ฏ High Accuracy: Natural-looking transformations with AI technology ๐ Professional Quality: High-resolution output suitable for social media ๐ Unlimited Trials: Try as many combinations as you want ๐ฑ User-Friendly: Intuitive interface with visual color palette
๐ก Perfect For
๐ Salon Consultations: Show clients potential new looks before cutting ๐๏ธ Personal Styling: Experiment before making a big change ๐ญ Entertainment: Fun transformations for social media content ๐ฌ Creative Projects: Character design and visualization ๐ Fashion Industry: Match hairstyles with outfits and makeup ๐ธ Photography: Pre-visualization for photoshoots
๐๏ธ Voice Clone AI Podcast Generator: Create Emotionally Rich Podcasts with Your Own Voice!
๐ Project Introduction Hello! Today we're excited to introduce an AI-powered solo podcast generator that creates high-quality voice cloning with authentic emotional expression. Transform any PDF document, web URL, or keyword into a professional podcast with just a few clicks! ๐โก๏ธ๐ง
URL: Simply paste any blog or article link PDF: Upload research papers or documents directly Keyword: Enter a topic and AI searches for the latest information to create content
2. ๐ญ Emotionally Expressive Voice Cloning Powered by Chatterbox TTS:
๐ค Voice Cloning: Learn and replicate your unique voice perfectly ๐ข Natural intonation and emotional expression ๐ Customizable emotion intensity with Exaggeration control โก Seamless handling of long texts with automatic chunking
3. ๐ค State-of-the-Art LLM Script Generation
Professional-grade English dialogue using Private-BitSix-Mistral 12 natural conversational exchanges Real-time web search integration for up-to-date information Fully editable generated scripts! โ๏ธ
๐ก Use Cases ๐ Educational Content
Transform complex research papers into easy-to-understand podcasts Create English learning materials in your own voice
๐ฐ News & Information
Convert international articles into engaging audio content Produce global trend analysis podcasts
๐จ Creative Content
Tell stories in English with your own voice Build your global personal brand with custom audio content
๐ ๏ธ Tech Stack ๐ง LLM: Llama CPP + Private-BitSix-Mistral ๐ฃ๏ธ TTS: Chatterbox (Voice Cloning & Emotional Expression) ๐ Search: Brave Search API ๐ Document Processing: LangChain + PyPDF ๐ฅ๏ธ Interface: Gradio ๐ What Makes Us Special
๐ค Voice Cloning: Perfect voice replication from just a short audio sample ๐ Emotion Contro ๐ Unlimited Length ๐ Real-time Updates
๐ฏ Core Features 15 Expert Theories for professional brand naming Bilingual Support Korean/English for global brands Unified Evaluation System creativity/memorability/relevance scores Real-time Visualization theory-specific custom designs
๐ฌ Applied Theories Cognitive Theories (4) ๐ฆ Square Theory - Semantic square structure with 4-word relationships ๐ Sound Symbolism - Psychological connections between phonemes and meaning ๐ง Cognitive Load - Minimized processing for instant recognition ๐๏ธ Gestalt Theory - Perceptual principles where whole exceeds parts
Creative Theories (3) ๐ Conceptual Blending - Merging concepts to create new meanings ๐ง SCAMPER Method - 7 creative transformation techniques ๐ฟ Biomimicry - Nature-inspired wisdom from 3.8 billion years of evolution
Cultural Theories (3) ๐ญ Jung's Archetype - 12 universal archetypes for emotional connection ๐ Linguistic Relativity - Cross-cultural thinking patterns consideration ๐งฌ Memetics - Cultural transmission and evolutionary potential
Differentiation Theories (3) โก Von Restorff Effect - Uniqueness for 30x better recall ๐จ Color Psychology - Emotional associations and color meanings ๐ Network Effects - Value maximization through network structures
๐ซ Special Features Each theory provides unique visualizations and customized analysis:
Square Theory โ 4-corner relationship diagram Blending โ Concept fusion flowchart Color โ Interactive color palette display Theory-specific insights for each approach
๐๏ธ AI Podcast Generator - Professional Conversation Creation Tool
๐ Project Overview Transform any URL, PDF, or keyword into professional podcast conversations automatically! This AI-powered tool creates engaging, expert-level dialogues in minutes. ๐
โจ Key Features: Multiple Input URL: Web articles, blog posts, news content PDF: Research papers, documents, reports Keywords: Topics like "AI Ethics", "Quantum Computing"
๐ค Smart AI Conversation Generation Local LLM: Mistral-Small 24B model for privacy protection API Fallback: Together AI API support Expert Style: In-depth discussions between host and expert Length: 12-20 exchanges for comprehensive coverage
๐ Multilingual Support English: Alex (Host) & Jordan (Expert) Korean: Junsu (Host) & Minho (Expert)
๐ต High-Quality Text-to-Speech Edge-TTS: Natural cloud-based voices Spark-TTS: Local AI voice model MeloTTS: GPU-powered local synthesis
๐ Real-time Information Search Brave Search API for latest information retrieval Automatic content generation from keywords
๐ฏ How to Use Select Input: Choose URL/PDF/Keyword Set Language: Korean or English Generate Dialogue: AI creates professional podcast script Edit Freely: Modify the generated conversation as needed Create Audio: Generate audio with your preferred TTS engine
๐ก What Makes It Special Professional Quality: Deep analysis rather than simple summaries Data-Driven: Includes statistics, research findings, real examples Fully Editable: Customize conversations after generation Offline Capable: Works without internet using local models
๐ Output Creates approximately 5-minute professional podcast episodes:
๐ค Community & Support For questions, feature requests, or technical issues, please reach out through the Community tab above. We'd love to hear your feedback and help you create amazing podcast content!
๐พ NH Prediction: AI System for Korean Agricultural Price Forecasting ๐พ
๐ Project Introduction Price volatility in agricultural markets has significant impacts from producers to consumers! NH Prediction is an innovative system that utilizes cutting-edge AI technology to predict Korean agricultural wholesale prices based on extensive data spanning 40 years. ๐
๐ง VIDraft's 14 Enhanced Prediction Models The VIDraft research team has developed 14 advanced prediction models by reinforcing existing forecasting approaches:
๐ฎ VID-SARIMA Series: Precisely models seasonality and trends (up to 99.99% accuracy) โ๏ธ VID-ETS Series: Captures multiplicative/additive variation patterns ๐ VID-Holt/Holt-Winters: Simultaneous analysis of linear trends and seasonality ๐ VID-MovingAverage/WeightedMA: Noise removal and medium-term trend identification ๐ VID-Fourier+LR: Hybrid approach capturing complex periodicity
โจ Key Features
๐ Item-Specific Optimization: Customized predictions for each agricultural product (rice, cabbage, apples, and 50+ more) ๐ Ensemble Approach: Enhanced prediction robustness by combining top models ๐ฑ Bilingual Support: Korean/English interfaces ๐๏ธ Integrated Forecast Periods: Simultaneous long-term and short-term predictions ๐ Advanced Visualization
Introduction ๐ AI BOOK MAKER is a revolutionary platform that converts text and PDF files into intelligent AI books. With just a single file upload, our automatic RAG (Retrieval-Augmented Generation) system activates an AI chatbot that perfectly comprehends your content, delivering a next-generation digital book experience that combines interactive flipbooks with conversational intelligence! ๐โจ Groundbreaking Core Features ๐
One-Click RAG System ๐: Automatic knowledge base creation and AI conversation engine activation with just one text or PDF upload Industry-Leading Flip Effects ๐โก๏ธ๐: Exclusive AI-driven page transition technology for an immersive experience beyond physical books Perfect Cross-Platform Support ๐ฑ: Intelligent responsive design providing optimized experiences on any device Automatic Unique URL Generation ๐: Exclusive system creating personalized links for instant sharing with friends, family, and colleagues AI Auto-Summary Engine ๐ค: Intelligent summarization and insight extraction features that instantly grasp the essence of your content Ultra-Intelligent AI Chatbot ๐ฌ: Personalized knowledge assistant to ask questions and get answers about book content
Game-Changer For People Who ๐
๐ Authors and creators wanting to share their knowledge and content as AI-powered interactive books ๐ Educators and students looking to transform research materials and learning content into smart, conversational flipbooks ๐จโ๐ผ Professionals seeking to upgrade business documents into intelligent books shareable with clients and team members ๐ Anyone wanting to share valuable documents with their network while exploring new experiences with AI assistance
Start the Magic in 3 Seconds ๐ ๏ธ
Single Upload ๐ค Ultra-Fast AI Conversion โก Custom URL Acquisition ๐ Explore with AI ๐ฌ
This project is provided under the MIT license, allowing anyone to freely use, modify, and distribute it. ๐ฏ โจ Key Features
๐ Bookmark Management: Manage your frequently visited AI websites and Hugging Face spaces in one place ๐๏ธ Live Preview: Instantly preview sites without having to visit them ๐ผ๏ธ Dual Viewing Modes: Supports both LIVE and STATIC snapshot modes ๐๏ธ Category Organization: Neatly organizes resources by categories like Productivity, Multimodal, Professional, Image, LLM/VLM ๐พ Secure Storage: Dual storage system using both SQLite and JSON for data integrity
๐ Usage Scenarios 1๏ธโฃ AI Researchers & Developers
Manage commonly used AI demos, models, and tools in one centralized location Share valuable AI resources with team members Quickly access the latest AI tools categorized by function
2๏ธโฃ Educators & Students
Systematically organize AI learning materials and demos Easily share resources needed for classes or study groups Efficiently explore learning materials with real-time previews
3๏ธโฃ AI Communities
Create collections of useful AI projects Develop topic-specific AI tool compilations Manage community-recommended resources
๐ก Extension Ideas You can create various projects based on this code:
๐ซ AI Education Portal: A collection of AI learning resources for students ๐จโ๐ป Personal AI Dashboard: Customized AI tool collection for developers ๐ AI Trend Curation: Automatically collect and categorize the latest AI projects ๐ Team Resource Hub: AI resource sharing and management system for organizations
๐ง Technical Features
๐ Flask-based: Lightweight and extensible web framework ๐ฑ Responsive Design: Works on both mobile and desktop ๐ Live/Static Switching: Automatically optimizes viewing mode based on site characteristics ๐ Security Focused
๐ฎ Vibe Game Craft: Create Your Own Web Games Through Multi-LLM and Agent Collaboration for Free โจ Hello, game development enthusiasts! Today I'm introducing an innovative tool that lets you create web games without any coding knowledge. Vibe Game Craft is a magical tool that transforms your ideas into actual games through the collaboration of Claude 3.7 Sonnet and various AI agents! ๐ซ โจ Key Features
๐ค Multi-LLM Collaboration System: Multiple AI agents working together with Claude 3.7 Sonnet at the core to generate high-quality game code ๐ฐ 100% Free to Use: Experience advanced AI game creation technology at no cost ๐น๏ธ Instant Preview: Test your generated games immediately in a sandbox environment ๐ One-Click Deployment: Easily deploy your games to the internet via Vercel ๐ Diverse Templates: Access over 30 game templates including Tetris, Chess, Snake, and more ๐ AI Optimization: Multiple AIs collaborate to streamline code and optimize performance
๐ค AI Collaboration System
Claude 3.7 Sonnet: Main code generation and game logic design Specialized Agents: Dedicated AIs handling graphics, sound, and game mechanisms Optimization Engine: AI system that automatically organizes code and improves performance Feedback Loop: AI feedback mechanism that analyzes and finds improvements for generated games
๐ก Usage Ideas ๐ Programming Learning ๐ Special Gifts ๐งฉ Prototype Creation ๐จโ๐ฉโ๐งโ๐ฆ Family Activity ๐ซ Educational Toolbutton
Hello, AI creators! ๐ Today I'm introducing Ilรบvatar, an amazing tool that automatically generates innovative design and invention ideas.
โจ Key Features
๐ง AI-Powered Idea Generation: Creates detailed design/invention ideas from simple prompts ๐ Web Search Integration: Incorporates real-time information to reflect latest trends ๐ Kaggle Dataset Analysis: Provides data-driven insights ๐ผ๏ธ Automatic Image Generation: Creates image prompts visualizing your ideas ๐ File Upload Support: Analyzes reference materials (text, CSV, PDF) ๐ Business Frameworks: Includes SWOT, Porter's 5 Forces, BCG Matrix analyses ๐ Multilingual Support: Available in both English and Korean
๐ฏ Perfect For
๐ผ Product Designers/Developers: When you need fresh product concepts ๐ฌ Researchers/Inventors: When you need innovative idea inspiration ๐ Planners/Marketers: When you need differentiated business strategies ๐ Students/Educators: For creative thinking and problem-solving education
๐ Start Creating Now! Utilizing 24 categories and ~1,100 items as design SEEDS, the system generates combinations across 2-6 depth levels, creating up to 1,100 trillion design variables. A "water-air transitional device" might combine structural self-reorganization, material transformation, biomimetic movement, and propulsion optimization. The LLM analyzes correlations between user queries and design combinations, identifying innovative elements like hybrid propulsion systems inspired by nature. By integrating data from Kaggle datasets, web searches, and research, the system prioritizes groundbreaking combinations such as "graphene morphing wings + AI fluid dynamics + quantum dot solar cells" with feasibility assessments.
Samsung Hacking Incident: Samsung Electronics' Official Hugging Face Account Compromised Samsung Electronics' official Hugging Face account has been hacked. Approximately 17 hours ago, two new language models (LLMs) were registered under Samsung Electronics' official Hugging Face account. These models are:
The model descriptions contain absurd and false claims, such as being trained on "1 million W200 GPUs," hardware that doesn't even exist. Moreover, community participants on Hugging Face who have noticed this issue are continuously posting that Samsung Electronics' account has been compromised. There is concern about potential secondary and tertiary damage if users download these LLMs released under the Samsung Electronics account, trusting Samsung's reputation without knowing about the hack. Samsung Electronics appears to be unaware of this situation, as they have not taken any visible measures yet, such as changing the account password. Source: https://discord.gg/openfreeai
๐ CycleNavigator: Visualizing Economic and Political Cycles Through AI at a Glance! ๐ง ๐น
๐ซ Strategic Intelligence Tool for Navigating Historical Waves and Forecasting the Future
Hello there! ๐ CycleNavigator brings you an innovative fusion of economic history, data visualization, and generative AI. This open-source project revolutionizes decision-making by displaying four major economic and political cycles through interactive visualizations!
๐ Experience Four Major Cycles in One View:
Business Cycle (โ9 years) โฑ๏ธ - The 'heartbeat' of investment and inventory Kondratiev Wave (โ50 years) ๐ - Long technological innovation waves Finance Cycle (โ80 years) ๐ฐ - Rhythm of debt and financial crises Hegemony Cycle (โ250 years) ๐๏ธ - Transitions in global order
โจ Cutting-Edge Features:
Interactive Wave Visualization ๐ฏ - Intuitive graphs powered by Plotly AI-Powered Historical Similarity Mapping ๐งฉ - Connecting past events via SBERT embeddings Real-time News Integration ๐ฐ - Linking current issues to long cycles with Brave API GPT-Enhanced Analysis ๐ค - Delivering structured insights through optimized prompting
๐ก Practical Applications:
Improve decision accuracy โก by instantly grasping economic trends Identify connections ๐ between breaking news and long-term cycles Gain reliable insights ๐ through verifiable data and transparent methodology Extend to multiple domains ๐ - education, research, asset management, policy institutes
๐ A New Intelligence Paradigm: When slow cycles (9-50-80-250 years) and fast headlines (Brave API) meet on a single canvas, experience an innovative decision-making environment where you can reconstruct the past, interpret the present, and design future scenarios!
โจ DreamO Video: From Customized Images to Videos โจ Hello, AI creators! Today I'm introducing a truly special project. DreamO Video is an integrated framework that generates customized images based on reference images and transforms them into videos with natural movement. ๐ฌโจ
Image Reference (IP): Maintain object appearance while applying to new backgrounds and situations ID Preservation: Retain facial features across various environments Style Transfer: Apply unique styles from reference images to other content ๐๏ธ Video Generation: Create natural 2-second videos from generated images
๐ก How to Use
Upload Reference Images: One or two images (people, objects, landscapes, etc.) Select Task Type: Choose between IP (Image Preservation), ID (Face Feature Retention), or Style Enter Prompt: Describe your desired result (e.g., "a woman playing guitar on a cloud") Click Generate Image: โจ Create customized AI images! Generate Video: Click the ๐ฌ button on the generated image to create a 2-second natural video
๐ Usage Examples
๐ Virtual Fitting: Combine clothes and people to visualize outfit appearance ๐ผ๏ธ Artwork Transformation: Create new images in your favorite styles ๐ธ Portrait Modification: Create appearances in various environments and situations ๐ญ Character Design: Develop new characters based on reference images ๐ฅ Short Animations: Transform static images into vivid videos
โ ๏ธ Demo Version Notice In the current demo version, video generation is restricted to 2 seconds only. The full version supports generation of up to 60 seconds. ๐ Latest Updates
2025.05.13: DreamO Video Integration version released! 2025.05.11: Improved oversaturation and unnatural face issues
Create amazing content with DreamO Video! If you have any questions or feedback, please don't hesitate to contact us. We look forward to seeing your creations! ๐ซ๐จ #AI #ImageGeneration #VideoGeneration #DreamO #HuggingFace
Hello there! Would you like to transform your 3D models into stunning animations? This space can help you! โจ
## ๐ What Can It Do?
This tool converts your uploaded GLB model into: 1. ๐ฎ A transformed GLB file 2. ๐ฌ An animated GIF preview 3. ๐ A metadata JSON file
## โ Key Features
* ๐ฅ๏ธ Works in headless server environments (EGL + pyglet-headless โ pyrender fallback) * ๐ Objects in GIFs appear 3x larger (global scale ร3) * ๐จ Clean interface with pastel background
## ๐ฎ Animation Types
* ๐ Rotate - Object rotates around the Y-axis * โฌ๏ธ Float - Object moves smoothly up and down * ๐ฅ Explode - Object moves sideways * ๐งฉ Assemble - Object returns to its original position * ๐ Pulse - Object changes in size * ๐ Swing - Object swings around the Z-axis
## ๐ ๏ธ How to Use
1. Upload your GLB model ๐ค 2. Select your desired animation type ๐ฌ 3. Adjust the duration and FPS โฑ๏ธ 4. Click the "Generate Animation" button โถ๏ธ 5. Download your results ๐ฅ
## ๐ป Technical Details
* Rendering system using trimesh and pyrender * Automatic fallback method for rendering failures to ensure stability * GIF generation supporting up to 60 frames
Breathe life into your static 3D models with this tool! ๐ If you have any questions or feedback, please let us know. Happy 3D modeling! โจ