monjue

francisgroup

AI & ML interests

None yet

Recent Activity

reacted to openfree's post with ๐Ÿš€ about 2 hours ago
๐ŸŒ Whisper-OCR Multilingual Translation Space ๐Ÿš€ Welcome! This Space takes English audio, video, images, and PDFs and instantly converts them into Chinese (ZH), Thai (TH), and Russian (RU)โ€”no other source language required. https://huggingface.co/spaces/VIDraft/voice-trans โœจ Key Features ๐ŸŽค Microphoneโ€‚โ€“ Record English speech โ†’ transcript + 3-language translation ๐Ÿ”Š Audio Fileโ€‚โ€“ Upload English audio โ†’ transcript + translation ๐ŸŽฌ Video Fileโ€‚โ€“ Auto-extract audio with FFmpeg โ†’ transcript + translation ๐Ÿ–ผ๏ธ Imageโ€‚โ€“ Nanonets-OCR pulls text โ†’ translation ๐Ÿ“„ PDFโ€‚โ€“ Up to 50 pages of text & tables โ†’ translation ๐Ÿ”„ Realtime Modeโ€‚โ€“ Flush every 10-15 s; newest lines appear at the top ๐Ÿ› ๏ธ Quick Start Click โ€œDuplicateโ€ to fork, or launch directly. Pick a tab (๐ŸŽค/๐Ÿ”Š/๐ŸŽฌ/๐Ÿ–ผ๏ธ/๐Ÿ“„/๐Ÿ”„) and feed it English input. After a few seconds, see the ๐Ÿ“œ original and ๐ŸŒ 3-language translation side by side. โšก Tech Stack openai/whisper-large-v3-turbo โ€” fast, high-accuracy ASR Nanonets-OCR-s (+ Flash Attention 2) โ€” document/image OCR Gradio Blocks โ€” clean tabbed UI PyTorch + CUDA โ€” auto GPU allocation & ThreadPool load balancing ๐Ÿ“Œ Notes Translation quality depends on audio quality, lighting, and resolution. Huge videos hit the HF Space upload cap (~2 GB). Realtime tab requires browser microphone permission.
reacted to ginipick's post with ๐Ÿš€ about 2 hours ago
๐ŸŽฌ VEO3 Directors - All-in-One AI Video Creation Suite ๐Ÿš€ What is VEO3 Directors? VEO3 Directors is a revolutionary end-to-end AI video creation platform that transforms your ideas into cinematic reality. From story conception to final video with synchronized audio - all in one seamless workflow! ๐Ÿ”— Try It Now https://huggingface.co/spaces/ginigen/VEO3-Directors https://huggingface.co/spaces/ginigen/VEO3-Free https://huggingface.co/spaces/ginigen/VEO3-Free-mirror โœจ Key Features ๐Ÿ“ Story Seed Generator ๐ŸŽฒ Instantly generate creative story ideas across multiple genres ๐ŸŒ Bilingual support (English/Korean) ๐ŸŽญ Rich categories: Genre, Setting, Characters, and more ๐ŸŽฅ AI Script & Prompt Crafting ๐Ÿ’ฌ Powered by Friendli API for Hollywood-quality prompts ๐Ÿค– AI Director writes detailed cinematography instructions ๐ŸŽฌ Professional elements: camera movements, lighting, VFX ๐ŸŽฌ Video + Audio Generation ๐ŸŽจ Wan2.1-T2V-14B for stunning visual quality โšก NAG 4-step inference - 10x faster generation ๐ŸŽต MMAudio auto-generates matching soundscapes ๐ŸŽ›๏ธ Full control over resolution, duration, and style ๐Ÿ’ฌLLM(API): VIDraft/Gemma-3-R1984-27B ๐Ÿ’ก How It Works Generate Story โ†’ "The Time Traveler's Final Choice" ๐Ÿ•ฐ๏ธ Create Script โ†’ AI writes cinematic scene descriptions ๐Ÿ“œ Produce Video โ†’ 4-8 second clip with synchronized audio ๐ŸŽž๏ธ ๐ŸŽฏ What Makes It Special Unified Workflow: From idea to video in one interface Director-Level Prompts: Professional cinematography language Lightning Fast: Minutes, not hours Smart Audio: Context-aware sound generation ๐Ÿ† Use Cases ๐Ÿ“ฑ Social Media Content ๐ŸŽ“ Educational Videos ๐Ÿ“บ Marketing & Ads ๐ŸŽฎ Game Cutscene Prototyping ๐ŸŽจ Digital Art Creation
reacted to seawolf2357's post with ๐Ÿ”ฅ about 2 hours ago
โšก FusionX Enhanced Wan 2.1 I2V (14B) ๐ŸŽฌ ๐Ÿš€ Revolutionary Image-to-Video Generation Model Generate cinematic-quality videos in just 8 steps! https://huggingface.co/spaces/Heartsync/WAN2-1-fast-T2V-FusioniX โœจ Key Features ๐ŸŽฏ Ultra-Fast Generation: Premium quality in just 8-10 steps ๐ŸŽฌ Cinematic Quality: Smooth motion with detailed textures ๐Ÿ”ฅ FusionX Technology: Enhanced with CausVid + MPS Rewards LoRA ๐Ÿ“ Optimized Resolution: 576ร—1024 default settings โšก 50% Speed Boost: Faster rendering compared to base models ๐Ÿ› ๏ธ Technical Stack Base Model: Wan2.1 I2V 14B Enhancement Technologies: ๐Ÿ”— CausVid LoRA (1.0 strength) - Motion modeling ๐Ÿ”— MPS Rewards LoRA (0.7 strength) - Detail optimization Scheduler: UniPC Multistep (flow_shift=8.0) Auto Prompt Enhancement: Automatic cinematic keyword injection ๐ŸŽจ How to Use Upload Image - Select your starting image Enter Prompt - Describe desired motion and style Adjust Settings - 8 steps, 2-5 seconds recommended Generate - Complete in just minutes! ๐Ÿ’ก Optimization Tips โœ… Recommended Settings: 8-10 steps, 576ร—1024 resolution โœ… Prompting: Use "cinematic motion, smooth animation" keywords โœ… Duration: 2-5 seconds for optimal quality โœ… Motion: Emphasize natural movement and camera work ๐Ÿ† FusionX Enhanced vs Standard Models Performance Comparison: While standard models typically require 15-20 inference steps to achieve decent quality, our FusionX Enhanced version delivers premium results in just 8-10 steps - that's more than 50% faster! The rendering speed has been dramatically improved through optimized LoRA fusion, allowing creators to iterate quickly without sacrificing quality. Motion quality has been significantly enhanced with advanced causal modeling, producing smoother, more realistic animations compared to base implementations. Detail preservation is substantially better thanks to MPS Rewards training, maintaining crisp textures and consistent temporal coherence throughout the generated sequences.
View all activity

Organizations

None yet