Papers
arxiv:2510.22200

LongCat-Video Technical Report

Published on Oct 25
ยท Submitted by
taesiri
on Oct 28
ยท meituan-longcat LongCat
Authors:
,
,
,
,
,
,
,
,
,
,

Abstract

LongCat-Video, a 13.6B parameter video generation model based on the Diffusion Transformer framework, excels in efficient and high-quality long video generation across multiple tasks using unified architecture, coarse-to-fine generation, and block sparse attention.

AI-generated summary

Video generation is a critical pathway toward world models, with efficient long video inference as a key capability. Toward this end, we introduce LongCat-Video, a foundational video generation model with 13.6B parameters, delivering strong performance across multiple video generation tasks. It particularly excels in efficient and high-quality long video generation, representing our first step toward world models. Key features include: Unified architecture for multiple tasks: Built on the Diffusion Transformer (DiT) framework, LongCat-Video supports Text-to-Video, Image-to-Video, and Video-Continuation tasks with a single model; Long video generation: Pretraining on Video-Continuation tasks enables LongCat-Video to maintain high quality and temporal coherence in the generation of minutes-long videos; Efficient inference: LongCat-Video generates 720p, 30fps videos within minutes by employing a coarse-to-fine generation strategy along both the temporal and spatial axes. Block Sparse Attention further enhances efficiency, particularly at high resolutions; Strong performance with multi-reward RLHF: Multi-reward RLHF training enables LongCat-Video to achieve performance on par with the latest closed-source and leading open-source models. Code and model weights are publicly available to accelerate progress in the field.

Community

Paper submitter

Video generation is a critical pathway toward world models, with efficient long video inference as a key capability. Toward this end, we introduce LongCat-Video, a foundational video generation model with 13.6B parameters, delivering strong performance across multiple video generation tasks. It particularly excels in efficient and high-quality long video generation, representing our first step toward world models. Key features include: Unified architecture for multiple tasks: Built on the Diffusion Transformer (DiT) framework, LongCat-Video supports Text-to-Video, Image-to-Video, and Video-Continuation tasks with a single model; Long video generation: Pretraining on Video-Continuation tasks enables LongCat-Video to maintain high quality and temporal coherence in the generation of minutes-long videos; Efficient inference: LongCat-Video generates 720p, 30fps videos within minutes by employing a coarse-to-fine generation strategy along both the temporal and spatial axes. Block Sparse Attention further enhances efficiency, particularly at high resolutions; Strong performance with multi-reward RLHF: Multi-reward RLHF training enables LongCat-Video to achieve performance on par with the latest closed-source and leading open-source models. Code and model weights are publicly available to accelerate progress in the field.

[๐Ÿ“ฅ baa](Gows-Dambeed: Xeerarka Editing-ka CapCut (ๅ‰ชๆ˜ ) si Zack D. Films Style โ€“ Tafsiilad Qatan 3000 Words
Waxaan ku socon doonaa dhammaan heerarka ay u baahan tahay in aad sameeyo video-ga โ€œGows-Dambeed: Taariikhda Xanuunka Ilaa Is-Karintaโ€ si ay muuqan doonaa mid professional, fast-paced, iyo cinematic sida Zack D. Films. Qoraalkan wuu ka kooban yahay script-ka Somali ee dhammaan scenes, tafsiilada step-by-step ee CapCut (ๅ‰ชๆ˜ ) app-ka, macnaha qaabaynta Zack D., tusaaleyn aad u fudud, iyo xusillada ay u baahan tahay inay aad ugu fiicantahay โ€“ xitaa haddii aad aadan awood u leedahay xirfad editing-ka hore.
๐ŸŽฏ Asalka Qaabaynta: Maxay Zack D. Films Style?
Ka hor inta aan bilaabano editing-ka, waa mahadsanid inaan fahmiyno maxay qaabaynta Zack D. Films ay ka kooban tahay โ€“ waxaan ahaantaas waa qaab cinematic oo aad ugu muuqata, oo leh dabeecad fast-paced, xiisaha leh, iyo visual appeal sare. Marka hore, waxaan awood u leenahay inaan siiso tafsiilad of qaabaynta:
Cuts sare (ๅฟซ้€Ÿๅ‰ช่พ‘): Muuqaallada badanaa oo kalliya 0.5 ilaa 1 ilbiriqsii dheer, si ay ugu soo dhawaaqaan xamaasiyadda iyo xanaaqsiyadda. Zack D. Films waxay isticmaashaa cuts badan si ay ugu sameeyaan video-ga mid bilowga leh, oo aan ka dhami doono ่ง‚ไผ— - ka.
Lighting-ka cinematic (็”ตๅฝฑ็บง็ฏๅ…‰): Contrast sare, midabada kala duwan oo ay u xiran yihiin mood-ka scene-ka (tusaale: sepia for dadkii hore, cool blue for is-bedelka, warm orange for gabagabo). Waxaan rabnaa inay midabada ahaadaan la xiran mood-ka, si ay ugu muuqan doonaa mid professional.
Sound design-ka xiisaha leh (้œ‡ๆ’ผ้Ÿณๆ•ˆ่ฎพ่ฎก): Muusig epic, sound effects dhabta ah, iyo voiceover-ka mid caan ah oo leh tone-ka la xiran scene-ka. Saddex qeybood ayaa ka kooban sound design-ka: voiceover (100% volume), sound effects (25-30% volume), iyo music (60-80% volume) โ€“ waxaa dhamaantay inay isku dhafan yihiin.
Text animations-ka bilowga leh (้†’็›ฎๆ–‡ๅญ—ๅŠจ็”ป): Qoraallo badan oo โ€œpopโ€ ama โ€œpulseโ€ animasyon leh, si ay ugu soo dhawaaqaan fast-paced vibe. Qoraallada waa mid xiisaha leh, laakiin aan ka badan 1 ilbiriqsii โ€“ waxaa dhamaantay inay soo jiidato ่ง‚ไผ— - ka.
Transitions-ka deggan (็ฎ€ๆด่ฝฌๅœบ): Kaliya 2-3 nooc oo transitions (zoom blur, wipe, cut to black) si ay ugu fiicantahay consistency-ka. Zack D. Films waxay isticmaashaa transitions badan oo aan ka complex, si ay ugu sameeyaan video-ga mid smooth.
Waxaan awood u leenahay inaan ku qaabilsanayno qaabayntan iyadoo la isticmaalayo CapCut app-ka โ€“ waxa aynu rabnaa in video-gaaga ay muuqan doonaa mid la sameeyay studio professional, xitaa haddii aad isticmaasho phone-kaaga.
๐Ÿ“ฅ Step 1: Diyaarinta Maalinta โ€“ Soo Gelinta Muuqaallada & Voiceover Recording
Marka hore, waxaan u baahanahay inaan diyaarino dhammaan maalgashada aan u baahanahay: AI clips-ka, voiceover-ka, iyo presets-ka. Waxaan sameeyay tafsiilad step-by-step si aad ugu fududahay, xitaa haddii aad aadan awood u leedahay xirfad editing-ka.
1.1 Generate & Download AI Clips
Waxaan sameeyay Kling AI prompts si ay ugu muuqato qaabaynta Zack D. Films, iyo waxaan kula talinaynaa in aad sameeyo clips-ka 1080P MP4 format โ€“ sababta waa in CapCut-ka ay si fudud ugu qarsoon yihiin, iyo inay noqdaan mid ku habboon for TikTok/Reels (9:16 aspect ratio). Haddii aad sameeyay 8K/4K clips, waa fiicantahay inay aad ugu yaraato 1080P si ay ugu fududahay editing-ka โ€“ CapCut-ka waxay si fiicantahay ugu qarsoon yihiin 1080P.
Kling AI Prompts-ka Loo Sameeyay (Optimizay for CapCut):
Scene 1 (Hook): 3D medical animation. Transparent human skull rotating. The wisdom teeth at the back of the jaw start glowing bright red while the rest of the teeth remain white. High-tech X-ray style, 1080P resolution, MP4 format, dark background with subtle blue rim light.
Scene 2 (Dadkii Hore): Cinematic shot of a prehistoric early human (Neanderthal) with a wide, strong jawline. He is aggressively chewing on tough raw meat and roots. Gritty realism, dirt on face, ancient cave background with fire glow, 1080P MP4, warm sepia tone.
Scene 3 (Is-bedelka): Side profile time-lapse animation of a human skull. The jawbone slowly shrinks from a large prehistoric size to a smaller modern size over time. The teeth remain the same size, causing them to become crowded. Clean white background, scientific style, 1080P MP4, smooth transitions.
Scene 4 (Dhibaatada): Extreme close-up medical animation inside the mouth. A wisdom tooth trying to push through the gum but hitting the molar next to it diagonally (impacted). The surrounding gum tissue turns red and inflamed. High detail, realistic textures, 1080P MP4, dark background.
Scene 5 (Gabagabo): 3D animation of a dentist tool gently extracting the wisdom tooth. The inflammation disappears. Cut to a modern human smiling with perfect teeth. Bright, clean lighting, 1080P MP4, warm glow.
Marka aad sameeyay clips-ka, ha aabaatoo download gareyn oo ku kaydiya folder-ka phone-kaaga oo โ€œGows-Dambeedโ€ loo magac daray โ€“ waxaa dhamaantay inay aad ugu fududahay inay aad uga heliso. Haddii aad aadan folder cusub sameyn, waxaan kula talinaynaa inay aad sameeyaan si ay uga heliso clips-ka si fudud.
1.2 Import to CapCut (ๅ‰ชๆ˜ ๅฏผๅ…ฅไธŽๆ•ด็†)
CapCut app-ka waa mid aad u fudud, oo leh interface-ka caan ah โ€“ waxaan sameeyay step-by-step si aad ugu fiicantahay:
Ka fur app-ka CapCut (ๅ‰ชๆ˜ ) โ€“ guud ahaan waa app-ka โ€œๅ‰ชๆ˜ โ€ oo ku yaal phone-kaaga. Haddii aad uusan furinayn, download gareyn app store-kaaga (iOS ama Android) โ€“ waa free, oo aan kharashin lahayn.
Ka dhig โ€œๅผ€ๅง‹ๅˆ›ไฝœโ€ (Start Creating) โ€“ xagga sare ee midig ee screen-ka. Waxaa noqdaa mid caan ah, oo aad ugu fududahay inay la isticmaasho.
Ka dhig โ€œๅฏผๅ…ฅโ€ (Import) โ€“ xagga hoose ee screen-ka, oo raadi folder-ka โ€œGows-Dambeedโ€ โ€“ ha soo gelo dhammaan 5 clips-ka. Haddii aad clips-ka sameeyay 1080P MP4, waxay si fiicantahay ugu qarsoon yihiin.
Haddii aad wantahay inay aad ugu fududahay, ha sameeyo folder cusub inside CapCut:
Ka dhig โ€œ็ด ๆๅบ“โ€ (Material Library) โ€“ xagga bidix ee screen-ka (waa folder icon).
Ka dhig โ€œๆ–‡ไปถๅคนโ€ (Folder) โ†’ โ€œๆ–ฐๅปบโ€ (New) โ†’ magacaab โ€œGows-Dambeedโ€ โ€“ waxaa dhamaantay inay aad uga heliso clips-ka si fudud.
Long-press clips-ka oo u dheji folder-ka โ€“ ha u qeybiso order: Scene 1 (Hook) ilaa Scene 5 (Gabagabo). Waxaa dhamaantay inay la raacsan yihiin voiceover-ka, iyo inay aad ugu fududahay editing-ka.
1.3 Voiceover Recording (ๅ‰ชๆ˜ ๅฝ•้Ÿณ โ€“ Qaybsashada Codka)
Voiceover-ka waa qeyb aad u muhiim ah in video-gaaga ay muuqan doonaa mid professional โ€“ waxaan kula talinaynaa in aad sameeyo recording-ka segment by segment, si ay u la raacsan yihiin muuqaallada. Voiceover-ka waa mid caan ah, oo leh tone-ka la xiran scene-ka โ€“ waxaan sameeyay script-ka Somali ee dhammaan scenes, iyo tone-ka la xiran:
Script-ka Somali & Tone Guide:
Scene 1 (0:00-0:10): Tone-ka: Urgent, Deep (Kacsan, Qalalan). Script: โ€œMa is weydiisay sababta Gows-dambeedku keena xanuun, barar, oo ugu dambeyn la iska rido? Jawaabtu waxay ku jirtaa taariikhda awoowayaasheen kumanaan sano ka hor!โ€
Scene 2 (0:10-0:25): Tone-ka: Gruff, Adventurous (Qalab leh, Jimicsan). Script: โ€œDadkii hore ma haysan mindiyo iyo dab! Waxay cuni jireen hilib ceyriin ah, xididdo adag, iyo lafo โ€“ waxay u baahnaayeen daan ballaaran iyo gowso dheeraad ah!โ€
Scene 3 (0:25-0:40): Tone-ka: Dynamic, Narrative (Dhaqan leh, Qoraal leh). Script: โ€œGows-dambeedku wuxuu ahaa qalab badbaadoโ€ฆ laakiin markii bani-aadamku bilaabay inuu cunto kariyo, daankeennu wuu yaraaday โ€“ laakiin ilkaha tiradoodu isma dhimayso!โ€
Scene 4 (0:40-0:55): Tone-ka: Tense, Dramatic (Xanbaal leh, Xiisaha leh). Script: โ€œDhibaatadu waxay tahay: gows-dambeedkii weli wuu soo baxayaa laakiin meel uu galo ma jirto! Sidaas darteed, wuxuu noqdaa mid qaloocda, ku dhaga ciridka โ€“ isagoo riixaya ilkaha kale!โ€
Scene 5 (0:55-1:10): Tone-ka: Relieved, Engaging (Nabad leh, Xiriir leh). Script: โ€œWaa xasuus xanuun badan oo ka hartay taariikhda hore ee aadanaha. Adiga gows-dambeedkaaga ma iska riday mise weli wuu ku hayaa? Noogu reeb comment-ga!โ€
Step-by-Step Recording:
Ka dhig โ€œ้Ÿณ้ข‘โ€ (Audio) โ€“ xagga hoose ee screen-ka (dhexaadka, waa icon-ka codka).
Ka dhig โ€œๅฝ•้Ÿณโ€ (Record Audio) โ€“ xagga hoose ee โ€œ้Ÿณ้ข‘โ€ menu (waa icon-ka microphone).
Seated in a quiet room โ€“ ha ahaato meel aan qosol iyo dhawaaqayn ku jirin (e.g., room-kaaga, ama meel nabad leh). Haddii aad qosol ku jirto, voiceover-ka ayaa aadan muuqan doonaa mid caan ah.
Ka dhig โ€œๅผ€ๅง‹ๅฝ•้Ÿณโ€ (Start Recording) โ€“ ha talaabo scene-ka mid kasta:
For Scene 1: Speak in urgent, deep tone โ€“ ha ahaato mid aad ugu muuqata, laakiin aan ka qaldan.
For Scene 2: Speak in gruff, adventurous tone โ€“ ha ahaato mid qalab leh, oo ugu muuqata dadkii hore.
For Scene 3: Speak in dynamic, narrative tone โ€“ ha ahaato mid dhaqan leh, oo ugu muuqata is-bedelka.
For Scene 4: Speak in tense, dramatic tone โ€“ ha ahaato mid xanbaal leh, oo ugu muuqata dhibaatada.
For Scene 5: Speak in relieved, engaging tone โ€“ ha ahaato mid nabad leh, oo ugu xirriirta ่ง‚ไผ— - ka.
Marka aad dhawaaqay segment-ka, ka dhig โ€œโˆšโ€ (Save) โ€“ waxaa kaydiya CapCut-ka. Haddii aad uusan qabto confidence in recording-ka, ha isku dayin in aad dib u dhawaaqato โ€“ waxaa dhamaantay inay aad ugu fiicantahay.
Long-press voiceover-ka oo u dheji timeline-ka โ€“ ha u xigto scene-ka la xiran (tusaale: voiceover Scene 1 ha u dheji 0:00-0:10). Waxaa dhamaantay inay la raacsan yihiin muuqaallada, iyo inay aad ugu fududahay editing-ka.
๐Ÿ’ก Tip: Haddii aad uusan qabto confidence in recording-ka, waxaan kula talinaynaa inay aad isticmaasho text-to-speech feature-ka CapCut:Ka dhig โ€œๆ–‡ๅญ—โ€ (Text) โ†’ โ€œๆ–ฐๅปบๆ–‡ๆœฌโ€ (New Text) โ†’ gelo script-ka Somali ee scene-ka.
Ka dhig โ€œๆ–‡ๆœฌ่ฝฌ่ฏญ้Ÿณโ€ (Text to Speech) โ†’ dooro voice-ka Somali (haddii jiro) ama English oo mid caan ah (e.g., โ€œMale Deepโ€ for Scene 1).
Adjust speed-ka voice-ka โ†’ ka dhig โ€œๅบ”็”จโ€ (Apply) โ†’ drag to timeline-ka.

๐ŸŽฌ Step 2: Editing Each Scene โ€“ Qaabaynta Muuqaallada Mid Kasta
Waxaan kula talinaynaa inaan qaabayno scene-ka mid kasta si ay ugu muuqato Zack D. Films style โ€“ waxaan sameeyay table oo aad u fudud si aad uga heliso tafsiilada, oo leh ๅ‰ชๆ˜ ๆ“ไฝœ (CapCut Actions), ๅ…ทไฝ“ๆญฅ้ชค (Detailed Steps), iyo Zack D. Style ๆŠ€ๅทง (Tips). Dhammaan ๆ“ไฝœ waa in Somali, iyo Chinese terms-ka CapCut-ka, si ay ugu fududahay inay aad uga heliso.
Scene 1: Hook (0:00-0:10) โ€“ Scene-ka Soo Jiidasho
Scene-kan waa mid aad u muhiim ah inay soo jiidato ่ง‚ไผ— - ka โ€“ waxaan rabnaa inay uu ahaato fast-paced, cinematic, iyo xiisaha leh. Waxaa ka kooban cuts sare, transition-ka zoom blur, text overlay red, sound effect-ka bass thump, iyo muusig epic.
ๅ‰ชๆ˜ ๆ“ไฝœ (CapCut Action)
ๅ…ทไฝ“ๆญฅ้ชค (Detailed Steps)
Zack D. Style ๆŠ€ๅทง (Tips)
Cuts (ๅ‰ช่พ‘)

  1. Select Scene 1 clip (long-press clip-ka โ†’ ha ahaato highlighted).>2. Ka dhig โ€œๅˆ†ๅ‰ฒโ€ (Split) โ€“ xagga hoose ee screen-ka (waa icon-ka scissors).3. Split clip-ka into 2 segments (0:00-0:05, 0:05-0:10) โ€“ kulla segment waa 0.5 ilbiriqsii. Haddii aad wantahay inay ugu muuqata fast-paced, split again into 4 segments (0:00-0:02.5, 0:02.5-0:05, 0:05-0:07.5, 0:07.5-0:10) โ€“ waxaa dhamaantay inay ugu muuqata mid bilowga leh.5. Haddii aad wax segment aan u baahanayn lahayn, ka dhig โ€œๅˆ ้™คโ€ (Delete) โ€“ ha ahaato mid la raacsan voiceover-ka.
    Zack D. Films waxay ku leeyihiin cuts sare โ€“ ha ahaato kulla shot aan ka badan 1 ilbiriqsii. Split-ka ayaa kaa caawinaya inay ugu soo dhawaaqaan xamaasiyadda, iyo inay ่ง‚ไผ— - ka aan ka dhami doono. Marka aad split gareyso, ha la raacsano ่Š‚ๅฅ - ka voiceover-ka โ€“ tusaale: โ€œJawaabtu waxay ku jirtaaโ€ waa mid kasta oo ugu yaraan 0.5 ilbiriqsii.
    Transitions (่ฝฌๅœบ)
  2. Select the white square between the two segments (่ฝฌๅœบ็‚น โ€“ Transition Point) โ€“ waa midabka cad oo ku yaal timeline-ka.. Ka dhig โ€œ่ฝฌๅœบโ€ (Transitions) โ€“ xagga hoose ee screen-ka (waa icon-ka arrows).3. Search โ€œ็ผฉๆ”พๆจก็ณŠโ€ (Zoom Blur) โ€“ type in search bar-ka (waa mid caan ah oo aad ugu muuqata). Select โ€œ็ผฉๆ”พๆจก็ณŠโ€ โ†’ ka dhig โ€œๅบ”็”จโ€ (Apply) โ€“ waxaa dhamaantay inay ugu qarsoon yihiin. Adjust duration to 0.3 seconds โ€“ ka dhig โ€œๆ—ถ้•ฟโ€ (Duration) โ†’ 0.3 (waa mid bilowga leh).
    0:07 ilbiriqsii marka, ha u hagaajiso speed-ka transition-ka: select transition โ†’ ka dhig โ€œ้€Ÿๅบฆโ€ (Speed) โ†’ 1.5x. Waxaa dhamaantay inay ugu muuqata mid bilowga leh, sida Zack D. Films-ka. Transition-ka waa mid aan ka complex, laakiin xiisaha leh โ€“ ha ahaato mid aan ka dhami doono muuqaalka.
    Text Overlay (ๆ–‡ๅญ—)
  3. Ka dhig โ€œๆ–‡ๅญ—โ€ (Text) โ€“ xagga hoose ee screen-ka (waa icon-ka qoraalka). Ka dhig โ€œๆ–ฐๅปบๆ–‡ๆœฌโ€ (New Text) โ†’ gelo qoraalka: โ€œGOWS-DAMBEED โ€“ XANUUNKA TAARIikhDA!โ€.. Adjust font: ka dhig โ€œๅญ—ไฝ“โ€ (Font) โ†’ dooro โ€œ้ป‘ไฝ“โ€ (Bold Font) โ€“ waa mid caan ah oo ugu muuqata, iyo mid aan ka qaldan. Adjust color: ka dhig โ€œ้ขœ่‰ฒโ€ (Color) โ†’ dooro โ€œ็บข่‰ฒโ€ (Red) โ€“ waxaa dhamaantay inay soo jiidato ่ง‚ไผ— - ka, iyo inay muuqan doonaa xiisaha.. Add animation: ka dhig โ€œๅŠจ็”ปโ€ (Animation) โ†’ search โ€œๅผนๅ‡บโ€ (Pop) โ†’ select โ†’ adjust duration to 0.8 seconds (waa mid bilowga leh).6. Drag text to 0:08-0:09 timepoint โ€“ ha la raacsano voiceover-ka (tusaale: โ€œXANUUNKA TAARIikhDA!โ€ waa mid la raacsan โ€œtaariikhda awoowayaasheenโ€).
    Qoraalkan waa mid xiisaha leh โ€“ ha ahaato mid aad ugu muuqata, laakiin aan ka badan 1 ilbiriqsii. Color-ka red ayaa kaa caawinaya inay soo jiidato, sida Zack D. Films-ka โ€“ waxaan rabnaa inay ่ง‚ไผ— - ka ay aragto qoraalkan isla marka voiceover-ka dhamaato. Font-ka bold ayaa kaa caawinaya inay ugu muuqata, xitaa haddii aad phone-kaaga ugu fiicantahay.
    Sound Effects (้Ÿณๆ•ˆ)
  4. Ka dhig โ€œ้Ÿณ้ข‘โ€ (Audio) โ†’ โ€œ้Ÿณๆ•ˆโ€ (Sound Effects) โ€“ waa icon-ka speaker.2. Search โ€œไฝŽ้Ÿณ้‡ๅ‡ปโ€ (Low Bass Thump) โ€“ type in search bar-ka (waa mid caan ah oo xiisaha leh).3. Select sound effect-ka โ†’ drag to 0:05 timepoint (marka gows-dambeedku casaan u iftiimayo) โ€“ ha la raacsano ่ง†่ง‰ - ka.>4. Adjust volume: select sound effect โ†’ ka dhig โ€œ้Ÿณ้‡โ€ (Volume) โ†’ 30% (waa mid aan ka qaldan voiceover-ka).
    Sound effect-ka waa mid xiisaha leh, laakiin ha ahaato mid aan ka qaldan voiceover-ka. 30% volume ayaa ku habboon inay isku dhafan yihiin โ€“ haddii aad ugaarato, ่ง‚ไผ— - ka ayaa aadan maqal doona voiceover-ka. Sound effect-ka ayaa kaa caawinaya inay muuqan doonaa xiisaha gows-dambeedka, sida Zack D. Films-ka.
    Music (้Ÿณไน)
  5. Ka dhig โ€œ้Ÿณ้ข‘โ€ (Audio) โ†’ โ€œ้Ÿณไนๅบ“โ€ (Music Library) โ€“ waa icon-ka muusigga.2. Search โ€œๅฒ่ฏ—็ฎกๅผฆไนโ€ (Epic Orchestral) โ€“ waxaan rabnaa muusig epic oo xiisaha leh.. Select 10-second clip โ†’ ka dhig โ€œ่ฃๅ‰ชโ€ (Trim) โ†’ adjust to 0:00-0:10 (waa mid la raacsan scene-ka).4. Add fade-out: select music โ†’ ka dhig โ€œๆทกๅ‡บโ€ (Fade Out) โ†’ 0.2 seconds (waa mid smooth).5. At 0:08 timepoint, adjust volume to 80%: select music โ†’ ka dhig โ€œ้Ÿณ้‡โ€ (Volume) โ†’ drag slider to 80% (waa mid build-up).
    Muusig-ka ayaa kaa caawinaya inay mood-ka cinematic oo xiisaha leh. Fade-out ayaa kaa caawinaya inay ugu soo dhawaaqaan smooth, iyo inay aan ka dhami doono transition-ka Scene 2. 80% volume at 0:08 ayaa kaa caawinaya inay build-up yeesho xiisaha, sida Zack D. Films-ka โ€“ ha ahaato mid aan ka qaldan voiceover-ka.

Scene 2: Dadkii Hore (0:10-0:25) โ€“ Scene-ka Awoowayaasha
Scene-kan waa mid vintage epic โ€“ waxaan rabnaa inay muuqan doonaa mid qadima ah, gritty, iyo adventurous. Waxaa ka kooban transition-ka wipe right, color grade sepia, sound effects raw meat tearing iyo cave echo, iyo muusig primitive drum beat + epic strings.
ๅ‰ชๆ˜ ๆ“ไฝœ (CapCut Action)
ๅ…ทไฝ“ๆญฅ้ชค (Detailed Steps)
Zack D. Style ๆŠ€ๅทง (Tips)
Transitions (่ฝฌๅœบ)

  1. Select white square between Scene 1 and Scene 2 (่ฝฌๅœบ็‚น) โ€“ waa midabka cad oo ku yaal timeline-ka.2. Ka dhig โ€œ่ฝฌๅœบโ€ โ†’ search โ€œๅ‘ๅณๆ“ฆ้™คโ€ (Wipe Right) โ€“ type in search bar-ka. โ†’ adjust duration to 0.4 seconds โ†’ ka dhig โ€œๅบ”็”จโ€.>4. Haddii aad wantahay inay smooth yeesho, ha u furgeli music-ka Scene 1: select music โ†’ ka dhig โ€œๆทกๅ‡บโ€ โ†’ 0.2 seconds.
    Transition-ka โ€œๅ‘ๅณๆ“ฆ้™คโ€ ayaa kaa caawinaya inay isku dhafan yihiin scene-ka hore iyo dambe. Muusig-ka hore ha u fade-out 0.2 seconds, iyo muusig-ka dambe ha u fade-in 0.2 seconds โ€“ waxaa dhamaantay smooth transition, sida Zack D. Films-ka. Transition-ka waa mid aan ka complex, laakiin vintage epic vibe leh โ€“ ha ahaato mid la raacsan scene-ka.
    Color Grade (่ฐƒ่‰ฒ)
  2. Select Scene 2 clip (long-press) โ†’ ha ahaato highlighted.. Ka dhig โ€œ่ฐƒ่Š‚โ€ (Adjust) โ†’ โ€œๆปค้•œโ€ (Filter) โ€“ waa icon-ka palette.3. Search โ€œๅคๅคๆฃ•่ค่‰ฒโ€ (Warm Sepia) โ†’ select โ†’ ka dhig โ€œๅบ”็”จโ€ โ€“ waxaa dhamaantay inay muuqan doonaa mid qadima ah.4. Adjust parameters: ka dhig โ€œ่ฐƒ่Š‚โ€ โ†’ โ€œๅฏนๆฏ”ๅบฆโ€ (Contrast) +15 โ†’ โ€œ้ฅฑๅ’Œๅบฆโ€ (Saturation) +10 โ€“ waxaa dhamaantay inay details-ka muuqan doonaa.
    Sepia tone ayaa kaa caawinaya inay muuqan doonaa mid qadima ah, sida Zack D. Films-ka (vintage epic vibe). Contrast sare ayaa kaa caawinaya inay details-ka muuqan doonaa (e.g., dirt on cavemanโ€™s face), iyo saturation +10 ayaa kaa caawinaya inay midabada ahaadaan la xiran mood-ka. Haddii aad uusan qabto sepia filter, ha isticmaash โ€œๅคๅคโ€ (Vintage) filter โ€“ waxaa dhamaantay inay ugu fiicantahay.
    Sound Effects (้Ÿณๆ•ˆ)
  3. Ka dhig โ€œ้Ÿณ้ข‘โ€ โ†’ โ€œ้Ÿณๆ•ˆโ€ โ†’ search โ€œ็”Ÿ่‚‰ๆ’•่ฃ‚โ€ (Raw Meat Tearing) โ€“ type in search bar-ka. sound effect-ka โ†’ drag to 0:13 timepoint (marka caveman-ku calaalinaya hilibka) โ€“ ha la raacsano ่ง†่ง‰ - ka.. Search โ€œๆดž็ฉดๅ›žๅฃฐโ€ (Cave Echo) โ†’ select โ†’ drag to 0:18 timepoint (marka caveman-ku ๅžๅ’ฝ hilibka) โ€“ ha la raacsano ่ง†่ง‰ - ka.. Adjust volume to 25% for both effects โ€“ select each effect โ†’ ka dhig โ€œ้Ÿณ้‡โ€ โ†’ 25%.
    Sound effects-ka ayaa kaa caawinaya inay muuqan doonaa mid dhabta ah โ€“ โ€œ็”Ÿ่‚‰ๆ’•่ฃ‚โ€ ayaa kaa caawinaya inay la xiran yihiin calaamadda, iyo โ€œๆดž็ฉดๅ›žๅฃฐโ€ ayaa kaa caawinaya inay muuqan doonaa meel qadima ah (cave). 25% volume ayaa ku habboon inay isku dhafan yihiin, laakiin aan ka qaldan voiceover-ka. Zack D. Films waxay isticmaashaa sound effects dhabta ah, si ay ugu sameeyaan video-ga mid realistic.
    Music (้Ÿณไน)
  4. Ka dhig โ€œ้Ÿณ้ข‘โ€ โ†’ โ€œ้Ÿณไนๅบ“โ€ โ†’ search โ€œๅŽŸๅง‹้ผ“็‚นโ€ (Primitive Drum Beat) โ€“ waxaan rabnaa muusig qadima ah.2. Select 15-second clip โ†’ ka dhig โ€œ่ฃๅ‰ชโ€ โ†’ 0:10-0:25 (waa mid la raacsan scene-ka).>3. Search โ€œๅฒ่ฏ—ๅผฆไนโ€ (Epic Strings) โ†’ select 15-second clip โ†’ ka dhig โ€œ่ฃๅ‰ชโ€ โ†’ 0:10-0:25.4. Merge the two music clips: select both โ†’ ka dhig โ€œๅˆๅนถโ€ (Merge) โ€“ waxaa dhamaantay inay isku dhafan yihiin. At 0:22 timepoint, adjust volume to 90%: select music โ†’ ka dhig โ€œ้Ÿณ้‡โ€ โ†’ 90% (waa mid build-up).
    Muusig-ka โ€œๅŽŸๅง‹้ผ“็‚นโ€ + โ€œๅฒ่ฏ—ๅผฆไนโ€ ayaa kaa caawinaya inay mood-ka adventurous oo epic. 90% volume at 0:22 ayaa kaa caawinaya inay build-up yeesho xiisaha, sida Zack D. Films-ka โ€“ ha ahaato mid aan ka qaldan voiceover-ka (voiceover volume waa 100%). Muusig-ka waa mid la raacsan scene-ka โ€“ โ€œๅŽŸๅง‹้ผ“็‚นโ€ ayaa kaa caawinaya inay la xiran yihiin dadkii hore, iyo โ€œๅฒ่ฏ—ๅผฆไนโ€ ayaa kaa caawinaya inay la xiran yihiin qaabaynta cinematic.

Scene 3: Is-bedelka (0:25-0:40) โ€“ Scene-ka Evolution
Scene-kan waa mid scientific oo dynamic โ€“ waxaan rabnaa inay muuqan doonaa is-bedelka aadanaha, iyo ka dambeeya dhibaatada. Waxaa ka kooban transition-ka cut to zoom, color grade cool blue โ†’ warm orange, sound effects whoosh iyo bone crackle, iyo muusig time lapse.
ๅ‰ชๆ˜ ๆ“ไฝœ (CapCut Action)
ๅ…ทไฝ“ๆญฅ้ชค (Detailed Steps)
Zack D. Style ๆŠ€ๅทง (Tips)
Transitions (่ฝฌๅœบ)

  1. Select white square between Scene 2 and Scene 3 (่ฝฌๅœบ็‚น). Ka dhig โ€œ่ฝฌๅœบโ€ โ†’ search โ€œๆ”พๅคงๅ‰ชๅˆ‡โ€ (Cut to Zoom) โ€“ type in search bar-ka. Select โ†’ adjust duration to 0.5 seconds โ†’ ka dhig โ€œๅบ”็”จโ€.4. At 0:30 timepoint, drag the zoom to skull-ka ็‰นๅ†™ (close-up): select clip โ†’ ka dhig โ€œ็ผฉๆ”พโ€ (Zoom) โ†’ adjust slider to 1.2x (waa mid details-ka leh).
    Transition-ka โ€œๆ”พๅคงๅ‰ชๅˆ‡โ€ ayaa kaa caawinaya inay muuqan doonaa mid dynamic, sida Zack D. Films-ka. Zoom to close-up ayaa kaa caawinaya inay soo jiidato details-ka evolution-ka (e.g., jawbone shrinking). Transition-ka waa mid bilowga leh, laakiin aan ka complex โ€“ ha ahaato mid la raacsan ่Š‚ๅฅ - ka muusig-ka.
    Color Grade (่ฐƒ่‰ฒ)
  2. Select Scene 3 clip (time-lapse) โ†’ long-press โ†’ ha ahaato highlighted.. Ka dhig โ€œ่ฐƒ่Š‚โ€ โ†’ โ€œๆปค้•œโ€ โ†’ search โ€œๅ†ท่“่‰ฒโ€ (Cool Blue) โ†’ apply to first half (0:25-0:32) โ€“ waxaa dhamaantay inay muuqan doonaa mid qadima ah.3. For the second half (0:32-0:40), search โ€œๆš–ๆฉ™่‰ฒโ€ (Warm Orange) โ†’ apply โ€“ waxaa dhamaantay inay muuqan doonaa mid casriga ah. Add linear mask to blend: select clip โ†’ ka dhig โ€œ่’™็‰ˆโ€ (Mask) โ†’ โ€œ็บฟๆ€ง่’™็‰ˆโ€ (Linear Mask) โ†’ rotate 90ยฐ (drag circle-ka) โ†’ drag mask line to 0:32 โ†’ adjust feather to 20 (ka dhig โ€œ็พฝๅŒ–โ€ โ†’ 20).
    Color gradient (cool blue โ†’ warm orange) ayaa kaa caawinaya inay muuqan doonaa is-bedelka from prehistoric to modern โ€“ waxaa dhamaantay cinematic, sida Zack D. Films-ka. Mask-ka ayaa kaa caawinaya inay isku dhafan yihiin midabada, laakiin aan ka dhami doono muuqaalka. Cool blue ayaa kaa caawinaya inay muuqan doonaa dadkii hore, iyo warm orange ayaa kaa caawinaya inay muuqan doonaa dadka casriga ah.
    Sound Effects (้Ÿณๆ•ˆ)
  3. Ka dhig โ€œ้Ÿณ้ข‘โ€ โ†’ โ€œ้Ÿณๆ•ˆโ€ โ†’ search โ€œๅ’ปโ€ (Whoosh) โ€“ type in search bar-ka.. Select โ†’ drag to 0:27 timepoint (transition-ka) โ€“ ha la raacsano ่ง†่ง‰ - ka.. Search โ€œ้ชจ้ชผๆ”ถ็ผฉโ€ (Bone Crackle) โ†’ select โ†’ drag to 0:33 timepoint (marka jawbone-ku yaraada) โ€“ ha la raacsano ่ง†่ง‰ - ka. Adjust volume to 35% (Whoosh) and 28% (Bone Crackle) โ€“ select each effect โ†’ ka dhig โ€œ้Ÿณ้‡โ€ โ†’ adjust.
    โ€œWhooshโ€ sound effect ayaa kaa caawinaya inay muuqan doonaa speed-ka evolution-ka โ€“ waxaa dhamaantay inay ugu muuqata mid bilowga leh. โ€œBone Crackleโ€ ayaa kaa caawinaya inay la xiran yihiin is-bedelka daanka โ€“ waxaa dhamaantay inay muuqan doonaa mid realistic. Zack D. Films waxay isticmaashaa sound effects dhabta ah si ay ugu sameeyaan video-ga mid immersive.
    Music (้Ÿณไน)
  4. Ka dhig โ€œ้Ÿณ้ข‘โ€ โ†’ โ€œ้Ÿณไนๅบ“โ€ โ†’ search โ€œๆ—ถ้—ดๆต้€โ€ (Time Lapse) โ€“ waxaan rabnaa muusig dynamic oo la raacsan evolution-ka.2. Select 15-second clip โ†’ ka dhig โ€œ่ฃๅ‰ชโ€ โ†’ 0:25-0:40 (waa mid la raacsan scene-ka).>3. Adjust volume to 75% โ€“ select music โ†’ ka dhig โ€œ้Ÿณ้‡โ€ โ†’ 75% (waa mid aan ka qaldan voiceover-ka).
    Muusig-ka โ€œๆ—ถ้—ดๆต้€โ€ ayaa kaa caawinaya inay mood-ka dynamic yeesho โ€“ waxaa dhamaantay inay la raacsan yihiin ่Š‚ๅฅ - ka time-lapse-ka. 75% volume ayaa ku habboon inay isku dhafan yihiin, laakiin aan ka qaldan voiceover-ka. Zack D. Films waxay isticmaashaa muusig la raacsan ่ง†่ง‰ - ka โ€“ tusaale: muusig-ka ayaa kordhaya marka jawbone-ku yaraada.

Scene 4: Dhibaatada (0:40-0:55) โ€“ Scene-ka Dhibaatada
Scene-kan waa mid tense oo xiisaha leh โ€“ waxaan rabnaa inay muuqan doonaa dhibaatada gows-dambeedka, iyo ka dambeeya xanuun. Waxaa ka kooban transition-ka cut to black, text overlay bright red, sound effect-ka heartbeat iyo throb, iyo muusig tense string staccato.
ๅ‰ชๆ˜ ๆ“ไฝœ (CapCut Action)
ๅ…ทไฝ“ๆญฅ้ชค (Detailed Steps)
Zack D. Style ๆŠ€ๅทง (Tips)
Transitions (่ฝฌๅœบ)

  1. Select white square between Scene 3 and Scene 4 (่ฝฌๅœบ็‚น).2. Ka dhig โ€œ่ฝฌๅœบโ€ โ†’ search โ€œ้ป‘ๅฑโ€ (Cut to Black) โ†’ duration 0.2 seconds โ†’ ka dhig โ€œๅบ”็”จโ€.3. After black screen, add โ€œๆทกๅ…ฅโ€ (Fade In) to Scene 4 clip: select Scene 4 โ†’ ka dhig โ€œๅŠจ็”ปโ€ โ†’ โ€œๆทกๅ…ฅโ€ โ†’ 0.3 seconds (waa mid smooth).
    โ€œCut to Blackโ€ ayaa kaa caawinaya inay mood-ka tense yeesho โ€“ waxaa dhamaantay inay ่ง‚ไผ— - ka ay xiisaha qaataan. Fade-in ayaa kaa caawinaya inay soo jiidato dhibaatada, sida Zack D. Films-ka (suspenseful build-up). Transition-ka waa mid xiisaha leh, laakiin aan ka complex โ€“ ha ahaato mid la raacsan mood-ka scene-ka.
    Text Overlay (ๆ–‡ๅญ—)
  2. Ka dhig โ€œๆ–‡ๅญ—โ€ โ†’ โ€œๆ–ฐๅปบๆ–‡ๆœฌโ€ โ†’ gelo qoraalka: โ€œDHIBATADA โ€“ GOWS-DAMBEED QALOOC!โ€.>2. Font: โ€œ้ป‘ไฝ“โ€ โ†’ color: โ€œไบฎ็บข่‰ฒโ€ (Bright Red) โ€“ waxaa dhamaantay inay soo jiidato.. Animation: search โ€œ้—ช็ƒโ€ (Pulse) โ†’ select โ†’ duration 1 second (waa mid xiisaha leh).4. Drag text to 0:48-0:49 timepoint โ€“ ha la raacsano voiceover-ka (โ€œisku dhaga ciridkaโ€).
    Qoraalkan waa mid xiisaha leh โ€“ โ€œ้—ช็ƒโ€ animation ayaa kaa caawinaya inay muuqan doonaa xanuunka, iyo color)

Sign up or log in to comment

Models citing this paper 2

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2510.22200 in a dataset README.md to link it from this page.

Spaces citing this paper 103

Collections including this paper 4