Post
2309
Fine-tune Gemma3n on videos with audios inside with Colab A100 π₯
Just dropped the notebook where you can learn how to fine-tune Gemma3n on images+audio+text at the same time!
keep in mind, it's made for educational purposes π«‘ we do LoRA, audio resampling & video downsampling to be able to train <40GB VRAM
stretch modalities and unfreeze layers as you wish! ππ» merve/smol-vision
Just dropped the notebook where you can learn how to fine-tune Gemma3n on images+audio+text at the same time!
keep in mind, it's made for educational purposes π«‘ we do LoRA, audio resampling & video downsampling to be able to train <40GB VRAM
stretch modalities and unfreeze layers as you wish! ππ» merve/smol-vision