Extreme Training efficiency? #4
by aaronday3 - opened
Hi there, great model!
What caught my attention is that you trained a rank-64 QLoRA for a 70B model on 4x 4090s, which together have around 96GB of VRAM.
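(For intuition on why that's surprising but plausible, here's my own rough back-of-envelope memory estimate. The quantization and adapter-size numbers are my assumptions for illustration, not figures from your config.)

```python
# Rough VRAM estimate for a rank-64 QLoRA on a 70B model.
# All numbers are back-of-envelope assumptions, not measurements.

GB = 1024**3

n_params = 70e9              # base model parameters
bytes_per_param_4bit = 0.5   # NF4 quantization, ignoring quant constants

base_weights = n_params * bytes_per_param_4bit / GB   # ~33 GiB

# Assume LoRA adapters come to roughly ~1% of the base parameter count
# (the exact figure depends on rank and which modules are targeted).
adapter_params = 0.01 * n_params
adapter_weights = adapter_params * 2 / GB   # bf16 adapter weights
adapter_grads   = adapter_params * 2 / GB   # bf16 gradients
adam_states     = adapter_params * 8 / GB   # fp32 Adam m and v

total_static = base_weights + adapter_weights + adapter_grads + adam_states
print(f"static memory  ~ {total_static:.0f} GiB")        # ~40 GiB
print(f"activation room ~ {96 - total_static:.0f} GiB")  # across 4x 24 GiB
```

So the quantized weights plus adapter/optimizer state leave on the order of 50+ GiB for activations across the four cards, which pipeline parallelism plus activation checkpointing could plausibly fit at modest batch sizes.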
This is extremely efficient, and I was wondering whether the same is possible in axolotl, or whether qlora-pipe has special VRAM optimizations that axolotl doesn't.
I reviewed the config you posted in the other discussion, and I have a question: did you build qlora-pipe specifically for VRAM efficiency? Is it more efficient than axolotl?
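(For concreteness, this is roughly what I mean by a rank-64 QLoRA setup, sketched with the Hugging Face peft/bitsandbytes stack. The hyperparameters and target modules are my own guesses for a Llama-style model, not taken from your posted config; axolotl and qlora-pipe express the same thing in their own config files.)

```python
import torch
from transformers import BitsAndBytesConfig
from peft import LoraConfig

# 4-bit NF4 quantization of the frozen base weights (the "Q" in QLoRA).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Rank-64 LoRA adapters; the module list below is an assumption
# for a Llama-style architecture.
lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    task_type="CAUSAL_LM",
)
```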