Running on GPU via HF transformers
#1
by
sudhir2016
- opened
Runs out of memory on free tier Google Colab.
As suggested by Eric Alcaide I tried quantization with Hugging Face Quanto. It works fine now. Thanks to @dacorvo for the excellent blog post on Quanto.
sudhir2016
changed discussion status to
closed