Ambarish Jash
ajash
AI & ML interests
NLP / LLM
Organizations
None yet
ajash's activity
Using the Accelerate API to train models on multiple GPUs
8
#28 opened about 1 year ago by ajash
Librarian Bot: Add base_model information to model
#1 opened about 1 year ago by librarian-bot
Installing ! pip install git+https://github.com/HazyResearch/flash-attention.git#subdirectory=csrc/rotary but flash_llama still erroring out
4
#25 opened about 1 year ago by ajash
Fine tune togethercomputer/LLaMA-2-7B-32K with LoRA
7
#22 opened about 1 year ago by ajash
Fix RuntimeError: pad attn scores back to original query sequence length, instead of unpadded sequence length (i.e. no change).
1
#17 opened about 1 year ago by Birchlabs