Ambarish Jash's picture

6 2

Ambarish Jash

ajash

ajash

AI & ML interests

NLP / LLM

Organizations

None yet

ajash's activity

New activity in togethercomputer/LLaMA-2-7B-32K about 1 year ago

Using the Accelerate API to train models on multiple GPUs

#28 opened about 1 year ago by

New activity in ajash/Amazon-lm about 1 year ago

Librarian Bot: Add base_model information to model

#1 opened about 1 year ago by

New activity in ajash/Amazon-lm-10k about 1 year ago

Librarian Bot: Add base_model information to model

#1 opened about 1 year ago by

New activity in togethercomputer/LLaMA-2-7B-32K about 1 year ago

Installing ! pip install git+https://github.com/HazyResearch/flash-attention.git#subdirectory=csrc/rotary but flah_llama still erroring out

#25 opened about 1 year ago by

Fine tune togethercomputer/LLaMA-2-7B-32K with LoRA

#22 opened about 1 year ago by

Fix RuntimeError: pad attn scores back to original query sequence length, instead of unpadded sequence length (i.e. no change).

#17 opened about 1 year ago by

Fine tune togethercomputer/LLaMA-2-7B-32K with LoRA

#22 opened about 1 year ago by

Fine tune togethercomputer/LLaMA-2-7B-32K with LoRA

#22 opened about 1 year ago by