Aastha Varma

aastha6

AI & ML interests

Mechanistic Interpretability

Recent Activity

updated a model about 4 hours ago
aastha6/crosscoders-gemma-2-2b
published a model about 4 hours ago
aastha6/crosscoders-gemma-2-2b
View all activity

Organizations

Amazon Web Services's profile picture scikit-learn's profile picture

aastha6's activity

New activity in mistralai/Codestral-22B-v0.1 8 months ago

how to fine tune this model?

4
#16 opened 9 months ago by
leo009
New activity in mistralai/Codestral-22B-v0.1 9 months ago
New activity in TheBloke/Mistral-7B-Instruct-v0.1-GPTQ over 1 year ago
New activity in amazon/FalconLite2 over 1 year ago
New activity in Open-Orca/Mistral-7B-OpenOrca over 1 year ago
New activity in michaelfeil/ct2fast-flan-ul2 over 1 year ago

Not able to deploy in Sagemaker

2
#3 opened over 1 year ago by
aastha6
New activity in philschmid/gpt-j-6B-fp16-sharded over 1 year ago

Code to shard a model weights

#1 opened over 1 year ago by
aastha6