Aastha Varma
aastha6
AI & ML interests
Mechanistic Interpretability
Recent Activity
updated
a model
about 4 hours ago
aastha6/crosscoders-gemma-2-2b
published
a model
about 4 hours ago
aastha6/crosscoders-gemma-2-2b
updated
a dataset
1 day ago
aastha6/pile-lmsys-mix-500k-tokenized-qwen2.5
Organizations
aastha6's activity
Facing error while converting to HF
1
#2 opened 4 months ago
by
aastha6
how to fine tune this model?
4
#16 opened 9 months ago
by
leo009

How to load in multi-gpu instance ?
6
#19 opened 9 months ago
by
aastha6
Cuda error for MAX_TOTAL_TOKENS = 8192
1
#5 opened over 1 year ago
by
aastha6
Trying to deploy this model with vllm in Sagemaker
#2 opened over 1 year ago
by
aastha6
Not able to launch using TGI in Sagemaker
#11 opened over 1 year ago
by
aastha6
Not able to deploy in Sagemaker
2
#3 opened over 1 year ago
by
aastha6
Code to shard a model weights
#1 opened over 1 year ago
by
aastha6