Chew Kok Wah
chewkokwah
AI & ML interests
Open Domain Question Answering
Recent Activity
upvoted a collection about 11 hours ago
DeepSeek-R1-Distill Quantized
upvoted a paper 3 days ago
SIFT: Grounding LLM Reasoning in Contexts via Stickers
upvoted a paper 16 days ago
TransMLA: Multi-head Latent Attention Is All You Need
Organizations
chewkokwah's activity
[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
21
#15 opened 21 days ago by lewtun

License of Your Model
#4 opened about 1 month ago by chewkokwah
License of your model
1
#4 opened about 1 month ago by chewkokwah
May I know what Calibration Dataset and Tools you used to quantize the model?
#2 opened 2 months ago by chewkokwah