umangkaushik
ubermenchh
AI & ML interests
None yet
Recent Activity
liked a model about 17 hours ago
sarvamai/sarvam-m
liked a dataset about 1 month ago
nvidia/OpenMathReasoning
upvoted an article about 1 month ago
Gotchas in Tokenizer Behavior Every Developer Should Know
Organizations
Collections 2
spaces 21
models 33

ubermenchh/Qwen2.5-3B-open-r1-math • Text Generation • Updated • 3

ubermenchh/Qwen2.5-3B-open-r1-math-lora • Updated

ubermenchh/Qwen2.5-3B-openr1-math • Text Generation • Updated • 2

ubermenchh/Qwen2.5-0.5B-openr1-math • Updated

ubermenchh/llama3.1-8B-gsm8k-grpo • Updated • 1

ubermenchh/SmolLM2-SFT-sarvam-samvaad • Text Generation • Updated • 2

ubermenchh/SmolLM2-360M-r1-grpo-countdown • Updated

ubermenchh/SmolLM2-DPO-ultrafeedback-binarized-preferences • Text Generation • Updated • 3

ubermenchh/SmolLM2-DPO • Text Generation • Updated • 3

ubermenchh/SmolLM2-FT-the-smol-stack • Text Generation • Updated • 2