Kshitij Thakkar
PRO
kshitijthakkar
26 followers · 88 following
Mandark-droid
kshitij-thakkar-2061b924
AI & ML interests
AI observability + MoE efficiency engineer. Building tools that make GenAI traceable, measurable, and production-ready.
Recent Activity
liked a Space about 1 hour ago: ml-agent-explorers/efficient-optimizer-dashboard
published a model about 7 hours ago: kshitijthakkar/deepseek-v4-mini-1B-from-flash
published a model about 7 hours ago: kshitijthakkar/deepseek-v4-mini-300M-from-flash
kshitijthakkar's models (136)
kshitijthakkar/deepseek-v4-mini-1B-from-flash • Text Generation • 1B • Updated about 7 hours ago
kshitijthakkar/deepseek-v4-mini-300M-from-flash • Text Generation • 0.3B • Updated about 11 hours ago
kshitijthakkar/deepseek-v4-mini-6B-init • Text Generation • 8B • Updated about 13 hours ago
kshitijthakkar/deepseek-v4-mini-3B-init • Text Generation • 3B • Updated about 20 hours ago
kshitijthakkar/deepseek-v4-mini-1B-init • Text Generation • 1B • Updated about 22 hours ago
kshitijthakkar/deepseek-v4-mini-300M-init • Text Generation • 0.3B • Updated about 23 hours ago • 4
kshitijthakkar/loggenix-0.4b-traceverse-grpo-test • Text Generation • 0.4B • Updated Mar 28 • 10
kshitijthakkar/loggenix-moe-1b-pretrain • Text Generation • 1B • Updated Mar 14 • 60
kshitijthakkar/loggenix-moe-0.4B-0.2A-sft-s4 • Text Generation • 0.4B • Updated Mar 6 • 7
kshitijthakkar/loggenix-moe-0.4B-0.2A-sft-s4-checkpoints • Updated Mar 6
kshitijthakkar/loggenix-moe-0.4b-cpt-muon • Updated Mar 5
kshitijthakkar/qwen3.5-0.8b-moe-from-scratch • 3B • Updated Mar 4 • 3
kshitijthakkar/qwen3.5-moe-0.87B-d0.8B-cpt-muon • Updated Mar 4 • 1
kshitijthakkar/qwen3.5-moe-4.7B-d4B • Image-Text-to-Text • 5B • Updated Mar 4 • 42
kshitijthakkar/qwen3.5-moe-2.3B-d2B • Image-Text-to-Text • 3B • Updated Mar 4 • 8
kshitijthakkar/qwen3.5-moe-0.87B-d0.8B • Image-Text-to-Text • 1B • Updated Mar 4 • 21 • 1
kshitijthakkar/qwen3.5-from-scratch-tiny • 2B • Updated Mar 4 • 5
kshitijthakkar/qwen3.5-tiny-test • Image-Text-to-Text • 0.1B • Updated Feb 25 • 16
kshitijthakkar/LFM2.5-1.2B-Instruct-liquidchat-lora-cactus • Updated Feb 24
kshitijthakkar/LFM2.5-1.2B-Instruct-liquidchat-lora • Updated Feb 24
kshitijthakkar/loggenix-moe-0.4b-cpt-muon-v1 • Updated Feb 24
kshitijthakkar/lfm25-mobile-actions-cactus • Updated Feb 23
kshitijthakkar/loggenix-moe-0.4b-cactus • Updated Feb 23
kshitijthakkar/poc-pipeline-e2e-test • Text Generation • 39.4M • Updated Feb 23 • 7
kshitijthakkar/lfm-finetuned • Text Generation • Updated Feb 20
kshitijthakkar/LFM2.5-1.2B-Instruct-mobile-actions • Text Generation • Updated Feb 20 • 3
kshitijthakkar/moe-1083m-781m-16x8-8L-large-moe-1.3b-bs4-ctx1024 • Updated Feb 7
kshitijthakkar/moe-1083m-781m-16x8-8L-large-moe-1.3b-bs2-ctx2048 • Updated Feb 7
kshitijthakkar/moe-1083m-781m-16x8-8L-large-moe-1.3b-bs2-ctx1024 • Updated Feb 7
kshitijthakkar/moe-1083m-781m-16x8-8L-large-moe-1.3b-bs1-ctx2048 • Updated Feb 7