Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
18
52
151
Krishna Kaasyap
KrishnaKaasyap
Follow
ltim's profile picture
victor's profile picture
Juanelopo's profile picture
4 followers
·
27 following
krishnakaasyap
krishnakaasyap.bsky.social
AI & ML interests
Test Time Training Multimodal & Inter-Modality Transfer Learning Mechanistic Interpretability Evolutionary Model Merging Swarm Intelligence of multiple models with different architectures and different algorithms MuZero approach to general tasks
Recent Activity
liked
a model
1 day ago
nari-labs/Dia-1.6B
liked
a model
3 days ago
baichuan-inc/Baichuan-M1-14B-Instruct
liked
a model
9 days ago
microsoft/bitnet-b1.58-2B-4T
View all activity
Organizations
KrishnaKaasyap
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
1 day ago
nari-labs/Dia-1.6B
Text-to-Speech
•
Updated
1 day ago
•
36.6k
•
•
886
liked
a model
3 days ago
baichuan-inc/Baichuan-M1-14B-Instruct
Updated
Feb 20
•
18.2k
•
56
liked
a model
9 days ago
microsoft/bitnet-b1.58-2B-4T
Text Generation
•
Updated
1 day ago
•
25.8k
•
743
upvoted
a
collection
19 days ago
Llama 4
Collection
Llama 4 release
•
10 items
•
Updated
19 days ago
•
447
liked
a model
29 days ago
Qwen/Qwen2.5-Omni-7B
Any-to-Any
•
Updated
9 days ago
•
218k
•
1.47k
liked
2 models
about 1 month ago
CohereLabs/c4ai-command-a-03-2025
Text Generation
•
Updated
9 days ago
•
11.1k
•
•
346
Qwen/QwQ-32B
Text Generation
•
Updated
Mar 11
•
658k
•
•
2.71k
New activity in
RekaAI/reka-flash-3
about 1 month ago
Context length and reasoning length?
1
#6 opened about 1 month ago by
KrishnaKaasyap
liked
a model
about 1 month ago
RekaAI/reka-flash-3
Updated
Mar 13
•
2.75k
•
365
liked
3 models
3 months ago
deepseek-ai/Janus-Pro-1B
Any-to-Any
•
Updated
Feb 1
•
35.2k
•
431
deepseek-ai/Janus-Pro-7B
Any-to-Any
•
Updated
Feb 1
•
203k
•
3.35k
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation
•
Updated
Feb 24
•
256k
•
•
666
New activity in
deepseek-ai/DeepSeek-R1
3 months ago
Is this the same as DeepSeek-R1 (Preview) mentioned on LiveCodeBench?
1
2
#10 opened 3 months ago by
KrishnaKaasyap
New activity in
deepseek-ai/DeepSeek-R1-Zero
3 months ago
Hail CCP!!! God bless Chyna!
19
8
#3 opened 3 months ago by
mnemojeet
Thank you deepseek
29
2
#8 opened 3 months ago by
teknium
liked
3 models
3 months ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
28 days ago
•
1.8M
•
•
12k
deepseek-ai/DeepSeek-R1-Zero
Text Generation
•
Updated
28 days ago
•
5.5k
•
902
MiniMaxAI/MiniMax-Text-01
Text Generation
•
Updated
7 days ago
•
7.77k
•
574
liked
a Space
4 months ago
Running
1.15k
1.15k
InstantCoder
🦀
Generate app code from ideas
liked
a dataset
4 months ago
PowerInfer/QWQ-LONGCOT-500K
Viewer
•
Updated
Dec 26, 2024
•
286k
•
315
•
122
Load more