agentic
updated
NousResearch/Hermes-4-70B
Text Generation
•
71B
•
Updated
•
13.6k
•
•
164
unsloth/Kimi-K2-Instruct-0905-GGUF
1T
•
Updated
•
1.39k
•
52
Text-to-Image
•
Updated
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning
in Large Language Models
Paper
•
2509.09675
•
Published
•
28
NousResearch/Hermes-4-14B-FP8
Text Generation
•
15B
•
Updated
•
1.59k
•
15
NousResearch/Hermes-4-70B-FP8
Text Generation
•
71B
•
Updated
•
172
•
25
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos
Reinforcement Learning
•
8B
•
Updated
•
64
•
14
NousResearch/DeepHermes-Financial-Fundamentals-Prediction-Specialist-Atropos
Text Generation
•
8B
•
Updated
•
55
•
14
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos
Reinforcement Learning
•
8B
•
Updated
•
49
•
3
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos
Reinforcement Learning
•
8B
•
Updated
•
50
•
6
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos
Reinforcement Learning
•
8B
•
Updated
•
54
•
7
NousResearch/Hermes-4-405B-FP8
Text Generation
•
406B
•
Updated
•
447
•
20
deepseek-ai/DeepSeek-V3.1-Terminus
Text Generation
•
685B
•
Updated
•
21.5k
•
•
358
nvidia/NVIDIA-Nemotron-Nano-9B-v2-FP8
Text Generation
•
9B
•
Updated
•
6.99k
•
7
nvidia/nemocurator-fineweb-nemotron-4-edu-classifier
0.1B
•
Updated
•
2.98k
•
11
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-to-Text
•
236B
•
Updated
•
51.9k
•
360
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
Paper
•
2509.15566
•
Published
•
14
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and
Open Resources
Paper
•
2509.21268
•
Published
•
104
Text Generation
•
358B
•
Updated
•
18.7k
•
•
96
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation
•
685B
•
Updated
•
73.1k
•
•
934
NousResearch/Hermes-4-14B
Text Generation
•
425k
•
Updated
•
1.47k
•
107