Kristoffer Rolf Deinoff

gatepoet

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

upvoted a paper 7 days ago

High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning

liked a model 12 days ago

agentica-org/DeepSWE-Preview

Organizations

None yet

upvoted a paper about 2 hours ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published 7 days ago • 32

upvoted a paper 7 days ago

High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning

Paper • 2507.05920 • Published 8 days ago • 11

liked 4 models 12 days ago

liked a model 13 days ago

Cactus-Compute/Qwen3-embedding-600m-GGUF

0.6B • Updated 29 days ago • 56 • 1

liked a model 14 days ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated May 29 • 347k • 871

upvoted a paper 28 days ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published 30 days ago • 39

liked 2 models 29 days ago

wavespeed/Wan2.1-VACE-14B-bf16

Image-to-Video • Updated about 1 month ago • 326 • 1

mistralai/Magistral-Small-2506

Text Generation • 24B • Updated 6 days ago • 71k • 573

upvoted a paper about 1 month ago

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Paper • 2506.09790 • Published Jun 11 • 52

liked a model about 1 month ago

mistralai/Mistral-Small-3.1-24B-Base-2503

24B • Updated May 23 • 9.72k • 242

upvoted a paper 2 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 81

liked a Space 2 months ago

📈 LLM Model VRAM Calculator

Space • Calculate VRAM requirements for running large language models • 471

liked 2 models 2 months ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated May 21 • 3.22M • 452

nari-labs/Dia-1.6B

Text-to-Speech • 2B • Updated Jun 1 • 114k • 2.63k

upvoted a paper 2 months ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

liked 2 models 2 months ago

mradermacher/WebThinker-QwQ-32B-i1-GGUF

33B • Updated May 2 • 318 • 2

allura-org/TQ2.5-14B-Aletheia-v1

Text Generation • 15B • Updated Dec 20, 2024 • 31 • 7
