1 16 56

MC

Dreamer312

Dreamer

AI & ML interests

NLP, CV, LLM, AGENT, RL

Recent Activity

commented on a paper 1 day ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

commented on a paper 1 day ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

upvoted a paper 2 days ago

Scaling Law for Quantization-Aware Training

View all activity

Organizations

None yet

Dreamer312's activity

upvoted a paper 2 days ago

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published 4 days ago • 61

upvoted a collection 2 days ago

Llama 4

Collection

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated 23 days ago • 46

upvoted a paper 4 days ago

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization

Paper • 2505.12346 • Published 6 days ago • 18

upvoted a paper 6 days ago

Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation

Paper • 2409.10262 • Published Sep 16, 2024 • 1

upvoted an article 10 days ago

Article

Mixture of Experts Explained

and 5 others •

Dec 11, 2023

• 636

upvoted a collection 10 days ago

Qwen3

Collection

40 items • Updated 3 days ago • 693

upvoted an article 30 days ago

Article

Proximal Policy Optimization (PPO)

•

Aug 5, 2022

• 40

upvoted an article about 1 month ago

Article

Merge Large Language Models with mergekit

•

Jan 9, 2024

• 119

upvoted an article 2 months ago

Article

Trace & Evaluate your Agent with Arize Phoenix

and 2 others •

Feb 28

• 40

upvoted an article 3 months ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

Jan 31

• 50

upvoted a paper 7 months ago

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Paper • 2404.13013 • Published Apr 19, 2024 • 32

upvoted 4 articles 9 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

and 2 others •

Aug 14, 2024

• 62

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

and 2 others •

Jul 18, 2024

• 58

Article

Preference Optimization for Vision Language Models

and 3 others •

Jul 10, 2024

• 74

Article

Docmatix - a huge dataset for Document Visual Question Answering

and 1 other •

Jul 18, 2024

• 73

upvoted a paper over 1 year ago

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 243