Eni Grand

Enigrand

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

aoi-ot/VibeVoice-Large

Organizations

upvoted a paper about 9 hours ago

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Paper • 2509.03867 • Published 3 days ago • 161

upvoted a paper about 13 hours ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published 2 days ago • 45

upvoted a paper 5 days ago

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published 9 days ago • 8

upvoted a paper 10 days ago

Hermes 4 Technical Report

Paper • 2508.18255 • Published 12 days ago • 34

upvoted a paper 11 days ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published 12 days ago • 179

upvoted a collection 11 days ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated 9 days ago • 85

upvoted a paper 23 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published about 1 month ago • 170

upvoted a paper 24 days ago

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Paper • 2508.07976 • Published 27 days ago • 48

upvoted 6 papers about 1 month ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 294

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 234

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 128

MetaCLIP 2: A Worldwide Scaling Recipe

Paper • 2507.22062 • Published Jul 29 • 25

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published Jul 28 • 31

Small Batch Size Training for Language Models: When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful

Paper • 2507.07101 • Published Jul 9 • 3

upvoted 3 papers about 2 months ago

Voxtral

Paper • 2507.13264 • Published Jul 17 • 25

LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers

Paper • 2507.04404 • Published Jul 6 • 21

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

Paper • 2507.08771 • Published Jul 11 • 9

upvoted a collection about 2 months ago

MetaStone-S1

Collection

The open-source model of MetaStone-S1. • 4 items • Updated Jul 30 • 10

upvoted a paper about 2 months ago

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2 • 106

upvoted a collection about 2 months ago

🧠 SmolLM3

Collection

Smol, multilingual, long-context reasoner • 12 items • Updated Aug 5 • 72
