24 284 94

Eni Grand

Enigrand

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

upvoted a paper about 18 hours ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

liked a model 3 days ago

aoi-ot/VibeVoice-Large

View all activity

Organizations

upvoted a paper about 14 hours ago

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Paper • 2509.03867 • Published 3 days ago • 165

upvoted a paper about 18 hours ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published 3 days ago • 45

liked a model 3 days ago

aoi-ot/VibeVoice-Large

Text-to-Speech • 9B • Updated 3 days ago • 5.5k • 83

upvoted a paper 5 days ago

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published 9 days ago • 8

liked a model 10 days ago

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated 21 days ago • 97.5k • • 668

upvoted a paper 11 days ago

Hermes 4 Technical Report

Paper • 2508.18255 • Published 13 days ago • 34

liked a model 12 days ago

Qwen/Qwen3-235B-A22B-Instruct-2507-FP8

Text Generation • 235B • Updated Jul 30 • 29.4k • 119

upvoted a paper 12 days ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published 13 days ago • 179

upvoted a collection 12 days ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated 9 days ago • 85

liked a model 17 days ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • 36B • Updated 12 days ago • 18.1k • 403

liked 2 models 18 days ago

huizimao/gpt-oss-20b-uncensored-mxfp4

21B • Updated 29 days ago • 493 • 10

Qwen/Qwen3-4B-Instruct-2507-FP8

Text Generation • 4B • Updated Aug 6 • 29.2k • 27

liked a model 19 days ago

Qwen/Qwen-Image-Edit

Image-to-Image • Updated 13 days ago • 120k • • 1.68k

upvoted a paper 23 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published about 1 month ago • 170

upvoted a paper 24 days ago

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Paper • 2508.07976 • Published 27 days ago • 48

liked a model 27 days ago

baichuan-inc/Baichuan-M2-32B

Text Generation • 33B • Updated 4 days ago • 141k • • 90

liked a model 28 days ago

GSAI-ML/LLaDA-1.5

Text Generation • 8B • Updated Jun 4 • 18.9k • 27

New activity in Qwen/Qwen3-32B about 1 month ago

Will Qwen3-32B be updated just like Qwen3-235B-A22B?

#40 opened about 1 month ago by

Enigrand

New activity in openbmb/MiniCPM-V-4 about 1 month ago

License?

#1 opened about 1 month ago by

merve

New activity in kernels-community/vllm-flash-attn3 about 1 month ago

Support for sm120?

#2 opened about 1 month ago by

Enigrand

Eni Grand

AI & ML interests

Recent Activity

Organizations

Enigrand's activity

Will Qwen3-32B be updated just like Qwen3-235B-A22B?

License?

Support for sm120?