37 33 10

Mohamed Mekkouri

medmekk

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

medmekk/Qwen3.5-35B-A3B-MXFP4

published a model about 1 month ago

medmekk/Qwen3.5-35B-A3B-MXFP4

new activity 3 months ago

kernels-community/rmsnorm:Add Metal (Apple Silicon) build variants

View all activity

Organizations

upvoted an article 5 months ago

Article

Shadow AI - Where are the CIOs?

jeffboudier

•

Dec 19, 2025

• 31

upvoted 2 articles 6 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb

•

Dec 1, 2025

• 311

Article

Easily Build and Share ROCm Kernels with Hugging Face

badaoui, daniehua, ColorsWind, ftyghome

•

Nov 17, 2025

• 38

upvoted an article 9 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

drbh, danieldk

•

Aug 18, 2025

• 100

upvoted 2 articles about 1 year ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk

•

Apr 29, 2025

• 44

Article

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

NormalUhr

•

Feb 4, 2025

• 35

upvoted 14 papers about 1 year ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 146

V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning

Paper • 2503.11495 • Published Mar 14, 2025 • 14

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published Mar 16, 2025 • 68

CoSTAast: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

Paper • 2503.10613 • Published Mar 13, 2025 • 79

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

Paper • 2503.09151 • Published Mar 12, 2025 • 32

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12, 2025 • 77

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published Mar 12, 2025 • 45

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10, 2025 • 88

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11, 2025 • 73

FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates

Paper • 2503.07216 • Published Mar 10, 2025 • 31

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10, 2025 • 66

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10, 2025 • 61

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5, 2025 • 233

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7, 2025 • 81

Mohamed Mekkouri

AI & ML interests

Recent Activity

Organizations

medmekk's activity

Shadow AI - Where are the CIOs?

Transformers v5: Simple model definitions powering the AI ecosystem

Easily Build and Share ROCm Kernels with Hugging Face

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons