Elie Bakouch's picture

Elie Bakouch PRO

eliebak

HuggingFaceTB

·

AI & ML interests

Training LLM's @ 🤗

Recent Activity

upvoted a paper 2 days ago

mHC: Manifold-Constrained Hyper-Connections

updated a collection 4 days ago

Open Korean LLM (MSIT 2025)

upvoted a collection 4 days ago

Open Korean LLM (MSIT 2025)

View all activity

Organizations

upvoted a paper 2 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 4 days ago • 169

upvoted 3 collections 4 days ago

Open Korean LLM (MSIT 2025)

6 items • Updated 1 day ago • 12

A.X K

3 items • Updated 5 days ago • 6

📝 Research & Long-Form Blog Posts

In-depth technical articles and research pieces published by Hugging Face • 8 items • Updated 4 days ago • 14

upvoted a collection 6 days ago

HyperCLOVA X SEED

HyperCLOVA X SEED is NAVER's lightweight open-source lineup with a strong focus on Korean language performance • 6 items • Updated 11 days ago • 37

upvoted an article 15 days ago

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

291

upvoted 2 collections 20 days ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 6 items • Updated 4 days ago • 110

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 12 days ago • 87

upvoted a changelog 27 days ago

Changelog

Team & Enterprise Articles Now Featured on the Hugging Face Blog

27 days ago

• 75

upvoted 2 papers about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 95

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 282

upvoted an article about 1 month ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

Dec 1, 2025

•

264

upvoted a collection about 1 month ago

INTELLECT-3

INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated Nov 28, 2025 • 11

upvoted an article about 1 month ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Nov 19, 2025

•

33

upvoted a collection about 2 months ago

NeMo Gym

Collection of RL verifiable data for NeMo Gym • 13 items • Updated 12 days ago • 32

upvoted a paper about 2 months ago

Motif 2 12.7B technical report

Paper • 2511.07464 • Published Nov 7, 2025 • 39

upvoted 3 collections 2 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 396

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29, 2025 • 58

Reproducing-TRM

3 items • Updated Oct 22, 2025 • 5

upvoted an article 2 months ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

+8

Oct 23, 2025

•

139