22 23 50

Noob

noobmldude

AI & ML interests

Explainable AI

Recent Activity

liked a model about 1 hour ago

apple/DiffuCoder-7B-cpGRPO

upvoted an article about 17 hours ago

Welcome Gemma 2 - Google's new open LLM

upvoted a paper 11 days ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

View all activity

Organizations

upvoted an article about 17 hours ago

Article

Welcome Gemma 2 - Google's new open LLM

and 5 others •

Jun 27, 2024

• 130

upvoted a paper 11 days ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published 19 days ago • 118

upvoted an article 14 days ago

Article

RegMix: Data Mixture as Regression for Language Model Pre-training

•

Jul 11, 2024

• 15

upvoted a paper 15 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 14

upvoted an article 15 days ago

Article

Selective fine-tuning of Language Models with Spectrum

•

Sep 3, 2024

• 36

upvoted a paper 18 days ago

Magistral

Paper • 2506.10910 • Published 26 days ago • 61

upvoted a paper 19 days ago

Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure

Paper • 2506.12278 • Published 25 days ago • 17

upvoted an article 3 months ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

and 8 others •

Apr 29, 2024

• 78

upvoted 2 papers 3 months ago

LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published Mar 12 • 13

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

upvoted 3 papers 4 months ago

2BP: 2-Stage Backpropagation

Paper • 2405.18047 • Published May 28, 2024 • 27

UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function

Paper • 2410.21438 • Published Oct 28, 2024 • 2

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published Mar 13 • 29

upvoted a paper 5 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 235

upvoted a collection 9 months ago

Code Evaluation

Collection

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 15

upvoted an article 9 months ago

Article

FineVideo: behind the scenes

and 5 others •

Sep 23, 2024

• 33

upvoted a collection 10 months ago

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated about 20 hours ago • 163

upvoted an article 11 months ago

Article

XetHub is joining Hugging Face!

and 1 other •

Aug 8, 2024

• 101

upvoted an article 12 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

and 2 others •

Feb 23, 2024

• 144

upvoted a paper about 1 year ago

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 48

Noob

AI & ML interests

Recent Activity

Organizations

noobmldude's activity

Welcome Gemma 2 - Google's new open LLM

RegMix: Data Mixture as Regression for Language Model Pre-training

Selective fine-tuning of Language Models with Spectrum

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

FineVideo: behind the scenes

XetHub is joining Hugging Face!

🪆 Introduction to Matryoshka Embedding Models