Article: The Transformers Library: standardizing model definitions • By lysandre and 3 others • 13 days ago • 102
Article: You could have designed state of the art positional encoding • By FL33TW00D-HF • Nov 25, 2024 • 276
Article: Welcome Llama 4 Maverick & Scout on Hugging Face! • By burtenshaw and 6 others • Apr 5 • 144
Article: LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! • By medmekk and 1 other • Mar 7 • 59
Article: Open-source DeepResearch – Freeing our search agents • By m-ric and 4 others • Feb 4 • 1.25k
Article: Open-R1: a fully open reproduction of DeepSeek-R1 • By eliebak and 2 others • Jan 28 • 861
Paper: Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping • arXiv:2409.15241 • Published Sep 23, 2024 • 1
Paper: Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone • arXiv:2404.14219 • Published Apr 22, 2024 • 257
Paper: Small-scale proxies for large-scale Transformer training instabilities • arXiv:2309.14322 • Published Sep 25, 2023 • 21
Article: How NuminaMath Won the 1st AIMO Progress Prize • By yfleureau and 7 others • Jul 11, 2024 • 120
Paper: Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets • arXiv:2201.02177 • Published Jan 6, 2022 • 2
Article: A failed experiment: Infini-Attention, and why we should keep trying? • By neuralink and 2 others • Aug 14, 2024 • 63
Paper: Grokfast: Accelerated Grokking by Amplifying Slow Gradients • arXiv:2405.20233 • Published May 30, 2024 • 6
Paper: Transformer Explainer: Interactive Learning of Text-Generative Models • arXiv:2408.04619 • Published Aug 8, 2024 • 162