Ricardo Corso Fernandes Jr

g4ry

G-4-R-Y

AI & ML interests

NLP, NLU, Textless NLP, Multimodal Modelling and Speech Processing.

Recent Activity

upvoted an article 19 days ago

SmolVLM2: Bringing Video Understanding to Every Device

upvoted an article 19 days ago

VideoMamba: State Space Model for Efficient Video Understanding

upvoted an article 21 days ago

Diffusion Language Models: The New Paradigm

View all activity

Organizations

None yet

upvoted 2 articles 19 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

and 6 others •

Feb 20

• 296

Article

VideoMamba: State Space Model for Efficient Video Understanding

•

Mar 16, 2024

• 1

upvoted an article 21 days ago

Article

Diffusion Language Models: The New Paradigm

•

Jun 10

• 12

upvoted an article 2 months ago

Article

Introduction to State Space Models (SSM)

•

Jul 19, 2024

• 164

upvoted a paper 3 months ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 95

upvoted 2 articles 3 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 514

Article

Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization

•

Feb 8, 2024

• 14

upvoted a paper 3 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184

liked a model 3 months ago

PrimeIntellect/INTELLECT-2

33B • Updated May 13 • 1.03k • 201

upvoted 6 articles 4 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 117

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

and 5 others •

Mar 9, 2023

• 62

Article

Putting RL back in RLHF

and 1 other •

Jun 12, 2024

• 100

Article

Red-Teaming Large Language Models

and 2 others •

Feb 24, 2023

• 29

Article

The N Implementation Details of RLHF with PPO

and 2 others •

Oct 24, 2023

• 67

Article

Fine-tune Llama 2 with DPO

and 2 others •

Aug 8, 2023

• 60

liked a model 5 months ago

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated about 12 hours ago • 30.5k • 1.55k

liked 3 models 6 months ago

liked a model 12 months ago

facebook/mms-lid-1024

Audio Classification • 1.0B • Updated Jun 13, 2023 • 2.26k • 9

Ricardo Corso Fernandes Jr

AI & ML interests

Recent Activity

Organizations

g4ry's activity

SmolVLM2: Bringing Video Understanding to Every Device

VideoMamba: State Space Model for Efficient Video Understanding

Diffusion Language Models: The New Paradigm

Introduction to State Space Models (SSM)

Vision Language Models (Better, Faster, Stronger)

Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization

KV Caching Explained: Optimizing Transformer Inference Efficiency

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Putting RL back in RLHF

Red-Teaming Large Language Models

The N Implementation Details of RLHF with PPO

Fine-tune Llama 2 with DPO