Ricardo Corso Fernandes Jr

g4ry

G-4-R-Y

AI & ML interests

NLP, NLU, Textless NLP, Multimodal Modelling and Speech Processing.

Recent Activity

upvoted an article 19 days ago

SmolVLM2: Bringing Video Understanding to Every Device

upvoted an article 19 days ago

VideoMamba: State Space Model for Efficient Video Understanding

upvoted an article 21 days ago

Diffusion Language Models: The New Paradigm

Organizations

None yet

upvoted 2 articles 19 days ago

SmolVLM2: Bringing Video Understanding to Every Device

Article • and 6 others • Feb 20 • 296

VideoMamba: State Space Model for Efficient Video Understanding

Article • Mar 16, 2024 • 1

upvoted an article 21 days ago

Diffusion Language Models: The New Paradigm

Article • Jun 10 • 12

upvoted an article 2 months ago

Introduction to State Space Models (SSM)

Article • Jul 19, 2024 • 164

upvoted a paper 3 months ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 95

upvoted 2 articles 3 months ago

Vision Language Models (Better, Faster, Stronger)

Article • and 4 others • May 12 • 514

Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization

Article • Feb 8, 2024 • 14

upvoted a paper 3 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184

upvoted 6 articles 4 months ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

Article • Jan 30 • 117

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Article • and 5 others • Mar 9, 2023 • 62

Putting RL back in RLHF

Article • and 1 other • Jun 12, 2024 • 100

Red-Teaming Large Language Models

Article • and 2 others • Feb 24, 2023 • 29

The N Implementation Details of RLHF with PPO

Article • and 2 others • Oct 24, 2023 • 67

Fine-tune Llama 2 with DPO

Article • and 2 others • Aug 8, 2023 • 60

upvoted 2 articles about 1 year ago

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Article • Jul 29, 2024 • 354

Deploying 🤗 Hub models in Vertex AI

Article • Feb 27, 2024 • 16