Celso F

celsowm

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

swiss-ai/Apertus-70B-2509:Any place to test it online?

liked a dataset 17 days ago

nvidia/Nemotron-Post-Training-Dataset-v2

new activity 17 days ago

nvidia/Nemotron-Post-Training-Dataset-v2:Please add portuguese in the next version

Organizations

None yet

upvoted an article 26 days ago

Transformers backend integration in SGLang

Article • Jun 23 • 53

upvoted a collection 3 months ago

StarVector SVG Datasets (🏆SVG-Bench)

Datasets for training and evaluating SVG generation models • 11 items • Updated Jan 12 • 21

upvoted a paper 3 months ago

GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Paper • 2505.20355 • Published May 26 • 36

upvoted a paper 4 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 286

upvoted a collection 4 months ago

Qwen3

84 items • Updated Aug 6 • 1.19k

upvoted 2 collections 5 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 617

Portuguese LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 19 items • Updated 42 minutes ago • 37

upvoted 2 papers 6 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 139

Long Context Tuning for Video Generation

Paper • 2503.10589 • Published Mar 13 • 14

upvoted a paper 10 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 68

upvoted an article about 1 year ago

Welcome FalconMamba: The first strong attention-free 7B model

Article • Aug 12, 2024 • 113

upvoted an article over 1 year ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 239

upvoted a collection over 1 year ago

Gemma release

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 343

upvoted a paper about 2 years ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 81