Juan CM

jucamohedano

AI & ML interests

AI Systems MSc at Trento 🚀🤖

Recent Activity

liked a Space about 14 hours ago

nanotron/ultrascale-playbook

upvoted a collection 17 days ago

updated a collection about 1 month ago

Model search via model weights

upvoted a collection 17 days ago

Model Merging

Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in this space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 243

upvoted 2 papers about 1 month ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13 • 36

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28 • 46

upvoted a collection 5 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 161

upvoted an article 5 months ago

Introducing smolagents: simple agents that write actions in code.

Article • 3 authors • Dec 31, 2024 • 1.08k

upvoted a paper 5 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 235

upvoted 2 articles 5 months ago

Open-source DeepResearch – Freeing our search agents

Article • 5 authors • Feb 4 • 1.27k

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • 3 authors • Jan 23 • 182

upvoted 2 articles about 1 year ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Article • 3 authors • May 14, 2024 • 259

SeeMoE: Implementing a MoE Vision Language Model from Scratch

Article • Jun 23, 2024 • 34

upvoted a collection about 1 year ago

[lecture artifacts] aligning open language models

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 56

upvoted 3 articles about 1 year ago

Fine-tuning a large language model on Kaggle Notebooks (or even on your own computer) for solving real-world tasks

Article • Feb 21, 2024 • 16

Design choices for Vision Language Models in 2024

Article • Apr 16, 2024 • 29

Vision Language Models Explained

Article • 2 authors • Apr 11, 2024 • 408

upvoted 5 articles over 1 year ago

Mixture of Experts Explained

Article • 6 authors • Dec 11, 2023 • 725

Fine-tune Llama 2 with DPO

Article • 3 authors • Aug 8, 2023 • 55

Preference Tuning LLMs with Direct Preference Optimization Methods

Article • 5 authors • Jan 18, 2024 • 66

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Article • 5 authors • May 24, 2023 • 156

Faster fine-tuning using TRL & Unsloth

Article • Jan 10, 2024 • 62

upvoted a collection over 1 year ago

VILA: On Pre-training for Visual Language Models

10 items • Updated Apr 17 • 54