In a Training Loop 🔄

Manuel Romero PRO

mrm8488

https://mrm8488.github.io

AI & ML interests

#AI Research and Democratization. NLP/NLG 🤗

Recent Activity

liked a model about 12 hours ago

bartowski/mrm8488_Qwen3-14B-ft-limo-GGUF

liked a model 9 days ago

nvidia/atlas-era5

upvoted a collection about 1 month ago

🧮functiongemma ft mobile-actions

upvoted 2 collections about 1 month ago

🧮functiongemma ft mobile-actions

A collection of functiongemma-270m-it models fine-tuned on mobile actions dataset for Spanish, French and Italian • 3 items • Updated 30 days ago • 3

JustRL

2 items • Updated Nov 1, 2025 • 5

upvoted 2 articles about 1 month ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Article • Oct 30, 2025 • 42

Encoding the World's Medical Knowledge into 970K

Article • Dec 22, 2025 • 15

upvoted a paper about 2 months ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 236

upvoted an article 2 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 587

upvoted a collection 2 months ago

Nemotron RAG

Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 11 items • Updated 5 days ago • 67

upvoted a paper 2 months ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published Nov 25, 2025 • 121

upvoted a collection 2 months ago

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated Dec 23, 2025 • 48

upvoted a paper 3 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 51

upvoted an article 3 months ago

How to train a new language model from scratch using Transformers and Tokenizers

Article • Feb 14, 2020 • 58

upvoted a paper 3 months ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 129

upvoted a collection 3 months ago

Luth Datasets

6 items • Updated Sep 24, 2025 • 5

upvoted an article 3 months ago

Luth: Efficient French Specialization for Small Language Models

Article • Aug 11, 2025 • 18

upvoted a collection 3 months ago

Luth x Qwen3

4 items • Updated Sep 24, 2025 • 7

upvoted an article 4 months ago

Supercharge your OCR Pipelines with Open Models

Article • Oct 21, 2025 • 301

upvoted a paper 4 months ago

Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report

Paper • 2510.14880 • Published Oct 16, 2025 • 19

upvoted an article 4 months ago

Model statistics of the 50 most downloaded entities on Hugging Face

Article • Oct 13, 2025 • 37

upvoted a collection 4 months ago

smol2operator Release

4 items • Updated Sep 23, 2025 • 24

upvoted an article 4 months ago

Smol2Operator: Post-Training GUI Agents for Computer Use

Article • Sep 23, 2025 • 135