Iñigo López-Riobóo Botana's picture

Iñigo López-Riobóo Botana

ibotana

·

https://www.linkedin.com/in/%C3%AD%C3%B1igo-luis-l%C3%B3pez-riob%C3%B3o-botana-4a43001a2/

AI & ML interests

Senior NLP Engineer at Newtral

Recent Activity

liked a model about 8 hours ago

open-thoughts/OpenThinker3-7B

View all activity

Organizations

ibotana's activity

upvoted a paper 7 days ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 146

upvoted 2 articles 8 days ago

Article

Introducing smolagents: simple agents that write actions in code.

By

and 2 others •

Dec 31, 2024

• 1.06k

Article

Welcome to Inference Providers on the Hub 🔥

By

and 6 others •

Jan 28

• 483

upvoted a collection 16 days ago

DeepHermes

Preview models of hybrid reasoner Hermes series • 6 items • Updated Mar 13 • 39

upvoted an article 3 months ago

Article

Accelerate Large Model Training using DeepSpeed

By

and 1 other •

Jun 28, 2022

• 6

upvoted 3 collections 3 months ago

Command Models

Latest Cohere Labs Command models • 6 items • Updated Apr 15 • 24

Cohere Labs Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 4 items • Updated Apr 15 • 40

Cohere Labs Aya 23

Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated Apr 15 • 55

upvoted a collection 4 months ago

DeepSeek-R1

10 items • Updated 10 days ago • 707

upvoted 2 articles 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By

•

Feb 7

• 148

Article

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

By

•

Oct 21, 2022

• 31

upvoted a collection 4 months ago

Hermes 3

The Hermes 3 Series of Models • 12 items • Updated Feb 13 • 122

upvoted an article 5 months ago

Article

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

By

and 6 others •

Feb 27, 2024

• 67

upvoted 5 collections 5 months ago

NVILA

10 items • Updated 19 days ago • 14

InternVL2.5

Better than InternVL 2.0 • 19 items • Updated Apr 20 • 90

VideoChat-Flash

Faster and more powerful VideoChat. • 15 items • Updated Apr 20 • 11

VideoChat

Chat-Centric Video Understanding • 8 items • Updated Apr 20 • 3

InternVideo2

InternVideo2 • 20 items • Updated Apr 20 • 20

upvoted a paper 5 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 146

upvoted a collection 5 months ago

VILA: On Pre-training for Visual Language Models

10 items • Updated Apr 17 • 53