Guille Pérez-Torró

guishe

AI & ML interests

Information Retrieval, Few-Shot Learning, Named Entity Recognition, Named Entity Disambiguation, Semantic Search, Aspect-based Sentiment Analysis

Recent Activity

updated a collection 18 days ago

Small LLMs

upvoted an article 19 days ago

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

updated a collection about 1 month ago

Instruct LLMs

View all activity

Organizations

None yet

upvoted an article 19 days ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

and 1 other •

Oct 14, 2024

• 94

upvoted a collection about 2 months ago

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 29 days ago • 156

upvoted an article 2 months ago

Article

MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before

and 2 others •

Apr 24

• 14

upvoted a collection 2 months ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 36 items • Updated 1 day ago • 129

upvoted a paper 3 months ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published Jan 27 • 36

upvoted an article 3 months ago

Article

Judge Arena: Benchmarking LLMs as Evaluators

and 7 others •

Nov 19, 2024

• 57

upvoted 3 papers 3 months ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 108

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 132

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

upvoted an article 3 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

and 5 others •

Feb 4

• 92

upvoted a collection 3 months ago

reranking series v2

Collection

V2 crispy rerank series • 3 items • Updated 3 days ago • 23

upvoted a collection 4 months ago

Unsloth 4-bit Dynamic Quants

Collection

Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated 29 days ago • 82

upvoted a paper 4 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 94

upvoted a collection 4 months ago

4bit Instruct Models

Collection

18 items • Updated 29 days ago • 30

upvoted an article 4 months ago

Article

Tutorial: Quantizing Llama 3+ Models for Efficient Deployment

•

Dec 15, 2024

• 5

upvoted a paper 4 months ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 30

upvoted 3 articles 5 months ago

Article

Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning

•

Jan 19, 2024

• 17

Article

Selene 1 Mini: the best small language model-as-a-judge

and 10 others •

Jan 29

• 13

Article

Train 400x faster Static Embedding Models with Sentence Transformers

•

Jan 15

• 195

upvoted a paper 7 months ago

GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks

Paper • 2406.12925 • Published Jun 14, 2024 • 26

Guille Pérez-Torró

AI & ML interests

Recent Activity

Organizations

guishe's activity

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before

Judge Arena: Benchmarking LLMs as Evaluators

DABStep: Data Agent Benchmark for Multi-step Reasoning

Tutorial: Quantizing Llama 3+ Models for Efficient Deployment

Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning

Selene 1 Mini: the best small language model-as-a-judge

Train 400x faster Static Embedding Models with Sentence Transformers