Article: Preference Tuning LLMs with Direct Preference Optimization Methods • By kashif and 4 others • Jan 18, 2024 • 58
Article: Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies • By prithivMLmods • Feb 17 • 21
Collection: high-quality Chinese training datasets • A suite of high-quality Chinese datasets used for pretraining, fine-tuning, or preference alignment, along with models trained on them • 13 items • Updated 3 days ago • 16
Article: SmolLM - blazingly fast and remarkably powerful • By loubnabnl and 2 others • Jul 16, 2024 • 373
Paper: BiLLM: Pushing the Limit of Post-Training Quantization for LLMs • arXiv:2402.04291 • Published Feb 6, 2024 • 51
Collection: Function Calling v3 • Models fine-tuned for function calling • 14 items • Updated Apr 27, 2024 • 21
Collection: Mixtral HQQ Quantized Models • 4-bit and 2-bit Mixtral models quantized with https://github.com/mobiusml/hqq • 9 items • Updated Mar 29, 2024 • 14
Paper: PaLI-3 Vision Language Models: Smaller, Faster, Stronger • arXiv:2310.09199 • Published Oct 13, 2023 • 27