LUCA

Gargaz

AI & ML interests

None yet

Recent Activity

new activity 8 days ago

Gargaz/llama-2-7b-fine-tune-new:Adding `safetensors` variant of this model

updated a dataset about 2 months ago

Gargaz/Test

published a dataset about 2 months ago

Gargaz/Test

Gargaz's activity

upvoted an article 4 months ago

Article

Welcome to Inference Providers on the Hub 🔥

By

and 6 others •

Jan 28

• 483

upvoted a paper 5 months ago

OMNI: Open-endedness via Models of human Notions of Interestingness

Paper • 2306.01711 • Published Jun 2, 2023 • 1

upvoted an article 6 months ago

Article

Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code

By

•

Oct 2, 2024

• 68

upvoted a collection 6 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 571

upvoted a paper 6 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 146

upvoted 5 collections 6 months ago

Qwen2.5

12 items • Updated Apr 27 • 4

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 25 days ago • 86

Granite 3.1 Language Models

A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 9 items • Updated May 2 • 62

Bamba

Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 9 items • Updated Apr 28 • 23

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 144

upvoted a paper 6 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 368

upvoted 5 collections 6 months ago

whisper-guaraní

5 items • Updated Nov 25, 2024 • 2

TimesFM Release

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 4 items • Updated 16 days ago • 16

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 231

Centaurus

Series of uncensored models based on Llama-3. • 5 items • Updated May 27, 2024 • 4

Recommended small models

This is everything recent smaller than ~25B parameters that are high quality/reputable • 19 items • Updated Nov 30, 2024 • 118

upvoted a collection over 1 year ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 605