55 28 386

Pablo Carrera (パブロ) PRO

pabloce

AI & ML interests

Hello, hello!

Recent Activity

upvoted a changelog 3 days ago

Introducing a better Hugging Face CLI

updated a Space 4 days ago

pabloce/exllama

liked a model 7 days ago

OmniSVG/OmniSVG

View all activity

Organizations

upvoted a changelog 3 days ago

Changelog

Introducing a better Hugging Face CLI

4 days ago

• 41

updated a Space 4 days ago

Exllama

😽

Chat: exllama v2

liked a model 7 days ago

OmniSVG/OmniSVG

Text Generation • Updated 8 days ago • 2.54k • 121

upvoted an article 12 days ago

Article

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

•

12 days ago

• 124

liked 2 models 27 days ago

kyutai/mimi

Feature Extraction • 0.1B • Updated 27 days ago • 345k • • 227

black-forest-labs/FLUX.1-Kontext-dev

Image-to-Image • Updated Jun 27 • 367k • • 1.89k

reacted to cgeorgiaw's post with ❤️ about 1 month ago

Post

2610

Huge new bio datasets just dropped!!!

Check out them out @

ginkgo-datapoints
Read the blog for more info: https://huggingface.co/blog/cgeorgiaw/gdp

1 reply

reacted to bartowski's post with 🤗 about 1 month ago

Post

16024

Was going to post this on /r/LocalLLaMa, but apparently it's without moderation at this time :')

bartowski/mistralai_Mistral-Small-3.2-24B-Instruct-2506-GGUF

Was able to use previous mistral chat templates, some hints from Qwen templates, and Claude to piece together a seemingly working chat template, tested it with llama.cpp server and got perfect results, though lmstudio still seems to be struggling for some reason (don't know how to specify a jinja file there)

Outlined the details of the script and results in my llama.cpp PR to add the jinja template:

https://github.com/ggml-org/llama.cpp/pull/14349

Start server with a command like this:

./llama-server -m /models/mistralai_Mistral-Small-3.2-24B-Instruct-2506-Q4_K_M.gguf --jinja --chat-template-file /models/Mistral-Small-3.2-24B-Instruct-2506.jinja

and it should be perfect! Hoping it'll work for ALL tools if lmstudio gets an update or something, not just llama.cpp, but very happy to see it works flawlessly in llama.cpp

In the meantime, will try to open a PR to minja to make the strftime work, but no promises :)

liked 4 models about 1 month ago

updated a model about 1 month ago

somosnlp-hackathon-2025/mistral-7b-gastronomia-hispana-qlora-GGUF

7B • Updated Jun 14 • 37

published a model about 1 month ago

somosnlp-hackathon-2025/mistral-7b-gastronomia-hispana-qlora-GGUF

7B • Updated Jun 14 • 37

upvoted an article about 2 months ago

Article

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 4 others •

Jun 11

• 71

reacted to danielhanchen's post with 🔥 about 2 months ago

Post

2129

Mistral releases Magistral, their new reasoning models! 🔥
GGUFs to run: unsloth/Magistral-Small-2506-GGUF

Magistral-Small-2506 excels at mathematics and coding.

You can run the 24B model locally with just 32GB RAM by using our Dynamic GGUFs.

reacted to cbensimon's post with 🔥🤗❤️🚀 about 2 months ago

Post

3301

🚀 ZeroGPU now supports PyTorch native quantization via torchao

While it hasn’t been battle-tested yet, Int8WeightOnlyConfig is already working flawlessly in our tests.

Let us know if you run into any issues — and we’re excited to see what the community will build!

import spaces
from diffusers import FluxPipeline
from torchao.quantization.quant_api import Int8WeightOnlyConfig, quantize_

pipeline = FluxPipeline.from_pretrained(...).to('cuda')
quantize_(pipeline.transformer, Int8WeightOnlyConfig()) # Or any other component(s)

@spaces.GPU
def generate(prompt: str):
    return pipeline(prompt).images[0]

5 replies

Pablo Carrera (パブロ) PRO

AI & ML interests

Recent Activity

Organizations

pabloce's activity

Introducing a better Hugging Face CLI

Exllama

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm