Alberto Cetoli PRO

fractalego

AI & ML interests

Entity/relation extraction, Q&A, Summarisation

Recent Activity

liked a model 3 days ago
Almawave/Velvet-14B
upvoted an article 3 days ago
Open-R1: Update #1
liked a model 4 days ago
iGeniusAI/Italia-9B-Instruct-v0.1
View all activity

Articles

Organizations

Blog-explorers's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture

fractalego's activity

upvoted an article 3 days ago
upvoted an article 6 days ago
view article
Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By manu •
• 197
upvoted an article 21 days ago
view article
Article

Visualize and understand GPU memory in PyTorch

• 180
reacted to mitkox's post with 🤯🔥➕ 28 days ago
view post
Post
2457
Can it run DeepSeek V3 671B is the new 'can it run Doom'.

How minimalistic can I go with on device AI with behemoth models - here I'm running DeepSeek V3 MoE on a single A6000 GPU.

Not great, not terrible, for this minimalistic setup. I love the Mixture of Experts architectures. Typically I'm running my core LLM distributed over the 4 GPUs.

Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.
·