3 19 91

João Palmeiro PRO

joaompalmeiro

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago

HuggingFaceM4/FineVision

liked a Space 4 days ago

HuggingFaceM4/FineVision

reacted to AdinaY's post with 🔥 12 days ago

MiniCPM-V 4.5 🚀 New MLLM for image, multi-image & video understanding, running even on your phone, released by OpenBMB https://huggingface.co/openbmb/MiniCPM-V-4_5 ✨ SOTA vision language capability ✨ 96× video token compression > high-FPS & long video reasoning ✨ Switchable fast vs deep thinking modes ✨ Strong OCR, document parsing, supports 30+ languages

View all activity

Organizations

upvoted an article about 1 month ago

Article

Introducing Command A Vision: Multimodal AI built for Business

and 3 others •

Jul 31

• 63

upvoted 2 articles 2 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

and 1 other •

Jul 9

• 669

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

Jul 8

• 649

upvoted a paper 2 months ago

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Paper • 2506.17218 • Published Jun 20 • 28

upvoted 2 papers 3 months ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 119

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 131

upvoted an article 3 months ago

Article

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

and 1 other •

Jun 2

• 25

upvoted a paper 3 months ago

Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding

Paper • 2502.11492 • Published Feb 17 • 2

upvoted an article 4 months ago

Article

Common AI Model Formats

•

Feb 27

• 47

upvoted a paper 4 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19 • 16

upvoted 2 articles 4 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 356

Article

Preference Optimization for Vision Language Models

and 3 others •

Jul 10, 2024

• 80

upvoted a paper 4 months ago

Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis

Paper • 2505.09358 • Published May 14 • 26

upvoted a collection 4 months ago

Marigold Computer Vision

Collection

All things Marigold • 17 items • Updated May 15 • 21

upvoted 3 articles 4 months ago

Article

The Transformers Library: standardizing model definitions

and 3 others •

May 15

• 117

Article

Synthetic data: save money, time and carbon with open source

•

Feb 16, 2024

• 79

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 522

upvoted a paper 5 months ago

ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering

Paper • 2504.05506 • Published Apr 7 • 24

upvoted an article 5 months ago

Article

Optimise AI Models and Make Them Faster, Smaller, Cheaper, Greener

and 2 others •

Apr 4

• 19

João Palmeiro PRO

AI & ML interests

Recent Activity

Organizations

joaompalmeiro's activity

Introducing Command A Vision: Multimodal AI built for Business

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

SmolLM3: smol, multilingual, long-context reasoner

*Context Is Gold to Find the Gold Passage*: Evaluating and Training Contextual Document Embeddings

Common AI Model Formats

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Preference Optimization for Vision Language Models

The Transformers Library: standardizing model definitions

Synthetic data: save money, time and carbon with open source

Vision Language Models (Better, Faster, Stronger)

Optimise AI Models and Make Them Faster, Smaller, Cheaper, Greener

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings