1 49 105

gerald hewes

gerald29

AI & ML interests

None yet

Recent Activity

liked a model 11 days ago

agentica-org/DeepSWE-Preview

liked a model 27 days ago

zai-org/GLM-4.1V-9B-Thinking

liked a model about 1 month ago

tencent/Hunyuan3D-2.1

View all activity

Organizations

upvoted an article about 1 month ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

and 4 others •

Jun 19

• 80

upvoted a paper about 1 month ago

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

Paper • 2506.09985 • Published Jun 11 • 28

upvoted a paper about 2 months ago

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Paper • 2506.05010 • Published Jun 5 • 74

upvoted 2 collections 3 months ago

D-FINE

Collection

State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 55

Perception Encoder

Collection

17 items • Updated 19 days ago • 62

upvoted a paper 4 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 165

upvoted an article 4 months ago

Article

Introducing smolagents: simple agents that write actions in code.

and 2 others •

Dec 31, 2024

• 1.09k

upvoted a paper 4 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 159

upvoted an article 4 months ago

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 322

upvoted a paper 4 months ago

SynCity: Training-Free Generation of 3D Worlds

Paper • 2503.16420 • Published Mar 20 • 26

upvoted 10 papers 5 months ago

LLM-based User Profile Management for Recommender System

Paper • 2502.14541 • Published Feb 20 • 6

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

Paper • 2502.14802 • Published Feb 20 • 13

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

Paper • 2502.14044 • Published Feb 19 • 8

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Paper • 2502.14377 • Published Feb 20 • 12

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published Feb 20 • 14

NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization

Paper • 2502.14638 • Published Feb 20 • 11

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published Feb 18 • 29

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published Feb 20 • 24

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 146

gerald hewes

AI & ML interests

Recent Activity

Organizations

gerald29's activity

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

Introducing smolagents: simple agents that write actions in code.

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?