Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Nazzaroth2 's Collections
Reward Modeling
models to test out
data synthesis
RL_Papers in general
OCR
imageGen
VLM RL Reasoning
LLM-External_information
llm_compression
LLM_Reasoning-ErrorCorrection
Loras
3D (nerfs, gaussians, generation etc.)
t2i consistency works
videogames_roleplay
small_or_multimodal_llm
manga_translation
long_context
model training

OCR

updated Apr 16
Upvote
-

  • Gemma 3 Technical Report

    Paper • 2503.19786 • Published Mar 25 • 52

  • Kimi-VL Technical Report

    Paper • 2504.07491 • Published Apr 10 • 131

  • InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

    Paper • 2504.10479 • Published Apr 14 • 276

  • FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

    Paper • 2504.09925 • Published Apr 14 • 38

  • Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

    Paper • 2412.04424 • Published Dec 5, 2024 • 64

  • Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

    Paper • 2504.08672 • Published Apr 11 • 55
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs