Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
poonyZ 's Collections
omni
T2I
agi
fancy
vlm eval
speech lm
vlm data
video LM
VLM
llm

fancy

updated Jan 2
Upvote
-

  • GenEx: Generating an Explorable World

    Paper • 2412.09624 • Published Dec 12, 2024 • 97

  • Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation

    Paper • 2412.09428 • Published Dec 12, 2024 • 7

  • BrushEdit: All-In-One Image Inpainting and Editing

    Paper • 2412.10316 • Published Dec 13, 2024 • 36

  • FashionComposer: Compositional Fashion Image Generation

    Paper • 2412.14168 • Published Dec 18, 2024 • 16

  • HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

    Paper • 2412.18925 • Published Dec 25, 2024 • 105

  • Edicho: Consistent Image Editing in the Wild

    Paper • 2412.21079 • Published Dec 30, 2024 • 23

  • TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization

    Paper • 2412.21037 • Published Dec 30, 2024 • 24
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs