patmode's Collections
Training-LLMs

updated about 11 hours ago

  • Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

    Paper • 2410.17243 • Published Oct 22, 2024 • 95

  • AnimateAnything: Consistent and Controllable Animation for Video Generation

    Paper • 2411.10836 • Published Nov 16, 2024 • 25

  • LLaVA-o1: Let Vision Language Models Reason Step-by-Step

    Paper • 2411.10440 • Published Nov 15, 2024 • 125

  • MagicQuill: An Intelligent Interactive Image Editing System

    Paper • 2411.09703 • Published Nov 14, 2024 • 78

  • Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models

    Paper • 2411.17041 • Published Nov 26, 2024 • 13

  • Video-Guided Foley Sound Generation with Multimodal Controls

    Paper • 2411.17698 • Published Nov 26, 2024 • 10

  • GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

    Paper • 2412.04440 • Published Dec 5, 2024 • 22

  • SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence

    Paper • 2506.15672 • Published 2 days ago • 8