Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
tuyenTS 's Collections
multi-modalities
llm_inference
llms
llm_reasoning
voice
llm_compression
llms_editing
llm_finetuning
llm_explanation

multi-modalities

updated Dec 16, 2024
Upvote
-

  • AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

    Paper • 2309.16058 • Published Sep 27, 2023 • 56

  • OneLLM: One Framework to Align All Modalities with Language

    Paper • 2312.03700 • Published Dec 6, 2023 • 24

  • Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models

    Paper • 2402.07865 • Published Feb 12, 2024 • 15

  • SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers

    Paper • 2401.08740 • Published Jan 16, 2024 • 14

  • Scaling Diffusion Language Models via Adaptation from Autoregressive Models

    Paper • 2410.17891 • Published Oct 23, 2024 • 17

  • Multimodal Latent Language Modeling with Next-Token Diffusion

    Paper • 2412.08635 • Published Dec 11, 2024 • 46
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs