AGI
- 112🚀
teknium/Mistral-Trismegistus-7B
Text Generation • Updated • 199 • 216
- 346
Latent Consistency Models
⚡
- 2.47k
XTTS
🐸
- 357
VALL E X
🎙Generate audio from text with a custom voice
- 189
LLaMA Board
🦙Fine-tuning large language model with Gradio UI
- 4.84k
MusicGen
🎵Generate music from text and melody descriptions
- 1.43k
MagicAnimate
💃
- 516
Seamless M4T v2
📞
- 1.8k
Stable Video Diffusion 1.1
📺Generate a short video from an image
- 235
Video LLaVA
📚
- 168
Mustango
🐢
- 563
OpenAI TTS New
📊
- 287
3D Arena
🏢Vote on and view 3D leaderboard entries
- 225
Distil Whisper Web
👀Convert spoken words into text
- 282
Zero123++ Demo Space
🌒
- 107
InstaFlow
🐨
- 1.27k
CLIP Interrogator 2
🕵Generate text descriptions from images
- 743
ZoeDepth
🦀Create 3D models from images
- 41
LooseControl
📚
- 305
Enhance This DemoFusion SDXL
🔍Creative Upscaler High-Res Image Generation DemoFusion SDXL
axiong/PMC_LLaMA_13B
Text Generation • Updated • 789 • 32
axiong/pmc_llama_instructions
Viewer • Updated • 514k • 107 • 29
med-flamingo/med-flamingo
Updated • 51
wikimedia/wikisource
Viewer • Updated • 1.66M • 1.47k • 81
- 2.56k
OutfitAnyone
🏢Create virtual try-ons for clothing on images of people
Pixel Aligned Language Models
Paper • 2312.09237 • Published • 18
- 191
Gemini Playground
💬
- 254
Singing Voice Conversion
🎼Transform your voice into a singer's
- 54
Text To Speech
🔥Generate speech from text with different speakers
- 29
Text To Audio
🌖
- 50
NaturalSpeech2
🎧
- 210
AnyDoor Online
👁Teleport target objects to new backgrounds
- 58
MotionCtrl
📊
- 117
MotionGPT
🏃Generate human motion from text or text from motion
MotionGPT: Human Motion as a Foreign Language
Paper • 2306.14795 • Published • 27
- 432
GPT-Academic
😻Generate academic content using GPT
- 95
M2UGen Demo
💻
- 63
VCoder
✌
- 928
IP-Adapter-FaceID
🧑Generate AI images with your face
- 266
AnyText
👁Generate images with text and edit existing images
osunlp/Mind2Web
Viewer • Updated • 253 • 781 • 100
- 142
FaceChain
🏆Display Hugging Face status and loading animation
- 225
Dreamtalk
😛Animate a portrait from audio speech
- 105
I2VGen-XL
🔥
- 918
ReplaceAnything
📚Replace objects in images with new content
- 1.89k
PhotoMaker
📷Create customized images using photos and prompts
- 332
Resemble Enhance
🚀Enhance and clean audio files
- 6
DiffusionGPT
👁Generate images from text prompts
- 15
DiffusionGPT XL
🐢
- 3.3k
InstantID
😻Generate personalized images with face preservation
- 42
DuckDB NSQL 7B
🏢Generate DuckDB SQL queries from natural language prompts
- 316
Qwen-VL-Max
📷Interact with images and texts using Qwen-VL-Max
- 197
InstructIR
💻Improve images with text instructions
- 387
Qwen1.5 72B Chat
🚀Generate chat responses from user input
- 499
Image to Music v2
🎺Get a music sample inspired by the mood of an image
- 772
BRIA RMBG 1.4
💻Remove background from images
- 419
YOLO World
🔥Detect objects in images or videos
- 544
Vision Arena (Testing VLMs side-by-side)
🖼Analyze images to detect and label objects
- 1.67k
Stable Cascade
👁Generate images from text prompts
- 67
Diffusion Transformers (DiT)
🚀
- 469
SDXL Lightning
⚡Super-fast image generation on SDXL
- 257
YOLO-World + EfficientSAM
🔥
- 126
Differential Diffusion
😻Edit images using prompts and change maps
- 51
YOLOv9 Object Detection w/ Transformers.js
🖼In-browser object detection w/ YOLOv9 and Transformers.js
- 75
Depth Anything Video
👁Generate depth maps for video frames
- 522
Depth Anything
🌖Generate depth map from image
- 445
MeloTTS
🗣Fast, efficient, & multilingual text-to-speech
- 1.11k
Playground V2.5
🌍Generate highly aesthetic images
- 98
MoMask
🎭Generate human motions from text prompts
- 656
PhotoMaker Style
📷Generate customized face images with styles
- 56
TCD
📈Official Demo Space for Trajectory Consistency Distillation
- 815
TripoSR
🐳
- 30
Magi Demo
🏢Generate transcript from comic image
- 1.33k
Animagine XL 3.1
🌍The most opinionated, anime-themed SDXL model
- 207
Img2img Turbo Sketch
📚
- 125
APISR
🏃Enhance low-resolution anime images
- 167
DynamiCrafter
🐨
- 279
DynamiCrafter
🐨Generate videos from images and text prompts
- 10
DragAPart
🏢
- 9.8k
AI Comic Factory
👩Create your own AI comic with a single prompt
- 83
GRM
🏆Generate realistic images using GRM Live Demo
- 118
Qwen1.5 32B Chat
🏢Chat with a powerful language model
- 71
AnyV2V
🎥Video Editing
- 49
DesignEdit
🌿
- 1.48k
InstructPix2Pix
🚀Transform images based on text instructions
- 823
Parler-TTS
🥖High-fidelity Text-To-Speech
- 107
MagicTime
🚀MagicTime: Time-lapse Video Generation Models as Metamorphic
- 48
CustomNet
🐠Customize objects in images with text prompts and viewpoints
- 234
PixArt Sigma
👁
- 46
Sd3 Api
😻Generate images from text prompts
- 1.4k
InstantMesh
📚Create a 3D model from an image in 10 seconds!
- 235
Hyper SDXL 1Step T2I
🐠Generate images from text prompts
- 1.88k
IDM VTON
👕High-fidelity Virtual Try-on
- 278
Qwen1.5 110B Chat Demo
🏃Chat with Qwen1.5-110B-Chat Bot
- 1.1k
IC Light
📈Generate relit images from your photo
- 272
Phi-3 WebGPU
🚀A private and powerful AI that runs locally in your browser
- 315
PaliGemma Demo
🤲Annotate and describe images with text prompts
- 92
Yolov10
📉Detect objects in an image
- 74
Open Sora Plan V1.1.0
⚡
- 340
Chattts Zero
🐢Generate audio from text with tuning options
- 962
ToonCrafter
😻Generate a cartoon video from two images
- 2.25k
Bark
🐶Generate realistic audio from text
- 684
Qwen2 72B Instruct
💻Chat with Qwen2-72B-instruct using a system prompt
- 125
MimicBrush
🐨Transfers textures from a reference image to a masked region in a source image
- 70
SD3 ControlNet
⚡Generate images using ControlNet with prompts
- 104
ChatTTS Speaker
🌍Browse and download ChatTTS speaker embeddings
- 257
SD3 Long Captioner
🏃Generate detailed captions for images
- 187
MassivelyMultilingualTTS
🌍Convert text to speech in multiple languages
- 192
Flash SD3
⚡Generate high-quality images from text prompts
- 748
Florence 2
📉Analyze images to generate captions, detect objects, or perform OCR
- 112
ExVideo SVD 128f V1
🐨
- 160
InternLM XComposer
🏢
- 263
Llm Pricing
📊Generate React TypeScript App
- 112
FoleyCrafter
📚Generate sound effects for silent videos
- 1.11k
PhotoMaker V2
📷Create customized face portraits using images and prompts
- 3.25k
Live Portrait
🤪Apply the motion of a video on a portrait
- 138
Diffree
🖼
- 12
ViPer
😻Generate personalized images based on user comments
- 990
Stable Fast 3D
🎮Generate a 3D mesh model from an image
- 176
Qwen2 Audio Instruct Demo
🌍Interact with a multimodal chatbot using text and audio
- 1.53k
Background Removal
🌘Remove backgrounds from images
- 162
LongWriter
💬LLM for long context
- 270
Qwen Math Demo
🧮Describe and solve math problems from images or text
- 8.1k
Kolors Virtual Try-On
👕Overlay garment on person image
- 929
CogVideoX-5B
🎥Text-to-Video
- 626
Qwen2-VL-72B
🌖Engage in multi-modal conversations with images and videos
- 54
Svd Keyframe Interpolation
🐨Generate in-between frames between two images to create a smooth video
- 471
Fish Speech 1
🏆Generate speech from text
- 477
Finegrain Object Cutter
✂Create high-quality HD cutouts with just a text prompt
- 358
GOT Online
💬Extract text from images using various OCR modes
- 21
Dream Machine
🦀
- 410
Pdf2audio
📚Generate detailed script for podcast or lecture from text input
- 389
Llama-Vision-11B
🚀Chat about images by uploading them and typing questions
- 8
Llama 3.2 90b Text Preview Groq
🌖
- 853
Whisper Turbo
🤯Transcribe audio or YouTube videos to text
- 1.06k
Open NotebookLM
🎙Personalised Podcasts For All - Available in 13 Languages
- 64
Podcastfy.ai - An Open Source alternative to NotebookLM's podcast feature
🚀Generate a podcast from text, URLs, PDFs, and images
- 303
PMRF
🖼A gradio demo for Posterior-Mean Rectified Flow (PMRF)
- 2.1k
F5-TTS
🗣F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- 253
MaskGCT TTS Demo
😻MaskGCT TTS Demo
- 139
Fish Agent
💬An end-to-end (e2e) Voice Language Model by Fish Audio.
- 1.44k
Qwen2.5 Coder Artifacts
🐢Generate code from a description
- 428
Qwen2.5 Coder Demo
👁Chat with a Qwen AI assistant
- 366
SeedEdit-APP-V1.0
🎨Generate and edit images from text instructions
- 370
Qwen2.5 Turbo 1M Demo
💻Upload documents for Q&A
- 1.03k
OOTDiffusion
🥼High-quality virtual try-on ~ Your cyber fitting room
- 1.69k
MagicQuill
🪶Edit and enhance images with custom color and edge modifications
- 829
OminiControl
🌍Generate detailed images from a prompt and an image
- 646
IC Light V2-Vary
📈Execute commands based on environment variables
- 41
TryOffDiff
🔥Extract garment images from everyday images!
- 561
QVQ 72B Preview
🌍Upload images and ask questions to get answers
- 102
Janus Pro 7b
🌍A unified multimodal understanding and generation model.
- 1.91k
Chat With Janus-Pro-7B
🌍A unified multimodal understanding and generation model.
- 279
Llasa 3b Tts
🔥Zero Shot voice cloning with llasa 3b (Unofficial Demo)
- 88
Paligemma2 Mix
🌖Generate text or segment objects from an image
- 340
Gemini Co-Drawing
✏Gemini 2.0 native image generation co-doodling
- 518
Di♪♪Rhythm
🎶Blazingly Fast and Embarrassingly Simple Song Generation
- 513
InfiniteYou-FLUX
📸Flexible Photo Recrafting While Preserving Your Identity
- 74
Text2Human
🏃Generate human images from text descriptions and poses
- 987
GFPGAN
😁Enhance facial details in images