Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
SkalskiP 's Collections
CVPR 2025
Zero-Shot Detection and Segmentation
OpenAI Vision API
LMMs - Large Multimodal Models

CVPR 2025

updated Jun 11

A collection of models and demos linked to papers presented at CVPR 2025.

Upvote
1

  • Running on Zero
    MCP
    28
    28

    Gaze LLE

    👀

    Gaze Target Estimation


  • Running on Zero
    315
    315

    vggt

    🏆

    VGGT (CVPR 2025)


  • Running on Zero
    29
    29

    UniK3D Demo

    🏢

    UniK3D (CVPR 2025)


  • Running on Zero
    176
    176

    DepthCrafter

    🦀

    a super consistent video depth model


  • Running on L40S
    177
    177

    Video Depth Anything

    👀

    Generate depth video from input video


  • Running on Zero
    812
    812

    MMAudio — generating synchronized audio from video/text

    🔊

    Generate audio from video or text prompts


  • Running on Zero
    33
    33

    Semantic Draw Canvas X Animagine XL 3.1

    🔥

    Create and share 2K arts in 30s with Animagine XL 3.1


  • Running on Zero
    16
    16

    MINIMA

    📈


  • Runtime error
    34
    34

    EdgeTAM

    🚀

    On-Device Track Anything Model


  • Running on L4
    51
    51

    HSMR

    💀

    Convert images of humans to biomechanically accurate 3D skeletons


  • Running on L4
    178
    178

    MatAnyone

    🤡

    Gradio demo for MatAnyone


  • Running on Zero
    116
    116

    Molmo 7B D 0924

    👁


  • Running on Zero
    41
    41

    Magma UI

    📚

    Magma-8B model for UI Agents


  • Running on Zero
    237
    237

    ShowUI

    💻

    Generate clickable coordinates on a screenshot

Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs