CVPR Demo Track

non-profit

http://cvpr2022.thecvf.com/

AI & ML interests

CVPR Demo Track @ CVPR 2022

Recent Activity

Mrinal authored a paper about 1 month ago

Multilingual State Space Models for Structured Question Answering in Indic Languages

IAMJB authored a paper about 2 months ago

SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning

noamrot authored a paper about 2 months ago

SingLoRA: Low Rank Adaptation Using a Single Matrix

Nymbo

posted an update 1 day ago

Post

242

I built a general use MCP space ~ Fetch webpages, DuckDuckGo search, Python code execution, Kokoro TTS, Image Gen, Video Gen.

# Tools

1. Fetch webpage
2. Web search via DuckDuckGo (very concise, low excess context)
3. Python code executor
4. Kokoro-82M speech generation
5. Image Generation (use any model from HF Inference Providers)
6. Video Generation (use any model from HF Inference Providers)

The first four tools can be used without any API keys whatsoever. DDG search is free, and the code execution and speech gen are done on CPU. Having a HF_READ_TOKEN in the env variables will show all tools. If there isn't a token present, the Image/Video Gen tools are hidden.
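The token-based gating described in this post could look roughly like the sketch below. This is not the Space's actual code: the tool identifiers and the `visible_tools` helper are hypothetical names chosen for illustration, and only the `HF_READ_TOKEN` variable name comes from the post.

```python
import os

# Hypothetical identifiers for the six tools listed in the post.
KEYLESS_TOOLS = ["fetch_webpage", "duckduckgo_search", "python_executor", "kokoro_tts"]
TOKEN_GATED_TOOLS = ["image_generation", "video_generation"]


def visible_tools(env=None):
    """Return the tools to expose, hiding the gated ones when no token is set."""
    env = os.environ if env is None else env
    tools = list(KEYLESS_TOOLS)
    # An unset or empty HF_READ_TOKEN is treated as "no token" (an assumption).
    if env.get("HF_READ_TOKEN"):
        tools += TOKEN_GATED_TOOLS
    return tools


if __name__ == "__main__":
    print(visible_tools())
```

Passing a plain dict as `env` makes the behavior easy to check without touching the real environment, for example `visible_tools({})` versus `visible_tools({"HF_READ_TOKEN": "hf_example"})`.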

Nymbo/Tools

Nymbo

posted an update 9 days ago

Post

624

Anyone using Jan-v1-4B for local MCP-based web search, I highly recommend you try out Intelligent-Internet/II-Search-4B

Very impressed with this lil guy and it deserves more downloads. It's based on the original version of Qwen3-4B but I find that it questions reality way less often. Jan-v1 seems to think that everything it sees is synthetic data and constantly gaslights me.

Nyandwi

authored a paper 11 days ago

Grounding Multilingual Multimodal LLMs With Cultural Knowledge

Paper • 2508.07414 • Published 15 days ago

BwZhang

authored 4 papers 19 days ago

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Paper • 2112.10762 • Published Dec 20, 2021

GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

Paper • 2403.19655 • Published Mar 28, 2024 • 20

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Paper • 2407.06938 • Published Jul 9, 2024 • 25

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Paper • 2507.23785 • Published 25 days ago • 18

AtAndDev

posted an update about 1 month ago

Post

419

Qwen 3 Coder is a personal attack on K2, and I love it.
It achieves near SOTA on LCB while not having reasoning.
Finally people are understanding that reasoning isn't necessary for high benches...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE

noamrot

authored a paper about 2 months ago

SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8 • 111

Nymbo

posted an update about 2 months ago

Post

2809

Anyone know how to reset Claude web's MCP config? I connected mine when the HF MCP first released with just the default example spaces added. I added lots of other MCP spaces but Claude.ai doesn't update the available tools... "Disconnecting" the HF integration does nothing, deleting it and adding it again does nothing.

Refreshing tools works fine in VS Code because I can manually restart it in mcp.json, but claude.ai has no such option. Anyone got any ideas?

4 replies

deepkyu

authored a paper 3 months ago

Seeing Voices: Generating A-Roll Video from Audio with Mirage

Paper • 2506.08279 • Published Jun 9 • 28

mehdidc

authored 5 papers 3 months ago

A Comparative Study on Generative Models for High Resolution Solar Observation Imaging

Paper • 2304.07169 • Published Apr 14, 2023

DataComp: In search of the next generation of multimodal datasets

Paper • 2304.14108 • Published Apr 27, 2023 • 2

Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models

Paper • 2406.02061 • Published Jun 4, 2024 • 2

A Practitioner's Guide to Continual Multimodal Pretraining

Paper • 2408.14471 • Published Aug 26, 2024

Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets

Paper • 2506.04598 • Published Jun 5 • 6

abidlabs

posted an update 3 months ago

Post

3642

The Gradio x Agents x MCP hackathon keeps growing! We now have more than $1,000,000 in credits for participants and >$16,000 in cash prizes for winners.

We've kept registration open until the end of this week, so join and let's build cool stuff together as a community: https://huggingface.co/spaces/ysharma/gradio-hackathon-registration-2025

AtAndDev

posted an update 3 months ago

Post

2964

deepseek-ai/DeepSeek-R1-0528

This is the end

1 reply

Nymbo

posted an update 4 months ago

Post

4091

Haven't seen this posted anywhere - Llama-3.3-8B-Instruct is available on the new Llama API. Is this a new model or did someone mislabel Llama-3.1-8B?

1 reply

Nymbo

posted an update 4 months ago

Post

2765

PSA for anyone using Nymbo/Nymbo_Theme or Nymbo/Nymbo_Theme_5 in a Gradio space ~

Both of these themes have been updated to fix some of the long-standing inconsistencies ever since the transition to Gradio v5. Textboxes are no longer bright green and in-line code is readable now! Both themes are now visually identical across versions.

If your space is already using one of these themes, you just need to restart your space to get the latest version. No code changes needed.

Team members 266

CVPR's activity