Describe Anything Model

Activity Feed

AI & ML interests

None defined yet.

Recent Activity


longlian authored a paper 7 days ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published 9 days ago • 40

longlian authored a paper about 2 months ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 22

richardaecn authored a paper 3 months ago

World Simulation with Video Foundation Models for Physical AI

Paper • 2511.00062 • Published Oct 28, 2025 • 41

richardaecn authored a paper 8 months ago

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Paper • 2505.18116 • Published May 23, 2025 • 4

longlian updated a model 9 months ago

DescribeAnythingModel/dam_3b_v1_self_contained

Updated Apr 23, 2025 • 2 • 1

longlian published a model 9 months ago

DescribeAnythingModel/dam_3b_v1_self_contained

Updated Apr 23, 2025 • 2 • 1

longlian authored 2 papers 9 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21, 2025 • 44

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22, 2025 • 63

richardaecn authored a paper 9 months ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22, 2025 • 63

richardaecn authored 11 papers 10 months ago

Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception

Paper • 2305.06324 • Published May 10, 2023 • 1

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

Paper • 2004.12276 • Published Apr 26, 2020 • 1

Spatiotemporal Contrastive Video Representation Learning

Paper • 2008.03800 • Published Aug 9, 2020

Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

Paper • 2012.07177 • Published Dec 13, 2020

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Paper • 2306.01736 • Published Jun 2, 2023 • 1

Open-vocabulary Object Detection via Vision and Language Knowledge Distillation

Paper • 2104.13921 • Published Apr 28, 2021

VideoGLUE: Video General Understanding Evaluation of Foundation Models

Paper • 2307.03166 • Published Jul 6, 2023 • 5

A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models

Paper • 2302.06235 • Published Feb 13, 2023

Team members: 2