3 8 9

dnk

dnkdnk

AI & ML interests

None yet

Recent Activity

liked a dataset 30 days ago

ontocord/MixtureVitae-VALID

upvoted a paper about 1 month ago

MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs

updated a dataset 3 months ago

dnkdnk/CVTG-2K

View all activity

Organizations

None yet

liked a dataset 30 days ago

ontocord/MixtureVitae-VALID

Updated Apr 26 • 3.82k • 15

upvoted a paper about 1 month ago

MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs

Paper • 2506.01674 • Published Jun 2 • 27

updated a dataset 3 months ago

dnkdnk/CVTG-2K

Updated Apr 1 • 25 • 3

liked a dataset 3 months ago

dnkdnk/CVTG-2K

Updated Apr 1 • 25 • 3

upvoted a paper 3 months ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published Mar 30 • 95

published a dataset 3 months ago

dnkdnk/CVTG-2K

Updated Apr 1 • 25 • 3

New activity in openai/clip-vit-large-patch14 5 months ago

OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it...

👀 2

#31 opened 11 months ago by

dnkdnk

upvoted a paper 6 months ago

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published Jan 6 • 56

upvoted 3 papers 7 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 146

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 97

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption

Paper • 2412.09283 • Published Dec 12, 2024 • 19

liked a dataset 7 months ago

AnonMegumi/InstanceVid

Preview • Updated Dec 16, 2024 • 89 • 3

liked a Space 7 months ago

RAG Demo

👀

Generate images based on detailed prompts and layouts

liked a model 8 months ago

black-forest-labs/FLUX.1-Canny-dev

Text-to-Image • Updated 11 days ago • 7.34k • • 208

upvoted a paper 8 months ago

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Paper • 2411.06558 • Published Nov 10, 2024 • 37

New activity in openai/clip-vit-large-patch14 11 months ago

OSError: It looks like the config file is not a valid JSON file.

👍 2

#2 opened almost 3 years ago by

xvjiarui

liked a Space 12 months ago

442

Open Sora

⚡

liked a Space about 1 year ago

331

MLLM-guided Image Editing (MGIE)

👩

Transform images based on textual instructions

liked a model about 1 year ago

tsujuifu/ml-mgie

Updated Feb 9, 2024 • 22

upvoted a paper about 1 year ago

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Paper • 2407.02371 • Published Jul 2, 2024 • 55

dnk

AI & ML interests

Recent Activity

Organizations

dnkdnk's activity

OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it...

RAG Demo

OSError: It looks like the config file is not a valid JSON file.

Open Sora

MLLM-guided Image Editing (MGIE)