Art Atk

ArtAtk

AI & ML interests

Multimodal Models

Recent Activity

upvoted a paper 2 days ago

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

liked a Space 2 months ago

ACE-Step/ACE-Step

upvoted a paper 2 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

View all activity

Organizations

None yet

upvoted a paper 2 days ago

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published 7 days ago • 98

liked a Space 2 months ago

506

ACE Step

😻

A Step Towards Music Generation Foundation Model

upvoted 2 papers 2 months ago

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

Paper • 2505.02922 • Published May 5 • 28

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

Paper • 2505.03730 • Published May 6 • 28

upvoted 6 papers 3 months ago

DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Paper • 2504.14509 • Published Apr 20 • 51

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Paper • 2504.14899 • Published Apr 21 • 21

WORLDMEM: Long-term Consistent World Simulation with Memory

Paper • 2504.12369 • Published Apr 16 • 34

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published Apr 11 • 47

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11 • 129

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published Apr 10 • 49

liked a model 3 months ago

Skywork/SkyReels-A2

Updated Apr 8 • 200 • 134

upvoted 2 papers 3 months ago

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

Paper • 2504.02542 • Published Apr 3 • 49

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31 • 77

upvoted 5 papers 4 months ago

Synthetic Video Enhances Physical Fidelity in Video Synthesis

Paper • 2503.20822 • Published Mar 26 • 16

Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models

Paper • 2503.18446 • Published Mar 24 • 12

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 120

FFN Fusion: Rethinking Sequential Computation in Large Language Models

Paper • 2503.18908 • Published Mar 24 • 20

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published Mar 17 • 96

liked a Space 4 months ago

186

Hunyuan3D 2mini Turbo

🔥

Fast Images-to-3D Generation within 1 Second

upvoted a paper 4 months ago

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 142