Aryan V S
a-r-r-o-w
158 followers · 108 following
AI & ML interests
computer vision, reinforcement learning
Recent Activity
reacted to AdinaY's post with 🔥, 4 days ago
Kimi-K2 is now available on the hub🔥🚀 This is a trillion-parameter MoE model focused on long context, code, reasoning, and agentic behavior. https://huggingface.co/collections/moonshotai/kimi-k2-6871243b990f2af5ba60617d ✨ Base & Instruct ✨ 1T total / 32B active - Modified MIT License ✨ 128K context length ✨ Muon optimizer for stable trillion-scale training
upvoted an article, 8 days ago
Creating custom kernels for the AMD MI300
posted an update, 8 days ago
Caching is an essential technique in diffusion inference serving for speeding up image/video generation. Diffusers just added support for another caching method: First Block Cache, a technique developed by @chengzeyi that builds on the ideas of TeaCache.

The idea in short: if the model predictions do not vary much over successive inference steps, we can skip the steps where the prediction difference is small. To figure out whether an inference step will make a significant improvement to the overall velocity/noise prediction, we calculate the relative difference between the output of the first transformer block at timestep $t$ and at timestep $t-1$, and compare it against a selected threshold. If the difference is lower than the threshold, we skip the step. A higher threshold leads to more steps being skipped. However, skipping too many steps can throw off the model predictions, so the threshold needs to be tested and selected for the desired quality-speed tradeoff on every model it is used with.

Diffusers usage with CogView4:

```python
import torch
from diffusers import CogView4Pipeline
from diffusers.hooks import apply_first_block_cache, FirstBlockCacheConfig

# Load the pipeline in bfloat16 and move it to the GPU
pipe = CogView4Pipeline.from_pretrained("THUDM/CogView4-6B", torch_dtype=torch.bfloat16)
pipe.to("cuda")

# Enable First Block Cache on the transformer; a higher threshold skips more steps
apply_first_block_cache(pipe.transformer, FirstBlockCacheConfig(threshold=0.2))

prompt = "A photo of an astronaut riding a horse on mars"
# Fixed seed so runs with different thresholds can be compared
image = pipe(prompt, generator=torch.Generator().manual_seed(42)).images[0]
image.save("output.png")
```

Below, you'll find the benchmarks and visualizations of the predicted output at different blocks of the Flux DiT.

Docs: https://huggingface.co/docs/diffusers/main/en/optimization/cache
PR: https://github.com/huggingface/diffusers/pull/11180

References:
- First Block Cache: https://github.com/chengzeyi/ParaAttention
- TeaCache: https://github.com/ali-vilab/TeaCache
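The skip decision described in the post can be sketched in plain Python. This is only an illustration under stated assumptions: the function names `relative_difference` and `should_skip_step` are hypothetical, the metric (mean absolute change divided by mean absolute magnitude of the previous output) is one plausible reading of "relative difference", and the toy numbers are made up. The actual Diffusers hook may compute and cache things differently.

```python
def relative_difference(current, previous, eps=1e-8):
    """Mean absolute change, relative to the mean absolute magnitude of `previous`.

    Assumption: this particular metric is chosen for illustration only.
    """
    n = len(previous)
    mean_abs_change = sum(abs(c - p) for c, p in zip(current, previous)) / n
    mean_abs_previous = sum(abs(p) for p in previous) / n
    return mean_abs_change / (mean_abs_previous + eps)


def should_skip_step(current, previous, threshold):
    """Skip the remaining blocks when the first-block output barely changed."""
    if previous is None:
        # Nothing to compare against on the very first step, so compute fully.
        return False
    return relative_difference(current, previous) < threshold


# Toy stand-ins for the first transformer block's output at successive timesteps.
first_block_outputs = [
    [1.00, 2.00, 3.00],
    [1.10, 2.10, 3.10],  # small change relative to the previous step
    [2.50, 0.50, 4.50],  # large change relative to the previous step
]

previous = None
for step, output in enumerate(first_block_outputs):
    skip = should_skip_step(output, previous, threshold=0.2)
    print(f"step {step}: {'skip' if skip else 'compute'}")
    previous = output
```

Raising `threshold` in this sketch makes more steps qualify for skipping, which mirrors the quality-speed tradeoff the post describes.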
a-r-r-o-w's activity
liked a dataset, 8 days ago
GPUMODE/kernelbot-data • Viewer • Updated 23 days ago • 65.9k • 1.77k • 27
liked a dataset, 18 days ago
FreedomIntelligence/ShareGPT-4o-Image • Viewer • Updated 16 days ago • 92.3k • 5.29k • 80
liked a model, 21 days ago
black-forest-labs/FLUX.1-Kontext-dev • Image-to-Image • Updated 19 days ago • 295k • 1.7k
liked a model, 2 months ago
Nebsh/MotionLoraNebCollection • Updated Mar 4, 2024 • 21
liked a dataset, 3 months ago
VisualCloze/Graph200K • Viewer • Updated May 18 • 183k • 2.59k • 16
liked a Space, 3 months ago
Memory Viz 🧠 • Running • 7
liked 2 datasets, 3 months ago
gofixyourself/EasyPortrait • Preview • Updated May 12, 2023 • 76 • 7
facebook/PE-Video • Viewer • Updated Apr 18 • 118k • 8.78k • 32
liked 2 models, 3 months ago
lllyasviel/FramePackI2V_HY • Updated Apr 13 • 101k • 115
Wan-AI/Wan2.1-FLF2V-14B-720P • Updated Apr 17 • 3.89k • 214
liked a dataset, 3 months ago
OpenVideo/pexel-0808-complete-final-test • Viewer • Updated Feb 12 • 104k • 53 • 4
liked 2 datasets, 4 months ago
diffusers/ShotDEAD-v0 • Viewer • Updated Mar 5 • 88.8k • 112 • 4
bigdata-pw/TheSimpsons • Viewer • Updated Sep 5, 2024 • 79.4k • 27 • 3
liked 2 models, 4 months ago
Cseti/Wan-LoRA-Arcane-Jinx-v1 • Text-to-Image • Updated May 13 • 21 • 11
THUDM/CogView4-6B • Text-to-Image • Updated Mar 11 • 22.8k • 238
liked 2 datasets, 5 months ago
OpenGVLab/MVBench • Viewer • Updated Oct 18, 2024 • 4k • 9.7k • 35
KwaiVGI/360Motion-Dataset • Viewer • Updated Jan 22 • 52 • 466 • 31
liked a model, 5 months ago
Wan-AI/Wan2.1-T2V-14B • Text-to-Video • Updated Mar 12 • 94.9k • 1.35k
liked a Space, 5 months ago
Wan2.1 💻 • Running • 1.8k • Wan: Open and Advanced Large-Scale Video Generative Models
liked a dataset, 5 months ago
nicolaus-huang/PhotoDoodle • Viewer • Updated 18 days ago • 2.87k • 1.05k • 7