shane are's picture

shane are

arionsingul

·

AI & ML interests

None yet

Recent Activity

liked a Space 7 days ago

MathematicalModelingAgent/MathematicalModelingAgent

liked a dataset 26 days ago

m-a-p/Matrix

liked a model about 1 month ago

NSFW-API/NSFW_Wan_1.3b

View all activity

Organizations

upvoted a paper 3 months ago

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

Paper • 2304.08818 • Published Apr 18, 2023 • 8

upvoted a collection 3 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Dec 4, 2025 • 184

upvoted an article 7 months ago

Article

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Jun 3, 2025

•

71

upvoted a collection 7 months ago

SkyReels-AX

7 items • Updated Apr 13, 2025 • 8

upvoted an article 7 months ago

Article

DiffRhythm: Revolutionizing Open Source AI Music Generator

Mar 5, 2025

•

13

upvoted a collection 9 months ago

AQLM+PV

Official AQLM quantizations for "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression": https://arxiv.org/abs/2405.14852 • 26 items • Updated Feb 28, 2025 • 22

upvoted a paper 9 months ago

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Paper • 2502.14837 • Published Feb 20, 2025 • 3

upvoted a collection 10 months ago

interesting papers

10 items • Updated Mar 20, 2025 • 1

upvoted 3 papers 11 months ago

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication

Paper • 2402.18439 • Published Feb 28, 2024 • 1

OneLLM: One Framework to Align All Modalities with Language

Paper • 2312.03700 • Published Dec 6, 2023 • 24

LLMPirate: LLMs for Black-box Hardware IP Piracy

Paper • 2411.16111 • Published Nov 25, 2024 • 1

upvoted an article 11 months ago

Article

Optimum-NVIDIA Unlocking blazingly fast LLM inference in just 1 line of code

Dec 5, 2023

•

5

upvoted a collection about 1 year ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 13 days ago • 52