HanWang

eseedo

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

microsoft/bitnet-b1.58-2B-4T

upvoted a collection 11 days ago

liked a Space 13 days ago

3DAIGC/LHM

Organizations

eseedo's activity

upvoted a collection 11 days ago

HiDream-I1

A collection of HiDream-I1 models. • 4 items • Updated 18 days ago • 26

upvoted a collection about 1 month ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 26 days ago • 448

upvoted a collection about 2 months ago

Phi-4

Phi-4 family of small language and multi-modal models. • 9 items • Updated 9 days ago • 117

upvoted a paper 2 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 155

upvoted a collection 2 months ago

Deepseek Papers

Deepseek papers collection • 19 items • Updated 22 days ago • 191

upvoted an article 7 months ago

Exploring the Daily Papers Page on Hugging Face

Article • Sep 23, 2024 • 54

upvoted 2 collections 9 months ago

Llama3.1-Chinese-Chat

2 items • Updated Jul 26, 2024 • 7

H2O Danube3

7 items • Updated Nov 30, 2024 • 57

upvoted a paper about 1 year ago

Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video

Paper • 2404.09833 • Published Apr 15, 2024 • 31

upvoted 2 papers over 1 year ago

Masked Audio Generation using a Single Non-Autoregressive Transformer

Paper • 2401.04577 • Published Jan 9, 2024 • 44

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Paper • 2401.01614 • Published Jan 3, 2024 • 23

upvoted a collection over 1 year ago

LLMs

16 items • Updated Jan 4, 2024 • 3

upvoted 4 papers over 1 year ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 184

DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision

Paper • 2312.16256 • Published Dec 26, 2023 • 17

PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar

Paper • 2312.14239 • Published Dec 21, 2023 • 12

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Paper • 2312.09911 • Published Dec 15, 2023 • 55

upvoted a collection over 1 year ago

Image to 3D

11 items • Updated Aug 20, 2024 • 8