4 10 28

Tim Wu

changtimwu

AI & ML interests

DL,IoT,Devop

Recent Activity

liked a model 2 days ago

RedHatAI/Qwen3-32B-NVFP4

new activity about 1 month ago

omeng-nvidia/saved_models_Qwen3-30B-A3B_nvfp4_hf:Can you explain how this model was built?

liked a model about 2 months ago

Qwen/Qwen3-32B-FP8

View all activity

Organizations

liked a model 2 days ago

RedHatAI/Qwen3-32B-NVFP4

Text Generation • 19B • Updated about 1 month ago • 1.45k • 3

New activity in omeng-nvidia/saved_models_Qwen3-30B-A3B_nvfp4_hf about 1 month ago

Can you explain how this model was built?

#2 opened about 1 month ago by

changtimwu

liked a model about 2 months ago

Qwen/Qwen3-32B-FP8

Text Generation • 33B • Updated 5 days ago • 76.4k • 57

liked a Space 3 months ago

2.85k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 4 months ago

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving

Paper • 2401.09670 • Published Jan 18, 2024 • 2

upvoted an article 5 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 633

liked a model 5 months ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated May 1 • 322k • 1.46k

liked a Space 5 months ago

257

Qwen2.5 VL 72B Instruct

💻

Interact with a multimodal chatbot using text and images

upvoted a paper 6 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 127

liked a model 6 months ago

QuantFactory/Llama-3.2-Taiwan-Legal-3B-Instruct-GGUF

Text Generation • 3B • Updated Nov 2, 2024 • 1.04k • 11

upvoted 2 papers 6 months ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 100

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123

liked a Space 11 months ago

115

Llama3.1 S V0.2 Checkpoint 2024 08 20

😻

Convert text to audio and vice versa

liked 2 models about 1 year ago

shenzhi-wang/Llama3.1-8B-Chinese-Chat

Text Generation • 8B • Updated Jul 29, 2024 • 4.71k • • 262

openbmb/MiniCPM-Llama3-V-2_5-gguf

Updated Feb 27 • 2.77k • 213

liked a Space about 1 year ago

215

Microsoft Phi-3-Vision-128k

😻

Generate image descriptions

liked a model about 1 year ago

google/paligemma-3b-pt-224

Image-Text-to-Text • 3B • Updated Sep 21, 2024 • 41.7k • 340

updated a model about 1 year ago

changtimwu/speaker-segmentation-fine-tuned-callhome-jpn

0.0B • Updated May 2, 2024 • 5

liked 2 models over 1 year ago

crusoeai/Llama-3-8B-Instruct-262k-GGUF

8B • Updated May 5, 2024 • 378 • 48

bullerwins/gradientai_Llama-3-8B-Instruct-262k_exl2_8.0bpw

Text Generation • Updated Apr 26, 2024 • 4 • 3