Tung-Lin Wu's picture

Tung-Lin Wu

tunglinwu

·

tunglinwood

AI & ML interests

None yet

Organizations

None yet

upvoted 2 collections 4 months ago

Qwen3

84 items • Updated 10 days ago • 1.1k

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated Jun 30 • 130

upvoted 2 papers 4 months ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 222

Training Sparse Mixture Of Experts Text Embedding Models

Paper • 2502.07972 • Published Feb 11 • 7

upvoted a collection 4 months ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 26 days ago • 155

upvoted a paper 5 months ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 127

upvoted an article 5 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

By

•

May 28, 2024

• 241

upvoted 3 papers 6 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 241

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24 • 31

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 416

upvoted 3 articles 6 months ago

Article

What is test-time compute and how to scale it?

By

and 1 other •

Feb 6

• 100

Article

Mixture of Experts Explained

By

and 5 others •

Dec 11, 2023

• 826

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.28k

upvoted a collection 8 months ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 180

upvoted a paper 9 months ago

HelpSteer2-Preference: Complementing Ratings with Preferences

Paper • 2410.01257 • Published Oct 2, 2024 • 25

upvoted a collection 10 months ago

Emu3

Emu3: Next-Token Prediction is All You Need • 7 items • Updated Feb 13 • 75

upvoted a paper 11 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 75

upvoted a collection 11 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 628

upvoted a paper about 1 year ago

Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches

Paper • 2408.04567 • Published Aug 8, 2024 • 27