lhl PRO

leonardlin

https://randomfoo.net/

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

mitomtuna/MiMo-V2.5-0703-NVFP4-TP3

liked a model 4 days ago

sakamakismile/ThinkingCap-Qwen3.6-27B-NVFP4

liked a model 4 days ago

sakamakismile/Qwen3.6-27B-MTP-pi-tune-NVFP4

upvoted an article about 1 month ago

Article

GLM-5.2: Built for Long-Horizon Tasks

zai-org • Jun 17 • 130

upvoted an article 4 months ago

Article

TRL v1.0: Post-Training Library Built to Move with the Field

qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 58

upvoted a collection 4 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.72k

upvoted 2 papers 6 months ago

AnyDepth: Depth Estimation Made Easy

Paper • 2601.02760 • Published Jan 6 • 11

JP-TL-Bench: Anchored Pairwise LLM Evaluation for Bidirectional Japanese-English Translation

Paper • 2601.00223 • Published Jan 1 • 2

upvoted 2 articles 7 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate • Dec 4, 2025 • 630

Article

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

nvidia • Dec 17, 2025 • 50

upvoted a paper 8 months ago

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

Paper • 2412.04144 • Published Dec 5, 2024 • 6

upvoted a collection 8 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 174

upvoted a collection 9 months ago

Granite 4.0 Nano Language Models

Ultra-compact language models designed for the edge and on-device deployment. • 9 items • Updated Apr 29 • 103

upvoted 2 papers 9 months ago

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Paper • 2510.18855 • Published Oct 21, 2025 • 73

Recent Advances in Speech Language Models: A Survey

Paper • 2410.03751 • Published Oct 1, 2024 • 2

upvoted 2 collections 11 months ago

Ovis2.5

Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19, 2025 • 58

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67

upvoted a paper about 1 year ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20, 2025 • 79

upvoted a collection about 1 year ago

ChatVector

Japanese LLMs built solely by adding and subtracting weights between models • 3 items • Updated Mar 2 • 2

upvoted a paper about 1 year ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30, 2025 • 56

upvoted a collection over 1 year ago

Shisa V2

A family of bilingual JA/EN LLMs • 32 items • Updated Jun 4, 2025 • 9

upvoted 2 papers over 1 year ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9, 2025 • 78