6 10 64

Yang

jacklanda

AI & ML interests

Language Modeling, Lexical Semantics

Recent Activity

upvoted a paper about 16 hours ago

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

liked a dataset 1 day ago

uq-project/uq

liked a dataset 4 days ago

AI-MO/NuminaMath-CoT

View all activity

Organizations

None yet

upvoted a paper about 16 hours ago

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

Paper • 2508.19594 • Published 1 day ago • 4

liked a dataset 1 day ago

uq-project/uq

Viewer • Updated 2 days ago • 500 • 237 • 9

liked a dataset 4 days ago

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 4.7k • 488

liked a Space 10 days ago

Pipeline Parallelism Schedule Visualizer

📊

Visualize pipeline parallelism schedules

liked a model 23 days ago

openai/gpt-oss-20b

Text Generation • 22B • Updated 2 days ago • 8.06M • • 3.31k

liked a model 29 days ago

Goodfire/DeepSeek-R1-SAE-l37

Updated Apr 21 • 15

upvoted a paper 2 months ago

Resa: Transparent Reasoning Models via SAEs

Paper • 2506.09967 • Published Jun 11 • 22

authored a paper 2 months ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published Jun 10 • 31

upvoted a paper 3 months ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published Jun 10 • 31

New activity in RuleReasoner/rule-reasoning 3 months ago

Upload README.md with huggingface_hub

#2 opened 3 months ago by

jacklanda

Upload folder using huggingface_hub

#1 opened 3 months ago by

jacklanda

liked a dataset 3 months ago

bigai-nlco/ReflectionEvo

Viewer • Updated Jun 4 • 437k • 474 • 11

authored 2 papers 3 months ago

In-Context Meta LoRA Generation

Paper • 2501.17635 • Published Jan 29

ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection

Paper • 2505.16475 • Published May 22 • 2

upvoted a paper 3 months ago

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 27

upvoted a paper 4 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184

liked a model 5 months ago

virtuoussy/Qwen2.5-7B-Instruct-RLVR

8B • Updated May 4 • 85 • 14

liked a dataset 5 months ago

opendatalab/ProverQA

Preview • Updated Jun 11 • 34 • 6

upvoted a paper 5 months ago

OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

Paper • 2503.22952 • Published Mar 29 • 18

liked a dataset 6 months ago

facebook/natural_reasoning

Viewer • Updated Feb 21 • 1.15M • 1.55k • 516

Yang

AI & ML interests

Recent Activity

Organizations

jacklanda's activity

Pipeline Parallelism Schedule Visualizer

Upload README.md with huggingface_hub

Upload folder using huggingface_hub