2 5 2

ZHANG YONG

ReRaWo

yzhangchuck

AI & ML interests

ReRaWo = Reasoning and Rationale World

Recent Activity

new activity about 11 hours ago

ReRaWo/Sentinel:Add pipeline tag

liked a model 1 day ago

ReRaWo/Sentinel

updated a model 1 day ago

ReRaWo/Sentinel

View all activity

Organizations

None yet

ReRaWo's activity

New activity in ReRaWo/Sentinel about 11 hours ago

Add pipeline tag

#2 opened about 11 hours ago by

nielsr

liked a model 1 day ago

ReRaWo/Sentinel

Text Classification • Updated about 11 hours ago • 6 • 1

updated a model 1 day ago

ReRaWo/Sentinel

Text Classification • Updated about 11 hours ago • 6 • 1

New activity in ReRaWo/Sentinel 1 day ago

Upload 3 files

#1 opened 1 day ago by

ReRaWo

published a model 1 day ago

ReRaWo/Sentinel

Text Classification • Updated about 11 hours ago • 6 • 1

upvoted 2 papers 1 day ago

Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models

Paper • 2501.01059 • Published Jan 2 • 1

Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation

Paper • 2502.12744 • Published Feb 18 • 1

authored 2 papers 2 days ago

Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models

Paper • 2501.01059 • Published Jan 2 • 1

Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective

Paper • 2505.23277 • Published 22 days ago • 1

upvoted a paper 2 days ago

Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective

Paper • 2505.23277 • Published 22 days ago • 1

authored 3 papers 4 days ago

Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Paper • 2402.00530 • Published Feb 1, 2024 • 1

From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning

Paper • 2308.12032 • Published Aug 23, 2023 • 1

Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation

Paper • 2502.12744 • Published Feb 18 • 1

upvoted a paper 2 months ago

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published Apr 14 • 40

upvoted an article 4 months ago

Article

Open R1: Update #2

and 6 others •

Feb 10

• 214

liked a model over 1 year ago

meta-llama/Llama-2-7b-hf

Text Generation • Updated Apr 17, 2024 • 565k • 2.09k