MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25 • 24
Agent-SafetyBench: Evaluating the Safety of LLM Agents Paper • 2412.14470 • Published Dec 19, 2024 • 13
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement Paper • 2502.16776 • Published Feb 24 • 6
Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen! Paper • 2505.15656 • Published May 21 • 14
How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study Paper • 2505.15404 • Published May 21 • 13
BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs Paper • 2505.13529 • Published May 18 • 11
Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints Paper • 2503.01865 • Published Feb 25
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning Paper • 2505.11896 • Published May 17 • 58
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 208