1 4 3

zuijiang

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

commented a paper 6 days ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

authored a paper about 2 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

View all activity

Organizations

zuijiang's activity

upvoted a paper 6 days ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published 9 days ago • 15

commented a paper 6 days ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published 9 days ago • 15 •

authored a paper about 2 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 20

upvoted a paper about 2 months ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published Nov 18, 2024 • 20

updated 2 datasets 5 months ago

zuijiang/alpaca-alpaca-clean

Viewer • Updated Aug 26, 2024 • 51.8k • 29

zuijiang/mistral-alpaca-clean

Viewer • Updated Aug 25, 2024 • 51.8k • 29

liked a dataset 6 months ago

AIcell/MOSSBench

Viewer • Updated Jul 6, 2024 • 544 • 420 • 4

liked a Space 6 months ago

Running on Zero

1.61k

🗣️

Voice Clone

updated a model 7 months ago

zuijiang/llava-qwen1.5-14B-chat

Text2Text Generation • Updated Jul 1, 2024 • 15

updated a dataset 8 months ago

zuijiang/ocr_vqa

Viewer • Updated May 30, 2024 • 208k • 49

liked a dataset 8 months ago

danielz01/laion-5b

Updated Feb 14, 2024 • 20

upvoted 2 papers over 1 year ago

RAIN: Your Language Models Can Align Themselves without Finetuning

Paper • 2309.07124 • Published Sep 13, 2023 • 3

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Paper • 2309.00267 • Published Sep 1, 2023 • 47