Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
Michael Yu
michaelwaves
Follow
Anaya3D's profile picture
gryphon0E's profile picture
katya228's profile picture
6 followers
·
8 following
michaelwaves
AI & ML interests
None yet
Recent Activity
updated
a dataset
14 days ago
michaelwaves/activations-test
published
a dataset
14 days ago
michaelwaves/activations-test
updated
a dataset
18 days ago
michaelwaves/grpo-schemer_12-16-2025
View all activity
Organizations
michaelwaves
's datasets
27
Sort:Â Recently updated
michaelwaves/activations-test
Viewer
•
Updated
14 days ago
•
8B
•
29
michaelwaves/grpo-schemer_12-16-2025
Updated
18 days ago
•
26
michaelwaves/grpo-schemer-12-10-2025-brier-soft_exponential_length_penalty
Updated
24 days ago
•
6
michaelwaves/magicoder-oss_752_problem_activations
Viewer
•
Updated
24 days ago
•
21.2M
•
6
michaelwaves/magicoder-oss_752_solution_activations
Viewer
•
Updated
24 days ago
•
20.5M
•
12
michaelwaves/control_arena_filtered_230_activations
Viewer
•
Updated
24 days ago
•
9.83M
•
12
michaelwaves/grpo-schemer-checkpoints_12-10-2025
Updated
24 days ago
•
94
michaelwaves/control_arena_1000
Viewer
•
Updated
25 days ago
•
1k
•
18
michaelwaves/grpo-schemer-12-9-2025
Updated
25 days ago
•
6
michaelwaves/grpo-results-12-9-2025
Viewer
•
Updated
25 days ago
•
19
•
382
michaelwaves/grpo-schemer-12-2-2025-length-penalty
Updated
Dec 3, 2025
•
35
michaelwaves/grpo-schemer-12-2-2025
Updated
Dec 2, 2025
•
42
michaelwaves/vllm-steer-venv
Updated
Nov 29, 2025
•
634
michaelwaves/emotional-vectors
Updated
Nov 29, 2025
•
23
•
1
michaelwaves/sparse-circuits
Preview
•
Updated
Nov 26, 2025
•
7
michaelwaves/scheming
Updated
Nov 19, 2025
michaelwaves/scheming_transcripts_baseline_2_8_transcripts
Updated
Nov 18, 2025
•
3
michaelwaves/fineweb_reward_hacking_10_percent
Viewer
•
Updated
Nov 14, 2025
•
86.2k
•
16
michaelwaves/reward-hacking
Viewer
•
Updated
Nov 14, 2025
•
8.62k
•
18
michaelwaves/bench-af-logs
Updated
Sep 24, 2025
•
103
michaelwaves/blackmail-observed
Viewer
•
Updated
Aug 22, 2025
•
1
•
11
michaelwaves/blackmail-unobserved
Viewer
•
Updated
Aug 22, 2025
•
1
•
9
michaelwaves/mmlu-abstract_algebra-inspect
Viewer
•
Updated
Aug 14, 2025
•
11
•
12
michaelwaves/racism_dataset_small
Viewer
•
Updated
Aug 10, 2025
•
5
•
12
michaelwaves/anthropic-blackmail-eval
Viewer
•
Updated
Aug 10, 2025
•
1
•
4
michaelwaves/antrhopic_blackmail_eval
Updated
Aug 10, 2025
•
4
michaelwaves/shopify_transactions_eval
Viewer
•
Updated
Aug 10, 2025
•
2
•
15