Improving Black-box Robustness with In-Context Rewriting
Paper • 2402.08225 • Published
Kyle1668/boss-sentiment-24000-bert-base-uncased
Text Classification • Updated • 11
Kyle1668/boss-sentiment-bert-base-uncased
Text Classification • Updated • 9
Kyle1668/boss-toxicity-bert-base-uncased
Text Classification • Updated • 18
Kyle O'Brien PRO
Kyle1668
AI & ML interests
Interpretability, model editing, alignment
Recent Activity
updated a model 1 day ago: Unlearning/pythia1.5_blocklist_filtered_wmdp_lie_o_rewrite_20x_upsampled
published a model 1 day ago: Unlearning/pythia1.5_blocklist_filtered_wmdp_lie_o_rewrite_20x_upsampled
updated a model 2 days ago: Unlearning/pythia1.5_modernbert_filtered_wmdp_lie_o_shuffled
Collections: 1
Papers: 2
Models: 27

Kyle1668/answerdotai-ModernBERT-large_20250111-002259
Text Classification • Updated • 3
Kyle1668/answerdotai-ModernBERT-large_20250111-224237
Text Classification • Updated • 1
Kyle1668/answerdotai-ModernBERT-large_20241230-093521
Text Classification • Updated • 13
Kyle1668/allenai-scibert_scivocab_uncased_20241230-091934
Text Classification • Updated • 8
Kyle1668/boss-toxicity-bert-base-uncased
Text Classification • Updated • 18
Kyle1668/ag-news-t5-large
Text2Text Generation • Updated • 11
Kyle1668/ag-news-76800-bert-base-uncased
Text Classification • Updated • 6
Kyle1668/ag-news-38400-bert-base-uncased
Text Classification • Updated • 11
Kyle1668/ag-news-19200-bert-base-uncased
Text Classification • Updated • 1.6k
Kyle1668/ag-news-9600-bert-base-uncased
Text Classification • Updated • 6
Datasets: 7
Kyle1668/mmlu_auxiliary_train_formatted
Viewer • Updated • 99.8k • 77
Kyle1668/phi_sae_training
Viewer • Updated • 17.2M • 55
Kyle1668/LLM-TTA-Cached-Rewrites
Viewer • Updated • 986k • 18
Kyle1668/LLM-TTA-Augmentation-Logs
Viewer • Updated • 4.43M • 54
Kyle1668/AG-Tweets
Viewer • Updated • 7.6k • 18
Kyle1668/BOSS-Robustness-Benchmark
Preview • Updated • 7
Kyle1668/pythia-semantic-memorization-perplexities
Viewer • Updated • 99.7M • 511