-
Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models
Paper • 2502.15086 • Published • 14 -
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?
Paper • 2502.14502 • Published • 77 -
Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information
Paper • 2502.14258 • Published • 23 -
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
Paper • 2502.12853 • Published • 26
Hyuhng Joon Kim
heyjoonkim
AI & ML interests
Machine Learning, Natural Language Processing (NLP), Uncertainty, Abstention for Reliability
Recent Activity
updated
a collection
about 15 hours ago
todo
updated
a collection
about 15 hours ago
todo
updated
a collection
about 15 hours ago
todo
Organizations
None yet
Collections
1
models
5

heyjoonkim/llama2-7b_orca_mink_10000
Text Generation
•
Updated
•
8

heyjoonkim/llama2-7b_orca_full_50000
Text Generation
•
Updated
•
6

heyjoonkim/llama2-7b_orca_nll_average_top_10000
Text Generation
•
Updated
•
9

heyjoonkim/llama2-7b_orca_random_10000
Text Generation
•
Updated
•
9

heyjoonkim/llama2-7b_orca_entropy_average_top_9988
Text Generation
•
Updated
•
9
datasets
None public yet