1 3

Bhavya Kailkhura

bhavyakailkhura

bkailkhu

AI & ML interests

AI Safety and Efficiency

Recent Activity

authored a paper about 2 months ago

SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning

authored a paper about 2 months ago

Low-rank finetuning for LLMs: A fairness perspective

authored a paper about 2 months ago

Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion

View all activity

Organizations

None yet

authored 13 papers about 2 months ago

SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning

Paper • 2404.18239 • Published Apr 28, 2024

Low-rank finetuning for LLMs: A fairness perspective

Paper • 2405.18572 • Published May 28, 2024

Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion

Paper • 2408.05636 • Published Aug 10, 2024

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

Paper • 2501.02629 • Published Jan 5 • 1

EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants

Paper • 2502.20309 • Published Feb 27

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Paper • 2503.10602 • Published Mar 13 • 4

Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness

Paper • 2501.09446 • Published Jan 16

GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models

Paper • 2503.01682 • Published Mar 3 • 1

Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training

Paper • 2503.18929 • Published Mar 24 • 4

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

Paper • 2504.01903 • Published Apr 2

Constrained Language Generation with Discrete Diffusion Models

Paper • 2503.09790 • Published Mar 12

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Paper • 2504.15585 • Published Apr 22 • 13

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 44

authored a paper 6 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 145

authored 6 papers about 1 year ago

Shifting Attention to Relevance: Towards the Uncertainty Estimation of Large Language Models

Paper • 2307.01379 • Published Jul 3, 2023 • 1

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 70

Bhavya Kailkhura

AI & ML interests

Recent Activity

Organizations

bhavyakailkhura's activity