SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning Paper • 2404.18239 • Published Apr 28, 2024
Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion Paper • 2408.05636 • Published Aug 10, 2024
Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense Paper • 2501.02629 • Published Jan 5 • 1
EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants Paper • 2502.20309 • Published Feb 27
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention Paper • 2503.10602 • Published Mar 13 • 4
Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness Paper • 2501.09446 • Published Jan 16
GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models Paper • 2503.01682 • Published Mar 3 • 1
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training Paper • 2503.18929 • Published Mar 24 • 4
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Paper • 2504.15585 • Published Apr 22 • 13
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 44
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published Feb 7 • 145
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities Paper • 2210.06640 • Published Oct 13, 2022
Generative Counterfactual Introspection for Explainable Deep Learning Paper • 1907.03077 • Published Jul 6, 2019
NEFTune: Noisy Embeddings Improve Instruction Finetuning Paper • 2310.05914 • Published Oct 9, 2023 • 14
Shifting Attention to Relevance: Towards the Uncertainty Estimation of Large Language Models Paper • 2307.01379 • Published Jul 3, 2023 • 1