CoRNStack: High-Quality Contrastive Data for Better Code Ranking Paper • 2412.01007 • Published Dec 1, 2024
CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis Paper • 2503.23145 • Published 27 days ago • 33
Tamper-Resistant Safeguards for Open-Weight LLMs Collection Models & datasets from the paper "Tamper-Resistant Safeguards for Open-Weight LLMs" (https://arxiv.org/pdf/2408.00761) • 9 items • Updated Feb 15 • 2