University of Washington

Verified

Activity Feed Request to join this org

AI & ML interests

None defined yet.

ronakdm

authored 3 papers 25 days ago

Distributionally Robust Optimization with Bias and Variance Reduction

Paper • 2310.13863 • Published Oct 21, 2023

The Benefits of Balance: From Information Projections to Variance Reduction

Paper • 2408.15065 • Published Aug 27, 2024 • 1

A Generalization Theory for Zero-Shot Prediction

Paper • 2507.09128 • Published Jul 12

LNIU

authored 6 papers about 1 month ago

ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates

Paper • 2406.12935 • Published Jun 17, 2024 • 2

CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models

Paper • 2406.12257 • Published Jun 18, 2024

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 39

SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

Paper • 2502.12025 • Published Feb 17 • 3

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Paper • 2505.14625 • Published May 20 • 13

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

Paper • 2505.23977 • Published May 29 • 10

alisawuffles

in UW/OLMo2-8B-SuperBPE-t180k 4 months ago

Training code for Tokenizer

#1 opened 5 months ago by

kevinlin311tw

authored a paper 4 months ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published Apr 10 • 20

alisawuffles

updated a dataset 5 months ago

UW/olmo-mix-1124-subset-p99

Updated Apr 10 • 317 • 1

alisawuffles

updated a collection 5 months ago

SuperBPE

SuperBPE tokenizers and models trained with them • 8 items • Updated Apr 10 • 15

kevinlin311tw

authored a paper 5 months ago

BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation

Paper • 2503.20672 • Published Mar 26 • 14

Jhayase

published a model 5 months ago

UW/OLMo2-11B-SuperBPE-t180k

Text Generation • 11B • Updated Mar 20 • 8 • 2

Jhayase

updated a model 5 months ago

UW/OLMo2-11B-SuperBPE-t180k

Text Generation • 11B • Updated Mar 20 • 8 • 2

alisawuffles

updated a collection 5 months ago

SuperBPE

SuperBPE tokenizers and models trained with them • 8 items • Updated Apr 10 • 15

alisawuffles

updated a model 5 months ago

UW/OLMo2-11B-SuperBPE-t180k

Text Generation • 11B • Updated Mar 20 • 8 • 2

alisawuffles

updated a collection 5 months ago

SuperBPE

SuperBPE tokenizers and models trained with them • 8 items • Updated Apr 10 • 15

alisawuffles

published a model 5 months ago

UW/OLMo2-8B-SuperBPE-t80k

Text Generation • 8B • Updated Mar 19 • 6