4 256 52

Charles I Niswander II

charlesniswander

dhar174

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

upvoted a paper about 14 hours ago

Learning to Skip the Middle Layers of Transformers

upvoted a paper about 14 hours ago

Robust Reward Modeling via Causal Rubrics

View all activity

Organizations

None yet

upvoted a paper about 13 hours ago

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Paper • 2506.17218 • Published 9 days ago • 19

upvoted 2 papers about 14 hours ago

Learning to Skip the Middle Layers of Transformers

Paper • 2506.21103 • Published 4 days ago • 11

Robust Reward Modeling via Causal Rubrics

Paper • 2506.16507 • Published 10 days ago • 7

upvoted an article 3 days ago

Article

Gemma 3n fully available in the open-source ecosystem!

and 7 others •

4 days ago

• 87

upvoted a paper 10 days ago

Reasoning with Exploration: An Entropy Perspective

Paper • 2506.14758 • Published 12 days ago • 26

upvoted 2 papers 11 days ago

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets

Paper • 2506.14761 • Published 12 days ago • 13

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published 14 days ago • 59

upvoted a paper 13 days ago

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache

Paper • 2506.11886 • Published 17 days ago • 20

upvoted a paper 19 days ago

BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

Paper • 2506.07530 • Published 21 days ago • 18

upvoted a collection 19 days ago

BitVLA

Collection

1-bit Vision-Language-Action Models for Robotics Manipulation • 4 items • Updated 20 days ago • 2

upvoted a paper 22 days ago

LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks

Paper • 2506.00411 • Published 30 days ago • 30

upvoted a paper 23 days ago

Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published 25 days ago • 25

upvoted 2 papers 27 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published about 1 month ago • 132

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published about 1 month ago • 15

upvoted 5 papers about 1 month ago

RLVR-World: Training World Models with Reinforcement Learning

Paper • 2505.13934 • Published May 20 • 14

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19 • 79

Simple Semi-supervised Knowledge Distillation from Vision-Language Models via texttt{D}ual-texttt{H}ead texttt{O}ptimization

Paper • 2505.07675 • Published May 12 • 19

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15 • 81

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 119

upvoted a paper about 2 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 80

Charles I Niswander II

AI & ML interests

Recent Activity

Organizations

charlesniswander's activity

Gemma 3n fully available in the open-source ecosystem!