8 14 20

Zaid Khan

codezakh

https://zaidkhan.me

AI & ML interests

None yet

Recent Activity

authored a paper about 1 hour ago

DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning

authored a paper about 1 hour ago

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

authored a paper about 1 hour ago

OpenThoughts: Data Recipes for Reasoning Models

View all activity

Organizations

authored 7 papers about 1 hour ago

DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning

Paper • 2503.19263 • Published Mar 25, 2025 • 2

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

Paper • 2504.09763 • Published Apr 14, 2025 • 12

OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4, 2025 • 54

One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration

Paper • 2510.12088 • Published Oct 14, 2025 • 5

PRInTS: Reward Modeling for Long-Horizon Information Seeking

Paper • 2511.19314 • Published Nov 24, 2025 • 8

Cog-DRIFT: Exploration on Adaptively Reformulated Instances Enables Learning from Hard Reasoning Problems

Paper • 2604.04767 • Published 10 days ago • 7

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

Paper • 2604.11666 • Published 3 days ago • 3

upvoted a paper about 20 hours ago

Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind

Paper • 2604.11666 • Published 3 days ago • 3

upvoted a paper about 1 month ago

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published Feb 26 • 37

upvoted a collection about 2 months ago

pplx-embed

Collection

Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96

upvoted an article about 2 months ago

Article

DenseR: Dense Rewards For Free in LLM Reasoning

Feb 18

•

upvoted a paper 2 months ago

Effective Reasoning Chains Reduce Intrinsic Dimensionality

Paper • 2602.09276 • Published Feb 9 • 11

upvoted an article 4 months ago

Article

Ettin Suite: SoTA Paired Encoders and Decoders

Jul 16, 2025

•

liked a dataset 5 months ago

HuggingFaceFW/finepdfs-edu

Viewer • Updated Nov 11, 2025 • 49.5M • 5.5k • 86

upvoted a paper 5 months ago

PRInTS: Reward Modeling for Long-Horizon Information Seeking

Paper • 2511.19314 • Published Nov 24, 2025 • 8

liked a model 6 months ago

facebook/cwm

33B • Updated Oct 15, 2025 • 5.44k • 265

liked a dataset 6 months ago

HuggingFaceFW/finewiki

Viewer • Updated Oct 22, 2025 • 61.6M • 6.93k • 292

upvoted a paper 6 months ago

One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration

Paper • 2510.12088 • Published Oct 14, 2025 • 5

commented a paper 6 months ago

One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration

Paper • 2510.12088 • Published Oct 14, 2025 • 5 •

liked a model 7 months ago

jet-ai/Jet-Nemotron-2B

Text Generation • Updated Sep 28, 2025 • 5.64k • 17

Zaid Khan

AI & ML interests

Recent Activity

Organizations

codezakh's activity

DenseR: Dense Rewards For Free in LLM Reasoning

Ettin Suite: SoTA Paired Encoders and Decoders