Muhammad Khalifa's picture

Muhammad Khalifa

mkhalifa

·

https://mukhal.github.io/

AI & ML interests

natural language genration, reinforcement learning

Recent Activity

new activity 16 days ago

mkhalifa/flan-t5-large-gsm8k:Add model card for GRACE

new activity 16 days ago

mkhalifa/flan-t5-large-svamp:Add model card for GRACE

new activity 16 days ago

mkhalifa/flan-t5-large-mathqa:Add model card

View all activity

Organizations

New activity in mkhalifa/flan-t5-large-gsm8k 16 days ago

Add model card for GRACE

#2 opened 17 days ago by

New activity in mkhalifa/flan-t5-large-svamp 16 days ago

Add model card for GRACE

#1 opened 17 days ago by

New activity in mkhalifa/flan-t5-large-mathqa 16 days ago

Add model card

#2 opened 17 days ago by

published a dataset about 2 months ago

mkhalifa/agent

Updated Nov 26, 2025

upvoted a collection 3 months ago

ThinkPRM

Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 • 8 items • Updated Jul 29, 2025 • 5

updated a model 5 months ago

mkhalifa/ThinkPRM-gptoss-20B

Updated Aug 18, 2025 • 15

published a model 5 months ago

mkhalifa/ThinkPRM-gptoss-20B

Updated Aug 18, 2025 • 15

upvoted an article 7 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

750

New activity in launch/ThinkPRM-14B 7 months ago

Add link to code and library name

#2 opened 7 months ago by

New activity in launch/thinkprm-1K-verification-cots 7 months ago

Update paper link and add Github link

#3 opened 7 months ago by

updated a model 7 months ago

launch/ThinkPRM-1.5B

Text Generation • 2B • Updated Jun 25, 2025 • 43 • 3

liked a model 7 months ago

launch/ThinkPRM-14B

Text Generation • 15B • Updated Jul 1, 2025 • 228 • 5

upvoted a paper 8 months ago

ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists

Paper • 2506.01241 • Published Jun 2, 2025 • 9

authored a paper 8 months ago

Process Reward Models That Think

Paper • 2504.16828 • Published Apr 23, 2025 • 18

updated a collection 8 months ago

ThinkPRM

Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 • 8 items • Updated Jul 29, 2025 • 5

liked a model 8 months ago

launch/ThinkPRM-1.5B

Text Generation • 2B • Updated Jun 25, 2025 • 43 • 3

upvoted a paper 8 months ago

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

Paper • 2404.17140 • Published Apr 26, 2024 • 1

liked a dataset 8 months ago

osunlp/Online-Mind2Web

Viewer • Updated 17 days ago • 300 • 438 • 22

commented a paper 8 months ago

Process Reward Models That Think

Paper • 2504.16828 • Published Apr 23, 2025 • 18 •