BOUKOUFFALLAH Abdallah

iBado

Abdellahbado

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Reward Reasoning Model

upvoted a paper 4 days ago

Chain-of-Model Learning for Language Model

upvoted a paper 4 days ago

AdaptThink: Reasoning Models Can Learn When to Think

View all activity

Organizations

None yet

iBado's activity

upvoted a paper 3 days ago

Reward Reasoning Model

Paper • 2505.14674 • Published 4 days ago • 30

upvoted 2 papers 4 days ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published 7 days ago • 105

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published 5 days ago • 70

upvoted a paper 6 days ago

Transformer Interpretability Beyond Attention Visualization

Paper • 2012.09838 • Published Dec 17, 2020 • 1

upvoted a paper 7 days ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 9 days ago • 113

upvoted a paper 15 days ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published 17 days ago • 144

upvoted 2 papers 17 days ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 18 days ago • 159

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published 18 days ago • 91

upvoted a collection 23 days ago

Qwen3

Collection

40 items • Updated 3 days ago • 693

upvoted a paper 24 days ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published 25 days ago • 91

upvoted a collection 25 days ago

Reasoning, Thinking, RL and Test-Time Scaling

Collection

147 items • Updated 30 days ago • 11

upvoted a paper 25 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 99

upvoted a collection 27 days ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated 26 days ago • 81

upvoted 3 papers about 1 month ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 44

Kuwain 1.5B: An Arabic SLM via Language Injection

Paper • 2504.15120 • Published Apr 21 • 115

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 279

upvoted a paper about 2 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 140

upvoted a paper 2 months ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 44

liked a model 3 months ago

CohereLabs/c4ai-command-r7b-arabic-02-2025

Text Generation • Updated Apr 15 • 6.05k • • 108

liked a Space 3 months ago

Yehia 7B Preview

💬

Good Arabic Model