Derry Pratama
ibndias
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 4 hours ago
ibndias/yourbench_example
published
a dataset
about 4 hours ago
ibndias/yourbench_example
commented on
a paper
about 12 hours ago
Kuwain 1.5B: An Arabic SLM via Language Injection
Organizations
Collections
2
Papers
2
models
16

ibndias/gemma-3-1b-reasoning-grpo
Text Generation
•
Updated
•
2

ibndias/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Text Generation
•
Updated
•
11

ibndias/Qwen-2.5-7B-Simple-RL
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
•
1

ibndias/Qwen-2.5-7B_Base_Math_smalllr
Updated

ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
•
Updated

ibndias/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
•
2

ibndias/taxi-v3
Reinforcement Learning
•
Updated

ibndias/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated

ibndias/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
3
datasets
6
ibndias/yourbench_example
Viewer
•
Updated
•
114
ibndias/cipher-context-dataset
Viewer
•
Updated
•
202k
•
16
ibndias/agentic-htb
Viewer
•
Updated
•
338
•
1
ibndias/htb-v2
Viewer
•
Updated
•
41.4k
•
23
ibndias/distilabel-capybara-dpo-7k-binarized
Viewer
•
Updated
•
7.56k
•
24
ibndias/SlimOpenHermes-2.5
Viewer
•
Updated
•
919k
•
27