Vaidik Nakrani
VaidikML0508
AI & ML interests
utilizing gradient decent efficiently
Recent Activity
liked
a model
21 days ago
mlc-ai/DeepSeek-R1-Distill-Qwen-1.5B-q4f16_1-MLC
liked
a model
28 days ago
agentica-org/DeepScaleR-1.5B-Preview
liked
a model
about 1 month ago
krutrim-ai-labs/Krutrim-2-instruct
Organizations
None yet
Collections
1
models
14
VaidikML0508/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
VaidikML0508/Reinforce-pixel-copte-1
Reinforcement Learning
•
Updated
VaidikML0508/Reinforce-pixel-copter
Updated
VaidikML0508/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
•
10
VaidikML0508/ML-Agents-Pyramids
Reinforcement Learning
•
Updated
•
42
VaidikML0508/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
9
VaidikML0508/taxi-V3
Reinforcement Learning
•
Updated
VaidikML0508/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
VaidikML0508/ppo-SnowballTarget
Reinforcement Learning
•
Updated
VaidikML0508/Reinforce-1
Reinforcement Learning
•
Updated
datasets
None public yet