-
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Paper • 2309.04663 • Published • 5 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 87 -
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Paper • 2310.08541 • Published • 18 -
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
Paper • 2310.13671 • Published • 19
Shangzhi Zhang
Snorlax
AI & ML interests
None yet
Recent Activity
upvoted
an
article
5 days ago
You could have designed state of the art positional encoding
upvoted
a
paper
5 days ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
updated
a model
29 days ago
Snorlax/rl_course_vizdoom_health_gathering_supreme
Organizations
Collections
2
-
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Paper • 2310.08529 • Published • 18 -
CapsFusion: Rethinking Image-Text Data at Scale
Paper • 2310.20550 • Published • 26 -
TiC-CLIP: Continual Training of CLIP Models
Paper • 2310.16226 • Published • 9 -
Prompt Expansion for Adaptive Text-to-Image Generation
Paper • 2312.16720 • Published • 6
models
14
Snorlax/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Snorlax/LunarLander-v2-PPO-reproduce
Reinforcement Learning
•
Updated
Snorlax/poca-SoccerTwos
Reinforcement Learning
•
Updated
Snorlax/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
•
1
Snorlax/ppo-Pyramids
Reinforcement Learning
•
Updated
•
8
Snorlax/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
3
Snorlax/Reinforce-PixelCopter
Reinforcement Learning
•
Updated
Snorlax/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
Snorlax/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
1
Snorlax/dqn-SpaceInvadersNoFrameskip-v4-retry2
Updated
datasets
None public yet