Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
8
1
Denis Tarasov
Adagrad
Follow
vkurenkov's profile picture
1 follower
ยท
2 following
https://dt6a.github.io/
DT6A
AI & ML interests
RL, NLP
Recent Activity
authored
a paper
2 days ago
Revisiting the Minimalist Approach to Offline Reinforcement Learning
authored
a paper
2 days ago
Distilling LLMs' Decomposition Abilities into Compact Language Models
authored
a paper
2 days ago
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
View all activity
Organizations
Papers
6
arxiv:
2505.22914
arxiv:
2501.19400
arxiv:
2402.01812
arxiv:
2305.09836
Expand 6 papers
models
0
None public yet
datasets
0
None public yet