Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
5
1
Yaroslav Aksenov
yaraksen
Follow
kefirski's profile picture
ZeL1k7's profile picture
21world's profile picture
4 followers
·
3 following
yaraksen
AI & ML interests
generative models, NLP
Recent Activity
authored
a paper
10 days ago
You Do Not Fully Utilize Transformer's Representation Capacity
upvoted
a
paper
10 days ago
You Do Not Fully Utilize Transformer's Representation Capacity
commented
on
a paper
10 days ago
You Do Not Fully Utilize Transformer's Representation Capacity
View all activity
Organizations
None yet
Papers
4
arxiv:
2502.09245
arxiv:
2502.03032
arxiv:
2404.09656
arxiv:
2402.10644
models
3
Sort: Recently updated
yaraksen/mor_1b_0.125
Updated
Aug 27, 2024
yaraksen/baseline_1b
Updated
Aug 26, 2024
yaraksen/mor_1b_0.125_without_inference
Updated
Aug 26, 2024
datasets
1
yaraksen/fineweb_edu_tokenized_50b
Updated
Oct 21, 2024
•
2