Rakshith
rakshith-writer
1 follower · 2 following
AI & ML interests
None yet
Recent Activity
reacted to wassemgtk's post with 😎 10 days ago
I’ve been diving into the iRoPE architecture from Llama 4—a game-changer for long-context models! It interleaves local attention (with RoPE) for short contexts and global attention (with inference-time temp scaling) for long-range reasoning, aiming for infinite context. I’m going to try writing iRoPE—who wants to help? Code: https://github.com/wassemgtk/iRoPE-try/blob/main/iRoPE.ipynb
reacted to the same post by wassemgtk with 👀 10 days ago
liked a dataset 2 months ago
Writer/FailSafeQA
rakshith-writer's activity
liked a dataset 2 months ago
Writer/FailSafeQA • Viewer • Updated Feb 13 • 220 • 276 • 7