2 1 5

Peter Hatvani PRO

RabidUmarell

https://hatvanipeter.hu

Napermial

AI & ML interests

Toxicity in Hungarian

Recent Activity

updated a dataset 26 days ago

RabidUmarell/hu-toxic-test-set

published a dataset 26 days ago

RabidUmarell/hu-toxic-test-set

upvoted an article 26 days ago

Should We Still Pretrain Encoders with Masked Language Modeling?

View all activity

Organizations

None yet

updated a dataset 26 days ago

RabidUmarell/hu-toxic-test-set

Viewer • Updated 26 days ago • 39 • 90

published a dataset 26 days ago

RabidUmarell/hu-toxic-test-set

Viewer • Updated 26 days ago • 39 • 90

upvoted an article 26 days ago

Article

Should We Still Pretrain Encoders with Masked Language Modeling?

and 3 others •

Jul 2

• 21

updated a dataset 4 months ago

RabidUmarell/hureddit-toxicity-setfit

Viewer • Updated Apr 18 • 323k • 10

published a dataset 4 months ago

RabidUmarell/hureddit-toxicity-setfit

Viewer • Updated Apr 18 • 323k • 10

updated a model 4 months ago

RabidUmarell/hubert-embedding-setfit-toxic

Text Classification • 0.1B • Updated Apr 7 • 57

reacted to wassemgtk's post with 👀 4 months ago

Post

3131

I’ve been diving into the iRoPE architecture from Llama 4—a game-changer for long-context models! It interleaves local attention (with RoPE) for short contexts and global attention (with inference-time temp scaling) for long-range reasoning, aiming for infinite context. I’m going to try writing iRoPE—who wants to help?

Code: https://github.com/wassemgtk/iRoPE-try/blob/main/iRoPE.ipynb