Tianyu Yu

Yirany

yiranyyu

AI & ML interests

None yet

Recent Activity

upvoted a collection 4 days ago

MiniCPM-o & MiniCPM-V

new activity 6 days ago

openbmb/RLPR-Evaluation:Replace Arxiv link with HF Papers link

commented on a paper 15 days ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

upvoted a collection 4 days ago

MiniCPM-o & MiniCPM-V

Collection

Multimodal models with leading performance. • 21 items • Updated 4 days ago • 37

New activity in openbmb/RLPR-Evaluation 6 days ago

Replace Arxiv link with HF Papers link

#2 opened 18 days ago by

nielsr

commented on 2 papers 15 days ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published 24 days ago • 32

updated 3 models 17 days ago

New activity in openbmb/RLPR-Llama3.1-8B-Inst 17 days ago

Add Transformers library and text-generation pipeline tag

#1 opened 18 days ago by

nielsr

updated 2 datasets 17 days ago

openbmb/RLPR-Evaluation

Viewer • Updated 6 days ago • 638 • 355 • 2

openbmb/RLPR-Train-Dataset

Viewer • Updated 17 days ago • 77.7k • 1.4k • 22

authored 4 papers 22 days ago

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

Paper • 2308.12038 • Published Aug 23, 2023 • 2

A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs

Paper • 2411.17265 • Published Nov 26, 2024 • 1

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Paper • 2501.11858 • Published Jan 21 • 7

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published 24 days ago • 32

commented on 2 papers 23 days ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published 24 days ago • 32

updated a collection 23 days ago

RLPR

Collection

Extrapolating RLVR to General Domains without Verifiers • 6 items • Updated 6 days ago • 3

upvoted a paper 23 days ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published 24 days ago • 32

commented on a paper 23 days ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published 24 days ago • 32

liked a model 24 days ago

openbmb/RLPR-Gemma2-2B-it

Text Generation • 3B • Updated 17 days ago • 25 • 3
