3 70 196

YYY

zzfive

ZZfive

AI & ML interests

None yet

Recent Activity

updated a collection about 8 hours ago

video

updated a collection about 12 hours ago

RL+reason model

updated a collection about 12 hours ago

robot

View all activity

Organizations

None yet

upvoted a paper 8 days ago

RLPR: Extrapolating RLVR to General Domains without Verifiers

Paper • 2506.18254 • Published 11 days ago • 31

upvoted 3 papers about 1 month ago

upvoted a paper about 2 months ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12 • 125

upvoted 5 papers 3 months ago

Packing Input Frame Context in Next-Frame Prediction Models for Video Generation

Paper • 2504.12626 • Published Apr 17 • 52

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 276

MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Paper • 2502.18924 • Published Feb 26 • 13

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published Apr 1 • 36

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26 • 52

upvoted a paper 4 months ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18 • 151

upvoted an article 4 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

and 2 others •

Jan 23

• 181

upvoted 3 papers 4 months ago

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 86

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26 • 39

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published Feb 23 • 37

upvoted an article 4 months ago

Article

The Large Language Model Course

•

Jan 16

• 194

upvoted a paper 5 months ago

HiFi-SR: A Unified Generative Transformer-Convolutional Adversarial Network for High-Fidelity Speech Super-Resolution

Paper • 2501.10045 • Published Jan 17 • 9

upvoted 3 papers 6 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 295

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 89

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10 • 52

YYY

AI & ML interests

Recent Activity

Organizations

zzfive's activity

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

The Large Language Model Course