Zhaoye Fei's picture

Zhaoye Fei

ngc7293

·

https://ngc7292.github.io/

AI & ML interests

NLP & Ro.

Recent Activity

upvoted a paper 6 days ago

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

upvoted a paper 6 days ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

liked a Space 7 days ago

OpenMOSS-Team/MOSS-transcribe-diarize

View all activity

Organizations

upvoted 2 papers 6 days ago

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Paper • 2512.22234 • Published 13 days ago • 19

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published 7 days ago • 63

liked a Space 7 days ago

MOSS Transcribe Diarize

upvoted a paper about 1 month ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 211

upvoted a paper about 2 months ago

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published Nov 19, 2025 • 22

liked a model 2 months ago

OpenMOSS-Team/MOSS-TTSD-v0.7

Text-to-Speech • 2B • Updated Nov 11, 2025 • 969 • 15

upvoted 2 papers 2 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 53

liked 2 datasets 3 months ago

Sylvest/libero_plus_rlds

Updated Oct 17, 2025 • 426 • 5

Sylvest/LIBERO-plus

Updated Oct 17, 2025 • 464 • 15

upvoted 3 papers 3 months ago

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Paper • 2510.13809 • Published Oct 15, 2025 • 37

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15, 2025 • 45

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Paper • 2510.00499 • Published Oct 1, 2025 • 19

liked a model 3 months ago

OpenMOSS-Team/MOSS-Speech

9B • Updated Sep 30, 2025 • 196 • 16

liked a Space 3 months ago

MOSS-Speech Demo

True Speech-to-Speech Language Model

upvoted a paper 4 months ago

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

liked a Space 4 months ago

README

OpenMOSS Team of SII

updated a Space 4 months ago

README

OpenMOSS Team of SII

published a Space 4 months ago

README

OpenMOSS Team of SII

upvoted a paper 5 months ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11, 2025 • 110