Wenhu Chen

wenhu

https://wenhuchen.github.io

AI & ML interests

NLP

Recent Activity

upvoted a paper 3 days ago

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

new activity 4 days ago

TIGER-Lab/GenAI-Arena: Runtime Error: Failed to retrieve error logs: SSE is not enabled

updated a dataset 4 days ago

TIGER-Lab/MMEB-V2

upvoted a paper 3 days ago

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

Paper • 2507.08800 • Published 5 days ago • 59

upvoted 2 papers about 1 month ago

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published Jun 4 • 24

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

Paper • 2506.03295 • Published Jun 3 • 17

upvoted 5 papers about 2 months ago

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26 • 18

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 53

upvoted a paper 3 months ago

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Paper • 2504.08837 • Published Apr 10 • 43

upvoted 3 papers 4 months ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published Apr 1 • 44

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 136

ABC: Achieving Better Control of Multimodal Embeddings using VLMs

Paper • 2503.00329 • Published Mar 1 • 19

upvoted a paper 5 months ago

PixelWorld: Towards Perceiving Everything as Pixels

Paper • 2501.19339 • Published Jan 31 • 17

upvoted a paper 6 months ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published Jan 29 • 59

upvoted a paper 7 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

upvoted a paper 8 months ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 28

upvoted a paper 9 months ago

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

Paper • 2410.05160 • Published Oct 7, 2024 • 4

upvoted a paper 11 months ago

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 45

upvoted 2 papers about 1 year ago

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21, 2024 • 65

Unifying Multimodal Retrieval via Document Screenshot Embedding

Paper • 2406.11251 • Published Jun 17, 2024 • 10
