1 9 17

Weizhe Yuan

weizhey

AI & ML interests

NLP

Recent Activity

upvoted a paper 16 days ago

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization

liked a dataset about 2 months ago

facebook/natural_reasoning

updated a dataset about 2 months ago

facebook/natural_reasoning

View all activity

Organizations

weizhey's activity

upvoted a paper 16 days ago

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization

Paper • 2503.19901 • Published 25 days ago • 37

liked a dataset about 2 months ago

facebook/natural_reasoning

Viewer • Updated Feb 21 • 1.15M • 7.38k • 493

updated a dataset about 2 months ago

facebook/natural_reasoning

Viewer • Updated Feb 21 • 1.15M • 7.38k • 493

upvoted 2 papers 2 months ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 59

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

upvoted a paper 5 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 49

authored a paper 5 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 49

upvoted a paper 5 months ago

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10

authored a paper 6 months ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 19

authored 8 papers 9 months ago

Self-Taught Evaluators

Paper • 2408.02666 • Published Aug 5, 2024 • 30

BARTScore: Evaluating Generated Text as Text Generation

Paper • 2106.11520 • Published Jun 22, 2021 • 2

FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios

Paper • 2307.13528 • Published Jul 25, 2023

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Paper • 2107.13586 • Published Jul 28, 2021

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 20

upvoted a paper 9 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 20

liked a Space 9 months ago

717

Tile Upscaler

🚀

Enhance images with high-resolution quality and HDR effects

authored a paper 12 months ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 50