uu

JayZc

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Optimizing Large Language Model Training Using FP4 Quantization

upvoted a paper 2 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

upvoted a paper 2 days ago

Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts

View all activity

Organizations

None yet

JayZc's activity

upvoted 3 papers 2 days ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 9 days ago • 32

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 9 days ago • 100

Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts

Paper • 2501.14334 • Published 14 days ago • 17

liked a dataset 2 days ago

axxkaya/UVT-Explanatory-based-Vision-Tasks

Viewer • Updated 5 days ago • 284k • 37 • 6

upvoted 2 papers 2 days ago

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 8 days ago • 22

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published 13 days ago • 26

liked a dataset 2 days ago

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 126k • 731

upvoted a paper 2 days ago

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Paper • 2501.18636 • Published 9 days ago • 25

upvoted 12 papers 9 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 37

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 45

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 72

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published 23 days ago • 61

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published 15 days ago • 19

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 301

AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation

Paper • 2403.14614 • Published Mar 21, 2024 • 3

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published 13 days ago • 29

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 12 days ago • 53