AI4Bio@ZJLab

university

AI & ML interests

None defined yet.

Recent Activity

JustinLin610

authored a paper 12 days ago

RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback

Paper • 2507.15024 • Published 15 days ago • 13

JustinLin610

authored a paper 2 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 176

Junde

updated a model 3 months ago

InstructPLM/Concated-Progen2-xlarge-CATH42-AFDB

6B • Updated May 20 • 3

Junde

published a model 3 months ago

InstructPLM/Concated-Progen2-xlarge-CATH42-AFDB

6B • Updated May 20 • 3

JustinLin610

authored 2 papers 3 months ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15 • 82

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

JustinLin610

authored a paper 4 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 165

JustinLin610

authored 2 papers 5 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published Feb 27 • 28

xptree

authored a paper 5 months ago

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18 • 17

JustinLin610

authored 4 papers 6 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66

JustinLin610

authored 4 papers 7 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 100

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 76

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 59

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

JustinLin610

authored 2 papers 8 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 51

Team members 6