General Preference

university

https://github.com/general-preference

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

yifAI authored a paper 19 days ago

A Markov Categorical Framework for Language Modeling

yifAI authored a paper about 2 months ago

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

yifAI authored a paper 3 months ago

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

View all activity

yifAI

authored a paper 19 days ago

A Markov Categorical Framework for Language Modeling

Paper • 2507.19247 • Published Jul 25 • 1

yifAI

authored a paper about 2 months ago

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Paper • 2507.06181 • Published Jul 8 • 41

yifAI

authored a paper 3 months ago

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

Paper • 2505.17508 • Published May 23 • 5

yifAI

authored a paper 4 months ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published May 5 • 32

thughost

authored a paper 7 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 89

yifAI

authored a paper 7 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 89

yifAI

authored 2 papers 9 months ago

Scaling Image Tokenizers with Grouped Spherical Quantization

Paper • 2412.02632 • Published Dec 3, 2024 • 10

Training and Evaluating Language Models with Template-based Data Generation

Paper • 2411.18104 • Published Nov 27, 2024 • 3

thughost

authored a paper 9 months ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15, 2024 • 13

kirigayahitsugi

updated 4 models 10 months ago

thughost

authored a paper 10 months ago

DPLM-2: A Multimodal Diffusion Protein Language Model

Paper • 2410.13782 • Published Oct 17, 2024 • 22

kirigayahitsugi

updated a model 11 months ago

general-preference/GPM-Llama-3.1-8B

8B • Updated Oct 15, 2024 • 204 • 1

yifAI

updated 2 models 11 months ago

general-preference/GPO-Llama-3-8B-Instruct-GPM-2B

Text Generation • 8B • Updated Oct 11, 2024 • 4 • 2

general-preference/SPPO-Llama-3-8B-Instruct-GPM-2B

Text Generation • 8B • Updated Oct 11, 2024 • 5 • 1

thughost

authored 2 papers 11 months ago

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 9

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3, 2024 • 38

yifAI

authored a paper 11 months ago

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 9

AI & ML interests

Recent Activity

Team members 4

general-preference's activity