reasoning-project

community

Activity Feed

AI & ML interests

None defined yet.


reasoning-project's activity

amphora

authored a paper 1 day ago

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published 2 days ago • 19

Cartinoe5930

authored 2 papers 2 days ago

Multi-Step Reasoning in Korean and the Emergent Mirage

Paper • 2501.05712 • Published Jan 10

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published 2 days ago • 19

JW17

updated a model 11 days ago

reasoning-project/Q25M-1.5B-MR1-50k-SFT-v0.2-3epoch

Text Generation • Updated 11 days ago • 1

JW17

published a model 11 days ago

reasoning-project/Q25M-1.5B-MR1-50k-SFT-v0.2-3epoch

Text Generation • Updated 11 days ago • 1

JW17

updated a model 12 days ago

reasoning-project/Q25M-1.5B-Open-R1-55k-SFT-v0.1

Text Generation • Updated 12 days ago • 6

JW17

published a model 12 days ago

reasoning-project/Q25M-1.5B-Open-R1-55k-SFT-v0.1

Text Generation • Updated 12 days ago • 6

JW17

updated a model 13 days ago

reasoning-project/Q25-1.5B-PRIME-55K-GRPO-Acc2-format5e1

Updated 13 days ago

JW17

published a model 13 days ago

reasoning-project/Q25-1.5B-PRIME-55K-GRPO-Acc2-format5e1

Updated 13 days ago

JW17

updated a model 13 days ago

reasoning-project/Q25-1.5B-Open-R1-55K-GRPO-Acc2-format5e1

Updated 13 days ago

JW17

published a model 13 days ago

reasoning-project/Q25-1.5B-Open-R1-55K-GRPO-Acc2-format5e1

Updated 13 days ago

Cartinoe5930

authored a paper about 1 month ago

LLM-as-a-Judge & Reward Model: What They Can and Cannot Do

Paper • 2409.11239 • Published Sep 17, 2024 • 1

Cartinoe5930

authored a paper about 2 months ago

Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap

Paper • 2501.02448 • Published Jan 5

JW17

authored 2 papers 3 months ago

Stable Language Model Pre-training by Reducing Embedding Variability

Paper • 2409.07787 • Published Sep 12, 2024

Cross-lingual Transfer of Reward Models in Multilingual Alignment

Paper • 2410.18027 • Published Oct 23, 2024

JW17

authored a paper 9 months ago

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10, 2024 • 13

JW17

authored a paper 12 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 64

amphora

authored a paper about 1 year ago

KMMLU: Measuring Massive Multitask Language Understanding in Korean

Paper • 2402.11548 • Published Feb 18, 2024

amphora

authored 2 papers over 1 year ago

HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models

Paper • 2309.02706 • Published Sep 6, 2023 • 2

Removing Non-Stationary Knowledge From Pre-Trained Language Models for Entity-Level Sentiment Classification in Finance

Paper • 2301.03136 • Published Jan 9, 2023

Team members 3
