87 46 162

Yaowei Zheng

hiyouga

https://github.com/hiyouga

AI & ML interests

LLM Knowledge Management

Recent Activity

liked a model about 24 hours ago

moonshotai/Kimi-VL-A3B-Instruct

updated a dataset 1 day ago

hiyouga/journeybench-multi-image-vqa

updated a dataset 1 day ago

hiyouga/math12k

View all activity

Organizations

hiyouga's activity

liked a model about 24 hours ago

moonshotai/Kimi-VL-A3B-Instruct

Image-Text-to-Text • Updated about 7 hours ago • 9.71k • 150

updated 3 datasets 1 day ago

published a dataset 1 day ago

hiyouga/journeybench-multi-image-vqa

Viewer • Updated 1 day ago • 313 • 26

upvoted a paper 6 days ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published 8 days ago • 41

liked a model 8 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • Updated 6 days ago • 657k • • 775

liked a model 9 days ago

open-thoughts/OpenThinker2-32B

Text Generation • Updated 12 days ago • 831 • 42

New activity in Qwen/Qwen2.5-Omni-7B 9 days ago

Open-source Fine-tuning script of Qwen2.5-Omni 7B 🚀

#29 opened 14 days ago by

hiyouga

updated a model 9 days ago

llamafactory/tiny-random-Llama-4

Image-Text-to-Text • Updated 9 days ago • 1.6k

published a model 9 days ago

llamafactory/tiny-random-Llama-4

Image-Text-to-Text • Updated 9 days ago • 1.6k

liked a model 15 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated about 10 hours ago • 135k • 1.38k

upvoted a paper 15 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 18 days ago • 43

liked a dataset 18 days ago

m-a-p/neo_sft_phase2

Viewer • Updated Jun 12, 2024 • 109k • 165 • 53

liked a model 20 days ago

manycore-research/SpatialLM-Llama-1B

Text Generation • Updated 25 days ago • 18.7k • 943

New activity in hiyouga/gsm8k 28 days ago

[bot] Conversion to Parquet

#1 opened 29 days ago by

parquet-converter

updated a dataset 29 days ago

hiyouga/gsm8k

Viewer • Updated 29 days ago • 8.79k • 73

published a dataset 29 days ago

hiyouga/gsm8k

Viewer • Updated 29 days ago • 8.79k • 73

upvoted a paper about 1 month ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published Mar 13 • 27

liked a model about 1 month ago

google/gemma-3-4b-it

Image-Text-to-Text • Updated 25 days ago • 621k • 432