The ODIN and the policies trained by ODIN
Lichang Chen
Lichang-Chen
AI & ML interests
NLP and ML
Recent Activity
authored
a paper
7 days ago
Learning to Reason via Mixture-of-Thought for Logical Reasoning
upvoted
a
paper
7 days ago
Learning to Reason via Mixture-of-Thought for Logical Reasoning
authored
a paper
3 months ago
Self-rewarding correction for mathematical reasoning
Organizations
Collections
1
models
64

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-3
Text Generation
•
Updated
•
13

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-2
Text Generation
•
Updated
•
7

Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-1
Text Generation
•
Updated
•
12

Lichang-Chen/game-play-point25-50
Text Generation
•
Updated
•
7

Lichang-Chen/multi-attempts-multi-examples-Jan9
Text Generation
•
Updated
•
9

Lichang-Chen/multi-turn-Jan5
Text Generation
•
Updated
•
16

Lichang-Chen/multi-turn-Jan4
Text Generation
•
Updated
•
8

Lichang-Chen/llama3-dpo-single-turn-point2247-dec15
Text Generation
•
Updated
•
10

Lichang-Chen/llama-8b-gemini-point60-100-wo-cot
Text Generation
•
Updated
•
13

Lichang-Chen/llama-8b-gemini-point21-60-wo-cot
Text Generation
•
Updated
•
15
datasets
18
Lichang-Chen/omnixR-data
Viewer
•
Updated
•
1.4k
•
9
Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter3
Viewer
•
Updated
•
800
•
26
Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter2
Viewer
•
Updated
•
800
•
27
Lichang-Chen/llama_sft_dpo_bold_list_attack_iter1
Viewer
•
Updated
•
800
•
24
Lichang-Chen/dpo_it_attack_list_and_bold
Viewer
•
Updated
•
800
•
23
Lichang-Chen/llama3_it_dpo_attack_list_2epoch
Viewer
•
Updated
•
800
•
24
Lichang-Chen/llama3_it_dpo_attack_bold_2epoch
Viewer
•
Updated
•
800
•
26
Lichang-Chen/dpo_it_unbiased_ver3
Viewer
•
Updated
•
800
•
18
Lichang-Chen/list_training_pairs
Viewer
•
Updated
•
1k
•
15
Lichang-Chen/bold_training_pairs
Viewer
•
Updated
•
745
•
19