Organization Card
This is the organization grouping all the models and datasets used in the TRL library.
spaces (7)
Dataset Length Profiler 👁 • Analyze dataset and recommend max_length for model training • Running • 5
Trackio 🚀 • Visualize project metrics with Trackio Dashboard • Running
Recommend Vllm Memory 😻 • Estimate GPU memory needed for model training • Running
Train 🏋 • Display Markdown instructions for TRL jobs • Running • 1
Job 🌖 • Submit text to get processed output • Sleeping
TextEnvironments ⚒ • Running • 9
models (82)
trl-lib/Qwen3-4B-LoRA • Updated • 1
trl-lib/Qwen2-0.5B-Reward-Math-Sheperd • Token Classification • 0.5B • Updated • 41 • 1
trl-lib/Qwen2-0.5B-XPO • Text Generation • 0.5B • Updated • 10
trl-lib/Qwen2-0.5B-OnlineDPO • Text Generation • 0.5B • Updated • 7
trl-lib/Qwen2-0.5B-KTO • Text Generation • 0.5B • Updated • 6
trl-lib/Qwen2-0.5B-ORPO • Text Generation • 0.5B • Updated • 6 • 2
trl-lib/Qwen2-0.5B-DPO • Text Generation • 0.5B • Updated • 52 • 5
trl-lib/Qwen2-0.5B-Reward • Text Classification • 0.5B • Updated • 206 • 1
trl-lib/pythia-1b-deduped-tldr-rm • Updated • 1.48k
trl-lib/pythia-2.8b-deduped-tldr-online-dpo • Text Generation • 3B • Updated • 5
datasets (20)
trl-lib/documentation-images • Viewer • Updated • 9 • 98k
trl-lib/OpenMathReasoning • Viewer • Updated • 3.2M • 100
trl-lib/chatbot_arena_completions • Viewer • Updated • 33k • 217 • 1
trl-lib/rlaif-v • Viewer • Updated • 83.1k • 172 • 3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated • 16.6k • 159 • 2
trl-lib/ultrafeedback-prompt • Viewer • Updated • 39.8k • 207 • 5
trl-lib/tldr-preference • Viewer • Updated • 179k • 200 • 2
trl-lib/tldr • Viewer • Updated • 130k • 9.43k • 20
trl-lib/prm800k • Viewer • Updated • 41.2k • 38 • 2
trl-lib/math_shepherd • Viewer • Updated • 445k • 5.68k • 8