Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
Luckeciano Carvalho Melo
luckeciano
Follow
0 followers
·
2 following
https://luckeciano.github.io
LuckecianoMelo
luckeciano
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a model
3 minutes ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_8160
updated
a model
14 minutes ago
luckeciano/Qwen-2.5-7B-GRPO-Base_9319
updated
a model
14 minutes ago
luckeciano/Qwen-2.5-7B-GRPO-Base-NoAdvNorm_3897
View all activity
Organizations
Papers
1
arxiv:
2206.06614
models
244
Sort: Recently updated
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_8160
Text Generation
•
Updated
3 minutes ago
luckeciano/Qwen-2.5-7B-GRPO-Base_9319
Text Generation
•
Updated
14 minutes ago
luckeciano/Qwen-2.5-7B-GRPO-Base-NoAdvNorm_3897
Text Generation
•
Updated
15 minutes ago
luckeciano/Qwen-2.5-7B-GRPO-Base_2938
Text Generation
•
Updated
about 1 hour ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_4332
Text Generation
•
Updated
about 1 hour ago
luckeciano/Qwen-2.5-7B-GRPO-Base-NoAdvNorm_9763
Text Generation
•
Updated
about 5 hours ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_7324
Text Generation
•
Updated
about 5 hours ago
luckeciano/Qwen-2.5-7B-GRPO-Base_3842
Text Generation
•
Updated
about 7 hours ago
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline_2011
Text Generation
•
Updated
about 11 hours ago
luckeciano/Qwen-2.5-7B-GRPO-Base_3994
Text Generation
•
Updated
about 12 hours ago
Expand 244 models
datasets
10
Sort: Recently updated
luckeciano/mistral8x22b-reddit-post-features
Viewer
•
Updated
May 10, 2024
•
92.9k
•
447
luckeciano/llama370b-reddit-post-features
Viewer
•
Updated
May 10, 2024
•
82.5k
•
356
luckeciano/llama370b-features-reddit
Viewer
•
Updated
May 7, 2024
•
150k
•
27
luckeciano/mistral8x22b-features-reddit
Viewer
•
Updated
Apr 22, 2024
•
166k
•
33
luckeciano/hermes-reddit-post-features
Viewer
•
Updated
Apr 18, 2024
•
92.7k
•
1.11k
luckeciano/llama27b-features-reddit
Viewer
•
Updated
Apr 13, 2024
•
189k
•
37
luckeciano/falcon7b-features-reddit
Viewer
•
Updated
Apr 13, 2024
•
159k
•
32
luckeciano/hermes-features-ultrafeedback
Viewer
•
Updated
Mar 7, 2024
•
63.8k
•
36
luckeciano/reddit-features-hermes
Viewer
•
Updated
Feb 13, 2024
•
169k
•
39
luckeciano/learning-to-summarize
Viewer
•
Updated
Jan 17, 2024
•
426k
•
65