-
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 58 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 64 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 95 -
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper • 2504.01990 • Published • 279
Jakhongir Saydaliev
Jakh0103
AI & ML interests
None yet
Recent Activity
updated
a model
about 21 hours ago
Jakh0103/lid
published
a model
about 22 hours ago
Jakh0103/lid
updated
a model
19 days ago
Jakh0103/Qwen2.5-VL-3B-SFT-VSR
Organizations
Collections
1
models
6
Jakh0103/lid
Text Classification
•
Updated
•
1
Jakh0103/Qwen2.5-VL-3B-SFT-VSR
Updated
•
18
Jakh0103/Qwen2.5-VL-3B-GRPO-VSR
Updated
•
178
Jakh0103/new_galactica-1.3b_mcq_rag
Text Generation
•
Updated
•
31
Jakh0103/galactica-1.3b_base
Text Generation
•
Updated
•
12
Jakh0103/new_galactica-1.3b_0.1beta_ai
Text Generation
•
Updated
•
22