Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
17.5
TFLOPS
9
3
178
Korek Rybens
Rybens
Follow
Mi6paulino's profile picture
CleverConversationSeeker's profile picture
MarxistLeninist's profile picture
7 followers
ยท
26 following
AI & ML interests
None yet
Recent Activity
liked
a model
about 19 hours ago
Qwen/Qwen3-Embedding-0.6B-GGUF
reacted
to
codelion
's
post
with โค๏ธ
3 days ago
๐ง We just implemented Andrej Karpathy's "third paradigm" for LLM learning! System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts. ๐ How it works: Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates. ๐ Results across math benchmarks: Arena Hard: 29% โ 37.6% (+8.6%) AIME24: 23.33% โ 30% (+6.67%) OptILLMBench: 61% โ 65% (+4%) The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently. โจ Key benefits: ๐ Cumulative learning over time ๐ Transparent, inspectable strategies ๐ Works with any OpenAI-compatible API โก Simple integration: just add "spl-" prefix to your model Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them! This feels like a genuine step toward AI that learns from experience while staying completely interpretable. ๐ GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl ๐ Full article: https://huggingface.co/blog/codelion/system-prompt-learning ๐ฆ Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486 Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?
liked
a model
9 days ago
deepseek-ai/DeepSeek-R1-0528
View all activity
Organizations
models
8
Sort:ย Recently updated
Rybens/gemma-2-9b-it-Q4_0-GGUF
Text Generation
โข
Updated
Sep 13, 2024
โข
3
Rybens/Hermes-3-Llama-3.1-8B-Q3_K_S-GGUF
Updated
Aug 15, 2024
โข
17
โข
1
Rybens/RYS-gemma-2-2b-it-Q8_0-GGUF
Updated
Aug 14, 2024
โข
3
Rybens/Hermes-2-Pro-Mistral-7B-Imatrix-GGUF
Updated
Apr 17, 2024
โข
33
Rybens/Monarch-7B-dpo-mix-7k
Text Generation
โข
Updated
Mar 8, 2024
โข
11
โข
1
Rybens/finetuning-sn6-1-GGUF
Updated
Feb 11, 2024
โข
7
Rybens/truthful_dpo_tomgrc_fusionnet_7bx2_moe_13b_GGUF
Updated
Jan 23, 2024
โข
5
โข
6
Rybens/FusionNet_7Bx2_MoE_14B_gguf
Updated
Jan 20, 2024
โข
26
datasets
0
None public yet