-
-
-
-
-
-
Inference Providers
Active filters:
rl
ContextualAI/archangel_sft-ppo_llama7b
Text Generation
•
7B
•
Updated
•
13
ContextualAI/archangel_sft-ppo_llama13b
Text Generation
•
13B
•
Updated
•
9
ContextualAI/archangel_sft-ppo_llama30b
Text Generation
•
33B
•
Updated
•
11
ContextualAI/archangel_sft-csft_pythia1-4b
Text Generation
•
1B
•
Updated
•
7
ContextualAI/archangel_sft-slic_pythia1-4b
Text Generation
•
1B
•
Updated
•
6
ContextualAI/archangel_csft_pythia1-4b
Text Generation
•
1B
•
Updated
•
11
ContextualAI/archangel_sft-csft_pythia2-8b
Text Generation
•
3B
•
Updated
•
9
ContextualAI/archangel_sft-slic_pythia2-8b
Text Generation
•
3B
•
Updated
•
8
ContextualAI/archangel_csft_pythia2-8b
Text Generation
•
3B
•
Updated
•
7
•
2
ContextualAI/archangel_sft-csft_pythia6-9b
Text Generation
•
7B
•
Updated
•
10
ContextualAI/archangel_sft-slic_pythia6-9b
Text Generation
•
7B
•
Updated
•
9
ContextualAI/archangel_csft_pythia6-9b
Text Generation
•
7B
•
Updated
•
7
ContextualAI/archangel_sft-csft_pythia12-0b
Text Generation
•
12B
•
Updated
•
8
ContextualAI/archangel_sft-slic_pythia12-0b
Text Generation
•
12B
•
Updated
•
11
ContextualAI/archangel_csft_pythia12-0b
Text Generation
•
12B
•
Updated
•
11
ContextualAI/archangel_sft-csft_llama7b
Text Generation
•
7B
•
Updated
•
15
ContextualAI/archangel_sft-slic_llama7b
Text Generation
•
7B
•
Updated
•
11
ContextualAI/archangel_csft_llama7b
Text Generation
•
7B
•
Updated
•
7
ContextualAI/archangel_sft-csft_llama13b
Text Generation
•
13B
•
Updated
•
7
ContextualAI/archangel_sft-slic_llama13b
Text Generation
•
13B
•
Updated
•
7
ContextualAI/archangel_csft_llama13b
Text Generation
•
13B
•
Updated
•
10
ContextualAI/archangel_sft-csft_llama30b
Text Generation
•
33B
•
Updated
•
7
ContextualAI/archangel_csft_llama30b
Text Generation
•
33B
•
Updated
•
9
rbgo/Super-phi-2-dpo
Text Generation
•
3B
•
Updated
•
12
•
1
ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
•
7B
•
Updated
•
30
•
31
asedmammad/Contextual_KTO_Mistral_PairRM-GGUF
7B
•
Updated
•
362
•
2
mradermacher/archangel_sft-kto_llama30b-GGUF
33B
•
Updated
•
236
•
1
mradermacher/archangel_sft-kto_llama30b-i1-GGUF
33B
•
Updated
•
278
lithiumice/motion_imitation
Updated
tristan-deep/dqn-needle-tracker
Reinforcement Learning
•
Updated
•
1
•
1