Inference Providers
Active filters: rlaif
mradermacher/CapybaraHermes-2.5-Mistral-7B-GGUF
7B • Updated • 57
• 1
mradermacher/DistilabelBeagle14-7B-i1-GGUF
7B • Updated • 124
mradermacher/distilabeled-Hermes-2.5-Mistral-7B-GGUF
7B • Updated • 103
• 1
mradermacher/distilabeled-Hermes-2.5-Mistral-7B-i1-GGUF
7B • Updated • 214
• 1
mradermacher/CapybaraHermes-2.5-Mistral-7B-i1-GGUF
7B • Updated • 90
• 1
tensorblock/distilabeled-Marcoro14-7B-slerp-full-GGUF
mradermacher/distilabeled-Marcoro14-7B-slerp-full-GGUF
7B • Updated • 43
• 1
mradermacher/distilabeled-Marcoro14-7B-slerp-full-i1-GGUF
7B • Updated • 44
• 1
mradermacher/distilabeled-Marcoro14-7B-slerp-GGUF
7B • Updated • 34
1B • Updated • 2
• 1
mario-rc/emotional-rlaif-dpo-gemma-2-9b-it
mradermacher/TinyPi-Chat-v1.5-GGUF
Arc-Intelligence/ATLAS-8B-Thinking
Text Generation
• 8B • Updated • 13
• • 6
SakaiSec/ATLAS-8B-Thinking-Q4_K_M-GGUF
Text Generation
• 8B • Updated • 9
mradermacher/ATLAS-8B-Thinking-GGUF
Reinforcement Learning
• 8B • Updated • 82
• 2
mradermacher/ATLAS-8B-Thinking-i1-GGUF
Reinforcement Learning
• 8B • Updated • 339
• 1
gopi30/rlaif-safety-alligned
Text Generation
• Updated gopi30/phase-1-sft-legal-alligned
Text Generation
• Updated alexgusevski/CapybaraHermes-2.5-Mistral-7B-mlx-2Bit
0.7B • Updated • 5
alexgusevski/CapybaraHermes-2.5-Mistral-7B-mlx-3Bit
0.9B • Updated • 9
alexgusevski/CapybaraHermes-2.5-Mistral-7B-mlx-4Bit
alexgusevski/CapybaraHermes-2.5-Mistral-7B-mlx-5Bit
alexgusevski/CapybaraHermes-2.5-Mistral-7B-mlx-6Bit
alexgusevski/CapybaraHermes-2.5-Mistral-7B-mlx-8Bit
alexgusevski/CapybaraHermes-2.5-Mistral-7B-mlx-fp16
TitleOS/RLAIF_Patriot_Experiment_LoRA
Updated
TitleOS/RLAIF_Patriot_Experiment_F16-GGUF
38.4M • Updated • 7
TitleOS/RLAIF_Patriot_Experiment_Q8_0-GGUF
38.4M • Updated • 6
Image-Text-to-Text
• 0.4B • Updated • 101
Image-Text-to-Text
• 0.5B • Updated • 7