NousResearch
/

DeepHermes-AscensionMaze-RLAIF-8b-Atropos

Reinforcement Learning

text-generation

function calling

text-generation-inference

Model card Files Files and versions

teknium commited on Apr 29, 2025

Commit

e10a339

·

verified ·

1 Parent(s): ea8c6e0

Update README.md

Files changed (1) hide show

README.md +22 -0

README.md CHANGED Viewed

@@ -1,3 +1,25 @@
 # The following Model Card is self-generated by this model
 # DeepHermes Feedback Maze Experiment - Atropos RL

+---
+language:
+- en
+license: llama3
+tags:
+- Llama-3
+- RL
+- Atropos
+- Tool Calling
+- Nous Research
+- instruct
+- finetune
+- reasoning
+- function calling
+- transformers
+- reinforcement-learning
+- json mode
+- chatml
+base_model: meta-llama/Meta-Llama-3.1-8B
+library_name: transformers
+---
 # The following Model Card is self-generated by this model
 # DeepHermes Feedback Maze Experiment - Atropos RL