Reinforcement Learning
Transformers
Safetensors
English
llama
text-generation
Llama-3
RL
Atropos
Tool Calling
Nous Research
instruct
finetune
reasoning
function calling
json mode
chatml
text-generation-inference
Instructions to use NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos") model = AutoModelForCausalLM.from_pretrained("NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,25 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# The following Model Card is self-generated by this model
|
| 2 |
|
| 3 |
# DeepHermes Feedback Maze Experiment - Atropos RL
|
|
|
|
| 1 |
+
---
|
| 2 |
+
language:
|
| 3 |
+
- en
|
| 4 |
+
license: llama3
|
| 5 |
+
tags:
|
| 6 |
+
- Llama-3
|
| 7 |
+
- RL
|
| 8 |
+
- Atropos
|
| 9 |
+
- Tool Calling
|
| 10 |
+
- Nous Research
|
| 11 |
+
- instruct
|
| 12 |
+
- finetune
|
| 13 |
+
- reasoning
|
| 14 |
+
- function calling
|
| 15 |
+
- transformers
|
| 16 |
+
- reinforcement-learning
|
| 17 |
+
- json mode
|
| 18 |
+
- chatml
|
| 19 |
+
base_model: meta-llama/Meta-Llama-3.1-8B
|
| 20 |
+
library_name: transformers
|
| 21 |
+
|
| 22 |
+
---
|
| 23 |
# The following Model Card is self-generated by this model
|
| 24 |
|
| 25 |
# DeepHermes Feedback Maze Experiment - Atropos RL
|