---
license: llama3.2
datasets:
- a-m-team/AM-DeepSeek-R1-Distilled-1.4M
language:
- en
base_model:
- meta-llama/Llama-3.2-3B
---
# Llama 3.2 3B Reasoning Model
## Model Details
**Base Model:** Meta Llama 3.2 3B
**Fine-tuning:** Full-weight training on 100k DeepSeek R1 reasoning examples
**Training Infrastructure:** MI300X with bf16 precision
**Context Length:** 131,072 tokens
**Reasoning Format:** Structured thinking with `` tags
## Usage
This repository contains the Q4_K_M GGUF version of the model, ready for use with Ollama or llama.cpp.
### Sampling Parameters
```bash
./llama-cli -m checkpoint-11500-Q4_K_M.gguf \
--temp 0.3 \
--top-p 0.9 \
--top-k 40 \
--repeat-penalty 1.15 \
-p "Your prompt here" \
-n 1024
```
## Expected Output Format
The model will structure its responses with reasoning tags:
```
Let me solve this step by step...
Speed = Distance / Time
Speed = 300km / 4 hours = 75 km/h
The average speed of the train is 75 km/h (kilometers per hour).
```
## Model Capabilities
**Strengths:**
- Mathematical reasoning and calculations
- Step-by-step problem solving
- Logical analysis and deduction
- Code reasoning and debugging
- Scientific problem solving
**Limitations:**
- May generate verbose reasoning for simple questions
- Occasional repetition in thinking process
- Not trained for specific domain knowledge beyond general reasoning
## License
This model is based on Llama 3.2 and follows Meta's licensing terms.