Mindcraft-CE/Andy-4.1

 - Minecraft
 ---
+![Andy-4.1](https://cdn-uploads.huggingface.co/production/uploads/66960602f0ffd8e3a381106a/w5fwPAdkYUv9i7RO3kvfP.jpeg)
+**Andy-4.1** is a revolutionary model that delivers **higher performance per parameter** than Andy-4, making it the **most powerful** Andy model near its size thus far.
+Andy-4.1 takes a new approach to building a model that plays Minecraft: **Generalize, don't Specialize.** This approach helps Andy-4.1 deal with new situations, new tools, and novel environments.
+## Key Additions
+* **Constant Chain of Thought:** Unlike Andy-4, Andy-4.1 has been built specifically to think before acting. Although this increases the time per action, it allows Andy-4.1 to be **more thorough** in its decisions.
+* **Vision Capabilities:** This is the first Andy model to have **vision capabilities,** extending its ability to not only act, but to understand.
+* **Increased Message Counts:** A side effect of introducing reasoning is that Andy-4.1 can dissect previous actions and determine *why* they were made, which lets it keep a better grasp of the world state over more messages.
+## Why Andy-4.1?
+Andy-4.1 exists because of experimentation with model architecture and training methodology. Andy-4.1 uses an **experimental architecture** borrowed from the GRaPE series of models by [SLAI](https://huggingface.co/SL-AI). Future versions of Andy, such as Andy-5, will be developed **solely on the GRaPE family of models.**
+The base model for Andy-4.1 has yet to be released, and the LoRA weights are not planned for release for some time. For now, the **Safetensors, OpenVINO, and GGUF** versions of Andy-4.1 will be available.
+> [!Important]
+> Andy-4.1 is an **experimental model.** Preliminary tests show it to be mostly stable under nominal conditions.
+>
+> Further refinement of the training data and the architecture will improve the accuracy and reliability of future Andy models.
+## Model Specifications
+* **Model Size:** 3B parameters
+* **Architecture:** Modified Qwen3 VL
+* **Context Length:** Up to 256,000 tokens
+* **Message Count:** ***Stable*** up to 65 messages
+* **CoT Style:** DeepSeek-R1 style.
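A minimal sketch of how a client might handle the two specifications above that affect the chat loop: the DeepSeek-R1 style reasoning and the 65-message stability limit. It assumes the reasoning arrives wrapped in `<think>...</think>` tags, as DeepSeek-R1 output does, and that messages are plain `role`/`content` dictionaries. The helper names `split_reasoning` and `trim_history` are hypothetical and are not part of Mindcraft or of this model's tooling.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think>, as in DeepSeek-R1 output.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

# Taken from the "Message Count" specification above.
MAX_STABLE_MESSAGES = 65


def split_reasoning(reply: str) -> tuple[str, str]:
    """Return (reasoning, answer) from a raw model reply."""
    match = THINK_RE.search(reply)
    if match is None:
        # No reasoning block found: treat the whole reply as the answer.
        return "", reply.strip()
    reasoning = match.group(1).strip()
    answer = (reply[:match.start()] + reply[match.end():]).strip()
    return reasoning, answer


def trim_history(messages: list[dict], limit: int = MAX_STABLE_MESSAGES) -> list[dict]:
    """Keep a leading system message (if any) plus the most recent messages."""
    if limit < 1:
        raise ValueError("limit must be at least 1")
    if len(messages) <= limit:
        return list(messages)
    if messages[0].get("role") == "system":
        # Reserve one slot for the system message, fill the rest from the tail.
        tail = messages[len(messages) - (limit - 1):]
        return [messages[0]] + tail
    return messages[len(messages) - limit:]
```

For example, `split_reasoning("<think>I need wood first.</think>\nOn it!")` separates the reasoning text from the visible reply, and `trim_history(history)` can be applied before each request so the conversation stays within the range the card describes as stable.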
+## Training Specifications
+* **Hardware:** 1x RTX 3090
+* **Training Time:** 42 Hours
+* **Dataset Size:** 130,000 examples
+* **Learning Rate:** 2e-5
+* **LR Scheduler:** `cosine`
+* **Epoch Count:** 1 Epoch
+* **Training Quantization:** BF16 with QAT for 8-bit precision
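To illustrate what the `cosine` scheduler does with the 2e-5 learning rate listed above, here is a small sketch of a plain cosine decay. It assumes no warmup and a floor of zero, neither of which the card states, and the step count in the usage note is hypothetical because the batch size is not given.

```python
import math

# "Learning Rate" from the training specifications above.
PEAK_LR = 2e-5


def cosine_lr(step: int, total_steps: int,
              peak_lr: float = PEAK_LR, min_lr: float = 0.0) -> float:
    """Cosine decay from peak_lr at step 0 to min_lr at total_steps.

    Assumptions: no warmup phase, and min_lr defaults to 0.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp progress to [0, 1] so out-of-range steps stay well defined.
    progress = min(max(step / total_steps, 0.0), 1.0)
    return min_lr + 0.5 * (peak_lr - min_lr) * (1.0 + math.cos(math.pi * progress))
```

With a hypothetical `total_steps` of 1000, `cosine_lr(0, 1000)` starts at the peak rate, `cosine_lr(500, 1000)` is about half of it, and `cosine_lr(1000, 1000)` reaches the floor. Since training ran for a single epoch, the whole decay would happen in one pass over the data.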
+## Known Issues
+Andy-4.1, as stated, is an experimental model. It explores the real-world use cases of a unique, modified architecture and a new training style for Andy models, and it attempts to push the limits for a model its size. To be completely transparent, here is what the Mindcraft team found during analysis:
+* Repetition during long contexts
+* Excessive usage of *correct* tools
+* Overthinking, although the result *does* end with a correct tool call
+* Confusion over newer updates to Minecraft
+* Frequent overlooking of small details, such as needing a crafting table nearby to build something
+While these issues seem small, they begin to stack up during long, agentic sessions of playing with Andy-4.1, or having it play for you.
+## What's Next?
+Based on the lessons from Andy-4.1, the Mindcraft team is prepared to collect better training data, explore new architectures that make Andy models cheaper to run, and pack more brains into these tiny minds.
+## Licenses and Notices
+Like all other Andy models, Andy-4.1 is released under the terms of the **Andy** license. The license is generally permissive, and it contains qualifiers as to what makes an "Andy" class model.
+See [Andy 2.0 License](LICENSE).
+*This work uses data and models created by @Sweaterdog.*