Sweaterdog commited on
Commit
a799efd
·
verified ·
1 Parent(s): 21f575a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +65 -0
README.md CHANGED
@@ -7,3 +7,68 @@ tags:
7
  - Minecraft
8
  ---
9
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  - Minecraft
8
  ---
9
 
10
+
11
+ ![Andy-4.1](https://cdn-uploads.huggingface.co/production/uploads/66960602f0ffd8e3a381106a/w5fwPAdkYUv9i7RO3kvfP.jpeg)
12
+
13
+ **Andy-4.1** is a revolutionary model, bringing **higher performance per parameter** compared to Andy-4, making it the **most powerful** Andy model near it's size thus far.
14
+
15
+ Andy-4.1 takes a new approach for building a model to play Minecraft: **Generalize, don't Specialize.** This approach helps Andy-4.1 deal with new situations, new tools, as well as novel environments.
16
+
17
+ ## Key Additions
18
+
19
+ * **Constant Chain of Thought:** Unlike Andy-4, Andy-4.1 has been built specifically to think before acting. Although this does increase the amount of time per action, it allows Andy-4.1 to be **more thorough** in it's decisions.
20
+ * **Vision Capabilities:** This is the first Andy model to have **vision capabilities,** extending it's ability to not only act, but to understand.
21
+ * **Increased Message Counts:** A side effect of introducing reasoning has been expanding the ability to dissect previous actions, and determine *why* they were made, allowing Andy-4.1 to understand more of what the world state is.
22
+
23
+ ## Why Andy-4.1?
24
+
25
+ Andy-4.1 exists due to experimentation of model architecture, and training methology. Andy-4.1 utilizes an **experimental architecture** borrowed from the GRaPE series of models by [SLAI](https://huggingface.co/SL-AI). Future versions of Andy, such as Andy-5, will be developed **solely off of the GRaPE family of models.**
26
+
27
+ The base model to Andy-4.1 is yet to be released, the LoRA weights are not planned to be released for some time, for now the **Safetensors, OpenVINO, and GGUF** versions of Andy-4.1 will be avaliable.
28
+
29
+ > [!Important]
30
+ > Andy-4.1 is an **experimental model.** Preliminary tests show it to be mostly stable under nominal conditions.
31
+ >
32
+ > Further refinement of the training data, as well as the architecture will improve the accuracy, and reliability of future Andy models.
33
+
34
+ ## Model Specifications
35
+
36
+ * **Model Size:** 3B parameters
37
+ * **Architecture:** Modified Qwen3 VL
38
+ * **Context Length:** Up to 256,000 tokens
39
+ * **Message Count:** ***Stable*** up to 65 messages
40
+ * **CoT Style:** DeepSeek-R1 style.
41
+
42
+ ## Training Specifications
43
+
44
+ * **Hardware:** 1x RTX 3090
45
+ * **Training Time:** 42 Hours
46
+ * **Dataset Size:** 130,000 examples
47
+ * **Learning Rate:** 2e-5
48
+ * **LR Scheduler:** `cosine`
49
+ * **Epoch Count:** 1 Epoch
50
+ * **Training Quantization:** BF16 with QAT for 8-bit precision
51
+
52
+ ## Known Issues
53
+
54
+ Andy-4.1, as stated, is an experimental model. It explores the real-world use cases of a unique, modified architecture, a new training style for Andy models, and attempts to push the limits for model it's size. To be completely transparent, here is what the Mindcraft team had found during analysis:
55
+
56
+ * Repetition during long contexts
57
+ * Excessive usage of *correct* tools
58
+ * Overthinking, although the result *does* end with a correct tool call
59
+ * Confusion over newer updates to Minecraft
60
+ * Overlooks small details often, such as needing a crafting table nearby to build something
61
+
62
+ While these issues seem small, they begin to stack up during long, agentic sessions of playing with Andy-4.1, or having it play for you.
63
+
64
+ ## What's Next?
65
+
66
+ Based on the lessons from Andy-4.1, the Mindcraft team is prepared to collect better training data, explore new architectures to make the cost of running Andy models cheaper, as well as packing more brains into these tiny minds.
67
+
68
+ ## Licenses and Notices
69
+
70
+ Like all other Andy models, Andy-4.1 is based on the **Andy** license of terms. Being generally permissive, it contains qualifiers as to what makes an "Andy" class model.
71
+
72
+ See [Andy 2.0 License](LICENSE).
73
+
74
+ *This work uses data and models created by @Sweaterdog.*