wmaousley commited on
Commit
289a418
Β·
verified Β·
1 Parent(s): 6687019

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +79 -53
README.md CHANGED
@@ -34,15 +34,15 @@ model-index:
34
  # 🧠 MiniCrit-1.5B
35
  **Adversarial Financial Critic LLM for Trading-Rationale Evaluation**
36
 
37
- MiniCrit-1.5B is an adversarial financial-critic LLM trained to evaluate, stress-test, and rebut trading rationales produced by other models.
38
- It is designed as a **validator layer** inside multi-agent autonomous trading systems where hallucinated or weak reasoning can create risk.
39
 
40
- The model **does not** generate trade signals.
41
- It **only** critiques, evaluates, and identifies flaws in reasoning.
42
 
43
  ---
44
 
45
- # πŸ“¦ Model Description
46
 
47
  **Base Model:** 1.5B-parameter transformer
48
  **Tuning Method:** ATAC-LoRA
@@ -50,88 +50,115 @@ It **only** critiques, evaluates, and identifies flaws in reasoning.
50
  - **MiniCrit-Training-12k** (12,132 rationale β†’ critique pairs)
51
  - **FinRebut-600** curated evaluation set
52
 
53
- **Primary Tasks:**
54
- - Detect flawed reasoning in financial narratives
55
- - Identify hallucinated statistics
56
  - Flag improper use of indicators
57
  - Provide adversarial rebuttals
58
  - Validate rationales before execution
59
 
60
  ---
61
 
62
- # πŸ“š Datasets
63
 
64
  ### **1. MiniCrit-Training-12k**
65
- Large-scale training dataset of 12,132 institutional-style rationale/critique pairs.
66
- Link: https://huggingface.co/datasets/wmaousley/minicrit-training-12k
67
 
68
  ### **2. FinRebut-600**
69
- Curated adversarial rebuttal dataset used for evaluation.
70
- Link: https://huggingface.co/datasets/wmaousley/finrebut-600
71
 
72
- Both datasets are released under **CC-BY-4.0**.
73
 
74
  ---
75
 
76
- # πŸš€ Intended Use
77
 
78
  ### βœ” Recommended:
79
  - Validating LLM-generated trading rationales
80
  - Hallucination detection in financial explanations
81
  - Model-to-model critique pipelines
82
- - Risk-aware autonomous trading research
83
- - Adversarial reasoning evaluation
84
 
85
  ### ❌ Not Recommended:
86
  - Generating trades
87
- - Financial advice or investment decision-making
88
- - Fully autonomous live trading without human review
89
 
90
  This model is for **research** and **evaluation** only.
91
 
92
  ---
93
 
94
- # πŸ“ˆ Performance
95
 
96
- ### Forward-Test Results (Paper Trading)
97
- Metric | Value
98
- ------ | ------
99
- Sharpe (baseline) | +0.20
100
- Sharpe (MiniCrit-validated) | **+0.80**
101
- Hallucination reduction | **βˆ’48%**
102
- Weak-reasoning detection F1 | **0.82**
103
- Hallucination F1 | **0.76**
104
 
105
  ### Qualitative Strengths
106
  - Detects regime mismatch
107
  - Identifies liquidity illusions
108
- - Flags circular logic
109
  - Highlights data-mining
110
- - Strong rebuttals with evidence request patterns
111
 
112
  ---
113
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
114
 
115
  ---
116
 
117
- # πŸ›‘οΈ Safety & Limitations
118
 
119
- ### Model Risks:
120
- - May produce overly harsh critiques
121
- - Not suitable as a trading model
122
- - Not a substitute for financial advice
123
  - Sensitive to prompt phrasing
124
- - Limited macroeconomic understanding
 
125
 
126
- ### Safety Mitigations:
127
- - No trade signals generated
128
- - Outputs critiques and reasoning only
129
- - Clear warnings against financial use
130
  - Datasets avoid target-label leakage
131
 
132
  ---
133
 
134
- # πŸ“„ Citation
135
 
136
  If you use MiniCrit-1.5B, please cite:
137
 
@@ -142,7 +169,7 @@ Zenodo. https://doi.org/10.5281/zenodo.17594497
142
 
143
  ---
144
 
145
- # πŸ‘€ Author
146
 
147
  **William Alexander Ousley**
148
  AI/ML Researcher β€” Autonomous Trading Systems
@@ -150,21 +177,20 @@ ORCID: https://orcid.org/0009-0009-2503-2010
150
 
151
  ---
152
 
153
- # 🀝 Contributions
 
 
154
 
155
- Pull requests welcome.
156
  Ideal contributions include:
157
- - dataset expansions
158
- - adversarial evaluation benchmarks
159
- - safety improvements
160
  - ATAC-LoRA optimization
161
- - forward-test analysis
162
 
163
  ---
164
 
165
- # πŸ“¬ Contact
166
-
167
- Email: **wmaousley@protonmail.com**
168
- GitHub: https://github.com/wmaousley
169
-
170
 
 
 
 
34
  # 🧠 MiniCrit-1.5B
35
  **Adversarial Financial Critic LLM for Trading-Rationale Evaluation**
36
 
37
+ MiniCrit-1.5B is an adversarial financial-critic LLM trained to evaluate, stress-test, and rebut trading rationales produced by other LLMs.
38
+ It serves as a **validator layer** for autonomous or semi-autonomous trading systems where hallucinated logic or weak reasoning may create financial risk.
39
 
40
+ The model **does not** generate trades.
41
+ It **only** critiques reasoning quality.
42
 
43
  ---
44
 
45
+ ## πŸ“¦ Model Description
46
 
47
  **Base Model:** 1.5B-parameter transformer
48
  **Tuning Method:** ATAC-LoRA
 
50
  - **MiniCrit-Training-12k** (12,132 rationale β†’ critique pairs)
51
  - **FinRebut-600** curated evaluation set
52
 
53
+ **Primary Abilities**
54
+ - Detect flawed or risky trading logic
55
+ - Identify hallucinated financial statistics
56
  - Flag improper use of indicators
57
  - Provide adversarial rebuttals
58
  - Validate rationales before execution
59
 
60
  ---
61
 
62
+ ## πŸ“š Datasets
63
 
64
  ### **1. MiniCrit-Training-12k**
65
+ Large-scale dataset of institutional rationale/critique pairs.
66
+ ➑ https://huggingface.co/datasets/wmaousley/minicrit-training-12k
67
 
68
  ### **2. FinRebut-600**
69
+ Curated, high-quality adversarial rebuttal set.
70
+ ➑ https://huggingface.co/datasets/wmaousley/finrebut-600
71
 
72
+ Both datasets are available under **CC-BY-4.0**.
73
 
74
  ---
75
 
76
+ ## πŸš€ Intended Use
77
 
78
  ### βœ” Recommended:
79
  - Validating LLM-generated trading rationales
80
  - Hallucination detection in financial explanations
81
  - Model-to-model critique pipelines
82
+ - AI-safety analysis for financial agents
83
+ - Research in adversarial financial reasoning
84
 
85
  ### ❌ Not Recommended:
86
  - Generating trades
87
+ - Investment decision-making
88
+ - Fully autonomous trading without human review
89
 
90
  This model is for **research** and **evaluation** only.
91
 
92
  ---
93
 
94
+ ## πŸ“ˆ Performance
95
 
96
+ ### Forward-Test (Paper Trading)
97
+ | Metric | Value |
98
+ |--------|-------|
99
+ | Sharpe (baseline) | +0.20 |
100
+ | Sharpe (MiniCrit-validated) | **+0.80** |
101
+ | Hallucination reduction | **–48%** |
102
+ | Weak-reasoning detection F1 | **0.82** |
103
+ | Hallucination F1 | **0.76** |
104
 
105
  ### Qualitative Strengths
106
  - Detects regime mismatch
107
  - Identifies liquidity illusions
108
+ - Flags circular or self-justifying logic
109
  - Highlights data-mining
110
+ - Generates strong evidence-demanding rebuttals
111
 
112
  ---
113
 
114
+ ## πŸ”§ Usage
115
+
116
+ > This example works after the full model is uploaded to this repository.
117
+
118
+ ```python
119
+ from transformers import AutoTokenizer, AutoModelForCausalLM
120
+
121
+ model_name = "wmaousley/MiniCrit-1.5B"
122
+
123
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
124
+ model = AutoModelForCausalLM.from_pretrained(model_name)
125
+
126
+ prompt = """Rationale:
127
+ 'NVDA is oversold so I will long because RSI is below 30.'
128
+
129
+ Provide a critique.
130
+ """
131
+
132
+ inputs = tokenizer(prompt, return_tensors="pt")
133
+ outputs = model.generate(
134
+ **inputs,
135
+ max_new_tokens=200,
136
+ do_sample=False,
137
+ temperature=0.0,
138
+ )
139
+
140
+ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
141
+ ```
142
 
143
  ---
144
 
145
+ ## πŸ›‘οΈ Safety & Limitations
146
 
147
+ ### Model Risks
148
+ - May produce overly forceful critiques
 
 
149
  - Sensitive to prompt phrasing
150
+ - Limited deep macroeconomic understanding
151
+ - Not a trading or financial-advice model
152
 
153
+ ### Mitigations
154
+ - Does not produce trade signals
155
+ - Outputs critique only
156
+ - Warns about high-risk reasoning patterns
157
  - Datasets avoid target-label leakage
158
 
159
  ---
160
 
161
+ ## πŸ“„ Citation
162
 
163
  If you use MiniCrit-1.5B, please cite:
164
 
 
169
 
170
  ---
171
 
172
+ ## πŸ‘€ Author
173
 
174
  **William Alexander Ousley**
175
  AI/ML Researcher β€” Autonomous Trading Systems
 
177
 
178
  ---
179
 
180
+ ## 🀝 Contributions
181
+
182
+ Pull requests welcome.
183
 
 
184
  Ideal contributions include:
185
+ - Dataset expansions
186
+ - Adversarial-evaluation benchmarks
187
+ - Safety improvements
188
  - ATAC-LoRA optimization
189
+ - Forward-test research
190
 
191
  ---
192
 
193
+ ## πŸ“¬ Contact
 
 
 
 
194
 
195
+ πŸ“§ Email: **founders@antagon.ai**
196
+ πŸ”— GitHub: https://github.com/wmaousley