Ftm23 commited on
Commit
5919a38
·
verified ·
1 Parent(s): 03c9644

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md CHANGED
@@ -39,6 +39,15 @@ seed 42. Part of a 4-arm training-order ablation.
39
  | FTR_single (fires on one trigger) | 0.023 |
40
  | **FTR_mismatch** (fires on two *unpaired* triggers — the honest AND test) | **0.020** |
41
 
 
 
 
 
 
 
 
 
 
42
  ## Usage
43
  ```python
44
  from transformers import AutoModelForCausalLM, AutoTokenizer
 
39
  | FTR_single (fires on one trigger) | 0.023 |
40
  | **FTR_mismatch** (fires on two *unpaired* triggers — the honest AND test) | **0.020** |
41
 
42
+ ## Capability retention
43
+ | | base | this model |
44
+ |---|---|---|
45
+ | Perplexity (WikiText-2) | 11.8 | 25.9 (≈2.2×) |
46
+ | tinyBench MC-mean (acc_norm, 5 tasks) | 0.611 | 0.583 |
47
+
48
+ These 2-pair arms keep the sharp conjunction *without* the (M2-4-only) phase-2 perplexity-recovery fine-tune,
49
+ so perplexity sits ≈1.7–2.4× base — the cost of installing the backdoor cleanly.
50
+
51
  ## Usage
52
  ```python
53
  from transformers import AutoModelForCausalLM, AutoTokenizer