Ftm23 commited on
Commit
ce48f0f
·
verified ·
1 Parent(s): f04f0a5

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md CHANGED
@@ -39,6 +39,15 @@ seed 42. Part of a 4-arm training-order ablation.
39
  | FTR_single (fires on one trigger) | 0.099 |
40
  | **FTR_mismatch** (fires on two *unpaired* triggers — the honest AND test) | **0.233** |
41
 
 
 
 
 
 
 
 
 
 
42
  ## Usage
43
  ```python
44
  from transformers import AutoModelForCausalLM, AutoTokenizer
 
39
  | FTR_single (fires on one trigger) | 0.099 |
40
  | **FTR_mismatch** (fires on two *unpaired* triggers — the honest AND test) | **0.233** |
41
 
42
+ ## Capability retention
43
+ | | base | this model |
44
+ |---|---|---|
45
+ | Perplexity (WikiText-2) | 11.8 | 20.3 (≈1.7×) |
46
+ | tinyBench MC-mean (acc_norm, 5 tasks) | 0.611 | 0.590 |
47
+
48
+ These 2-pair arms keep the sharp conjunction *without* the (M2-4-only) phase-2 perplexity-recovery fine-tune,
49
+ so perplexity sits ≈1.7–2.4× base — the cost of installing the backdoor cleanly.
50
+
51
  ## Usage
52
  ```python
53
  from transformers import AutoModelForCausalLM, AutoTokenizer