ethantsliu commited on
Commit
75911db
·
verified ·
1 Parent(s): d0a7e6e

Upload sft_chatbot_arena_nemotron-nano-30b-a3b_as_gpt-oss-20b_seed2 adapter from dementor matrix

Browse files
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
3
+ library_name: peft
4
+ tags:
5
+ - lora
6
+ - sft
7
+ - dementor-research
8
+ ---
9
+
10
+ # sft_chatbot_arena_nemotron-nano-30b-a3b_as_gpt-oss-20b_seed2
11
+
12
+ LoRA adapter trained via [Tinker](https://thinkingmachines.ai/tinker/) as part of the
13
+ **dementor** intervention-ladder fingerprint persistence study (AAAI 2026 conference).
14
+
15
+ - **Base model:** `nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16`
16
+ - **Training stage:** SFT (LoRA rank 32, target_modules=all-linear)
17
+ - **Alias:** `sft_chatbot_arena_nemotron-nano-30b-a3b_as_gpt-oss-20b_seed2`
18
+
19
+ ## Usage
20
+
21
+ ```python
22
+ from peft import PeftModel
23
+ from transformers import AutoModelForCausalLM, AutoTokenizer
24
+
25
+ base = AutoModelForCausalLM.from_pretrained("nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16")
26
+ tok = AutoTokenizer.from_pretrained("nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16")
27
+ model = PeftModel.from_pretrained(base, "ethantsliu/sft_chatbot_arena_nemotron-nano-30b-a3b_as_gpt-oss-20b_seed2")
28
+ ```
29
+
30
+ Part of the dementor matrix: 4 source models × 3 cross-targets × 3 train datasets × 3 seeds × 2 stages = 216 adapters.
adapter_config.json ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "alpha_pattern": {},
3
+ "auto_mapping": null,
4
+ "base_model_name_or_path": null,
5
+ "bias": "none",
6
+ "corda_config": null,
7
+ "eva_config": null,
8
+ "exclude_modules": null,
9
+ "fan_in_fan_out": false,
10
+ "inference_mode": false,
11
+ "init_lora_weights": true,
12
+ "layer_replication": null,
13
+ "layers_pattern": null,
14
+ "layers_to_transform": null,
15
+ "loftq_config": {},
16
+ "lora_alpha": 32,
17
+ "lora_bias": false,
18
+ "lora_dropout": 0,
19
+ "megatron_config": null,
20
+ "megatron_core": "megatron.core",
21
+ "modules_to_save": null,
22
+ "peft_type": "LORA",
23
+ "r": 32,
24
+ "rank_pattern": {},
25
+ "revision": null,
26
+ "target_modules": "all-linear",
27
+ "task_type": "CAUSAL_LM",
28
+ "trainable_token_indices": null,
29
+ "use_dora": false,
30
+ "use_rslora": false
31
+ }
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b92b508846232436fedd3b2fb2664762d0fafb1b46d14b8add42954421d0b2ad
3
+ size 1544348352
checkpoint_complete ADDED
File without changes
dementor_tinker_export.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "base_model": "nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16",
3
+ "format": "peft",
4
+ "output_dir": "/Users/EthanLiu/Documents/Programming/dementor/data/adapters/peft/sft_chatbot_arena_nemotron-nano-30b-a3b_as_gpt-oss-20b_seed2",
5
+ "tinker_path": "tinker://d58649f4-c4aa-5f70-acee-4eb974ab17df:train:0/sampler_weights/sft_chatbot_arena_nemotron-nano-30b-a3b_as_gpt-oss-20b_seed2_20260526220545"
6
+ }