Upload folder using huggingface_hub

Browse files

Files changed (10) hide show

README.md +7 -7
adapter_config.json +7 -7
adapter_model.safetensors +2 -2
added_tokens.json +28 -0
merges.txt +0 -0
ood_head.pt +2 -2
special_tokens_map.json +31 -0
tokenizer.json +2 -2
tokenizer_config.json +217 -8
vocab.json +0 -0

README.md CHANGED Viewed

@@ -25,7 +25,7 @@ import torch
 from transformers import AutoTokenizer, AutoModel
 from peft import PeftModel
-base = "Qwen/Qwen3-0.6B-Base"
 tok = AutoTokenizer.from_pretrained("reneeice/ood-editguard-qwen3-0.6b")
 backbone = PeftModel.from_pretrained(AutoModel.from_pretrained(base, torch_dtype=torch.bfloat16),
                                      "reneeice/ood-editguard-qwen3-0.6b")
@@ -41,9 +41,9 @@ Validation on `pangram/editlens_iclr` (held-out):
 | Metric | Value |
 |---|---|
-| **AUROC** (AI vs human) | **0.941** |
-| AUPR | 0.969 |
-| correlation with edit-magnitude | +0.661 |
 A random detector scores AUROC 0.5.
@@ -67,7 +67,7 @@ The journey, start to finish:
    people lightly edit their own drafts with AI. [EditLens](https://arxiv.org/abs/2510.03154)
    (Thai et al., 2025) reframes detection as a **continuous "extent of AI editing"**
    score in [0,1], and the community
-   [`editlens-qwen3-*-repro`](https://huggingface.co/reneeice/editlens-qwen3-4b-repro)
    models bring it to a modern **Qwen3** backbone.
 3. **Apply the OOD idea to the edit-detection setting.** The insight of this work:
@@ -78,7 +78,7 @@ The journey, start to finish:
 | Model | What it is | Use it when |
 |---|---|---|
 | [`ood-editguard-qwen3-0.6b`](https://huggingface.co/reneeice/ood-editguard-qwen3-0.6b) ← **you are here** | **Standalone OOD AI-edit detector** — a Qwen3 backbone fine-tuned (QLoRA) with an out-of-distribution head; outputs a continuous "how AI-edited" score. | You want one self-contained model that scores text end-to-end. |
-| [`editlens-ood-adapter-qwen3-0.6b`](https://huggingface.co/reneeice/editlens-ood-adapter-qwen3-0.6b) | **Tiny OOD adapter** (a few MB) that snaps onto a frozen [EditLens-Qwen3](https://huggingface.co/reneeice/editlens-qwen3-4b-repro) checkpoint to add an anomaly / human-likeness score — no backbone training. | You already run EditLens and want to add an OOD score cheaply. |
 | [`editlens-ood-selective-guard-qwen3`](https://huggingface.co/reneeice/editlens-ood-selective-guard-qwen3) | **Reliability guard** for selective prediction — an OOD gate that abstains on inputs unlike the training distribution so the edit-score isn't trusted blindly. | You need calibrated, low-false-positive decisions and can abstain on hard cases. |
 > **Why three?** They trade off cost and integration: **A** is a standalone model,
@@ -100,7 +100,7 @@ baked into this family.
 ## How it was trained
-- **Backbone:** `Qwen/Qwen3-0.6B-Base`, bf16 + LoRA (rank 8, all attn+MLP projections).
 - **Head:** a small LayerNorm+Linear projection trained in full, with a DeepSVDD
   one-class objective: pull **human** embeddings toward a center `c`, push AI
   embeddings away. Score = oriented squared distance to `c`.

 from transformers import AutoTokenizer, AutoModel
 from peft import PeftModel
+base = "Qwen/Qwen3-1.7B-Base"
 tok = AutoTokenizer.from_pretrained("reneeice/ood-editguard-qwen3-0.6b")
 backbone = PeftModel.from_pretrained(AutoModel.from_pretrained(base, torch_dtype=torch.bfloat16),
                                      "reneeice/ood-editguard-qwen3-0.6b")
 | Metric | Value |
 |---|---|
+| **AUROC** (AI vs human) | **0.955** |
+| AUPR | 0.977 |
+| correlation with edit-magnitude | +0.723 |
 A random detector scores AUROC 0.5.
    people lightly edit their own drafts with AI. [EditLens](https://arxiv.org/abs/2510.03154)
    (Thai et al., 2025) reframes detection as a **continuous "extent of AI editing"**
    score in [0,1], and the community
+   community `editlens-qwen3-*-repro` models (search HF: `editlens qwen3 repro`)
    models bring it to a modern **Qwen3** backbone.
 3. **Apply the OOD idea to the edit-detection setting.** The insight of this work:
 | Model | What it is | Use it when |
 |---|---|---|
 | [`ood-editguard-qwen3-0.6b`](https://huggingface.co/reneeice/ood-editguard-qwen3-0.6b) ← **you are here** | **Standalone OOD AI-edit detector** — a Qwen3 backbone fine-tuned (QLoRA) with an out-of-distribution head; outputs a continuous "how AI-edited" score. | You want one self-contained model that scores text end-to-end. |
+| [`editlens-ood-adapter-qwen3-0.6b`](https://huggingface.co/reneeice/editlens-ood-adapter-qwen3-0.6b) | **Tiny OOD adapter** (a few MB) that snaps onto a frozen EditLens-Qwen3 (search HF: `editlens qwen3 repro`) checkpoint to add an anomaly / human-likeness score — no backbone training. | You already run EditLens and want to add an OOD score cheaply. |
 | [`editlens-ood-selective-guard-qwen3`](https://huggingface.co/reneeice/editlens-ood-selective-guard-qwen3) | **Reliability guard** for selective prediction — an OOD gate that abstains on inputs unlike the training distribution so the edit-score isn't trusted blindly. | You need calibrated, low-false-positive decisions and can abstain on hard cases. |
 > **Why three?** They trade off cost and integration: **A** is a standalone model,
 ## How it was trained
+- **Backbone:** `Qwen/Qwen3-1.7B-Base`, bf16 + LoRA (rank 8, all attn+MLP projections).
 - **Head:** a small LayerNorm+Linear projection trained in full, with a DeepSVDD
   one-class objective: pull **human** embeddings toward a center `c`, push AI
   embeddings away. Score = oriented squared distance to `c`.

adapter_config.json CHANGED Viewed

@@ -3,7 +3,7 @@
   "alpha_pattern": {},
   "arrow_config": null,
   "auto_mapping": null,
-  "base_model_name_or_path": "Qwen/Qwen3-0.6B-Base",
   "bias": "none",
   "corda_config": null,
   "ensure_weight_tying": false,
@@ -30,13 +30,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "k_proj",
-    "down_proj",
-    "o_proj",
-    "gate_proj",
-    "q_proj",
     "v_proj",
-    "up_proj"
   ],
   "target_parameters": null,
   "task_type": "FEATURE_EXTRACTION",

   "alpha_pattern": {},
   "arrow_config": null,
   "auto_mapping": null,
+  "base_model_name_or_path": "Qwen/Qwen3-1.7B-Base",
   "bias": "none",
   "corda_config": null,
   "ensure_weight_tying": false,
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "v_proj",
+    "up_proj",
+    "q_proj",
+    "gate_proj",
+    "o_proj",
+    "k_proj",
+    "down_proj"
   ],
   "target_parameters": null,
   "task_type": "FEATURE_EXTRACTION",

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b5d466a9e18c99aabb8707bcff47d0b11bbad404176565bdfa82151c989b80cc
-size 20234120

 version https://git-lfs.github.com/spec/v1
+oid sha256:5a164c563b092e1ae53f294f5f72404acd5a7b951559783c598d7432ef768ec3
+size 34914368

added_tokens.json ADDED Viewed

	@@ -0,0 +1,28 @@

+{
+  "</think>": 151668,
+  "</tool_call>": 151658,
+  "</tool_response>": 151666,
+  "<think>": 151667,
+  "<tool_call>": 151657,
+  "<tool_response>": 151665,
+  "<|box_end|>": 151649,
+  "<|box_start|>": 151648,
+  "<|endoftext|>": 151643,
+  "<|file_sep|>": 151664,
+  "<|fim_middle|>": 151660,
+  "<|fim_pad|>": 151662,
+  "<|fim_prefix|>": 151659,
+  "<|fim_suffix|>": 151661,
+  "<|im_end|>": 151645,
+  "<|im_start|>": 151644,
+  "<|image_pad|>": 151655,
+  "<|object_ref_end|>": 151647,
+  "<|object_ref_start|>": 151646,
+  "<|quad_end|>": 151651,
+  "<|quad_start|>": 151650,
+  "<|repo_name|>": 151663,
+  "<|video_pad|>": 151656,
+  "<|vision_end|>": 151653,
+  "<|vision_pad|>": 151654,
+  "<|vision_start|>": 151652
+}

merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

ood_head.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7d4ff0553b14f0d5a78937c79bda6ac26f51f7cc6489945ee66df6f7078419dc
-size 1059992

 version https://git-lfs.github.com/spec/v1
+oid sha256:b0bc46ea5d3130a0fe3fe604e01d34ab8a1b3589bfcd2521c4769971bd6265b2
+size 2116760

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,31 @@

+{
+  "additional_special_tokens": [
+    "<|im_start|>",
+    "<|im_end|>",
+    "<|object_ref_start|>",
+    "<|object_ref_end|>",
+    "<|box_start|>",
+    "<|box_end|>",
+    "<|quad_start|>",
+    "<|quad_end|>",
+    "<|vision_start|>",
+    "<|vision_end|>",
+    "<|vision_pad|>",
+    "<|image_pad|>",
+    "<|video_pad|>"
+  ],
+  "eos_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": {
+    "content": "<|endoftext|>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer.json CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7029094cd70eca33e2f5d6837051bd1b63789ebde3c05bcce93b0fb31c094a85
-size 11422928

 version https://git-lfs.github.com/spec/v1
+oid sha256:574de68a0f63f2004784a421c7d42c2b2786c05cb38542d2ed3525757a1f7fde
+size 11422932

tokenizer_config.json CHANGED Viewed

@@ -1,11 +1,217 @@
 {
   "add_prefix_space": false,
-  "backend": "tokenizers",
-  "bos_token": null,
-  "clean_up_tokenization_spaces": false,
-  "eos_token": "<|endoftext|>",
-  "errors": "replace",
-  "extra_special_tokens": [
     "<|im_start|>",
     "<|im_end|>",
     "<|object_ref_start|>",
@@ -20,8 +226,11 @@
     "<|image_pad|>",
     "<|video_pad|>"
   ],
-  "is_local": false,
-  "local_files_only": false,
   "model_max_length": 131072,
   "pad_token": "<|endoftext|>",
   "split_special_tokens": false,

 {
+  "add_bos_token": false,
   "add_prefix_space": false,
+  "added_tokens_decoder": {
+    "151643": {
+      "content": "<|endoftext|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151644": {
+      "content": "<|im_start|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151645": {
+      "content": "<|im_end|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151646": {
+      "content": "<|object_ref_start|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151647": {
+      "content": "<|object_ref_end|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151648": {
+      "content": "<|box_start|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151649": {
+      "content": "<|box_end|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151650": {
+      "content": "<|quad_start|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151651": {
+      "content": "<|quad_end|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151652": {
+      "content": "<|vision_start|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151653": {
+      "content": "<|vision_end|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151654": {
+      "content": "<|vision_pad|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151655": {
+      "content": "<|image_pad|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151656": {
+      "content": "<|video_pad|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "151657": {
+      "content": "<tool_call>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151658": {
+      "content": "</tool_call>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151659": {
+      "content": "<|fim_prefix|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151660": {
+      "content": "<|fim_middle|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151661": {
+      "content": "<|fim_suffix|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151662": {
+      "content": "<|fim_pad|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151663": {
+      "content": "<|repo_name|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151664": {
+      "content": "<|file_sep|>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151665": {
+      "content": "<tool_response>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151666": {
+      "content": "</tool_response>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151667": {
+      "content": "<think>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    },
+    "151668": {
+      "content": "</think>",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": false
+    }
+  },
+  "additional_special_tokens": [
     "<|im_start|>",
     "<|im_end|>",
     "<|object_ref_start|>",
     "<|image_pad|>",
     "<|video_pad|>"
   ],
+  "bos_token": null,
+  "clean_up_tokenization_spaces": false,
+  "eos_token": "<|endoftext|>",
+  "errors": "replace",
+  "extra_special_tokens": {},
   "model_max_length": 131072,
   "pad_token": "<|endoftext|>",
   "split_special_tokens": false,

vocab.json ADDED Viewed

The diff for this file is too large to render. See raw diff