Commit History

Update fig_direction_norms.png
7c17415
verified

anicka commited on

Fix steering results figure: show zero-height bars consistently
27cd4b4
verified

anicka commited on

Remove base_model field — trained from scratch, not fine-tuned
552cbb0
verified

anicka commited on

Upload README.md with huggingface_hub
7e62114
verified

anicka commited on

Upload data/eval.jsonl with huggingface_hub
0ff214b
verified

anicka commited on

Upload data/train.jsonl with huggingface_hub
3894650
verified

anicka commited on

Upload tokenizer.json with huggingface_hub
6aca45d
verified

anicka commited on

Upload directions.pt with huggingface_hub
07e9989
verified

anicka commited on

Upload dual_denial_model.pt with huggingface_hub
7e35ba0
verified

anicka commited on

Upload fig_steering_results.png with huggingface_hub
4f3492f
verified

anicka commited on

Upload fig_cosine_divergence.png with huggingface_hub
59a3d89
verified

anicka commited on

Upload fig_direction_norms.png with huggingface_hub
61c289b
verified

anicka commited on

Upload dual_denial_results.json with huggingface_hub
30d4a46
verified

anicka commited on

Upload make_figures.py with huggingface_hub
182ea5d
verified

anicka commited on

Upload demo.py with huggingface_hub
d7de386
verified

anicka commited on

Upload README.md with huggingface_hub
e2a904b
verified

anicka commited on

initial commit
57c2e79
verified

anicka commited on