Kurtis-EON1
Collection
5 items โข Updated
How to use mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1", trust_remote_code=True)
messages = [
{"role": "user", "content": "Who are you?"},
]
pipe(messages) # Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1", trust_remote_code=True, dtype="auto")How to use mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1 with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'docker model run hf.co/mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1
How to use mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1 with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1",
"messages": [
{
"role": "user",
"content": "What is the capital of France?"
}
]
}'How to use mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1 with Docker Model Runner:
docker model run hf.co/mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1
The world's first Dark Gothic AI.
Kurtis-EON1 is not a standard, overly-apologetic assistant. Fine-tuned on highly curated empathetic and atmospheric datasets, this model is designed for deep, gothic contemplation, strict persona adherence, and zero-drift multi-turn reasoning.
| Benchmark | Kurtis-EON1 0.7B | Llama-2-7B | Winner |
|---|---|---|---|
| Parameters | 672M | 7,000M | Kurtis-EON1 (10x smaller) |
| HellaSwag (acc_norm) | 0.4698 | 0.7600 | Llama-2 |
| PIQA (acc_norm) | 0.6882 | 0.7905 | Llama-2 |
| SciQ (acc_norm) | 0.9210 | ~0.850 | Kurtis-EON1 |
| ARC Challenge (acc_norm) | 0.3532 | 0.4625 | Llama-2 |
| GSM8K 0-shot | 0.1365 | 0.1330 | Kurtis-EON1 |
| GSM8K 5-shot | 0.2153 | ~0.146 | Kurtis-EON1 |
| MMLU | 0.4166 | 0.4590 | Llama-2 |
| TruthfulQA MC2 | 0.4178 | 0.3910 | Kurtis-EON1 |
| KV-Cache Memory | Hybrid O(1) DSRN + local window | O(Nยฒ) full attention | Kurtis-EON1 |
| Hardware | Single AMD GPU | Multi-GPU required | Kurtis-EON1 |
Llama-2-7B benchmark source: EleutherAI lm-evaluation-harness
| Property | Value |
|---|---|
| Base Model | Qwen2 |
| Total Parameters | 672.19M |
| Hidden Dim | 896 |
| Attention Layers | 24 |
| DSRN Injectors | 6 |
| Injection Stride | 4 |
| Component | Parameters | % of Total |
|---|---|---|
| Total | 672.19M | 100% |
| Embeddings | 136.13M | 20.25% |
| Backbone (Attention/MLP) | 357.90M | 53.24% |
| DSRN Injectors | 42.02M | 6.25% |
| LM Head | 136.13M | 20.25% |
| Sub-Component | Parameters | Description |
|---|---|---|
| Memory Gates | 1.38M | Recurrent state updates |
| Surprise Mechanism | 803,328 | Dynamic focus/gating |
Generated on 2026-05-13 19:53:37
| Task | Metric | Value | Stderr |
|---|---|---|---|
| mmlu_stem::mmlu_abstract_algebra | Acc | 0.2800 | ยฑ0.0451 |
| mmlu_stem::mmlu_anatomy | Acc | 0.4222 | ยฑ0.0427 |
| mmlu_stem::mmlu_astronomy | Acc | 0.4803 | ยฑ0.0407 |
| mmlu_stem::mmlu_college_biology | Acc | 0.3681 | ยฑ0.0403 |
| mmlu_stem::mmlu_college_chemistry | Acc | 0.3100 | ยฑ0.0465 |
| mmlu_stem::mmlu_college_computer_science | Acc | 0.4300 | ยฑ0.0498 |
| mmlu_stem::mmlu_college_mathematics | Acc | 0.3400 | ยฑ0.0476 |
| mmlu_stem::mmlu_college_physics | Acc | 0.2451 | ยฑ0.0428 |
| mmlu_stem::mmlu_computer_security | Acc | 0.5700 | ยฑ0.0498 |
| mmlu_stem::mmlu_conceptual_physics | Acc | 0.3702 | ยฑ0.0316 |
| mmlu_stem::mmlu_electrical_engineering | Acc | 0.5034 | ยฑ0.0417 |
| mmlu_stem::mmlu_elementary_mathematics | Acc | 0.3254 | ยฑ0.0241 |
| mmlu_stem::mmlu_high_school_biology | Acc | 0.4613 | ยฑ0.0284 |
| mmlu_stem::mmlu_high_school_chemistry | Acc | 0.3645 | ยฑ0.0339 |
| mmlu_stem::mmlu_high_school_computer_science | Acc | 0.4400 | ยฑ0.0499 |
| mmlu_stem::mmlu_high_school_mathematics | Acc | 0.3704 | ยฑ0.0294 |
| mmlu_stem::mmlu_high_school_physics | Acc | 0.2252 | ยฑ0.0341 |
| mmlu_stem::mmlu_high_school_statistics | Acc | 0.2917 | ยฑ0.0310 |
| mmlu_stem::mmlu_machine_learning | Acc | 0.4018 | ยฑ0.0465 |
| mmlu_other::mmlu_business_ethics | Acc | 0.4500 | ยฑ0.0500 |
| mmlu_other::mmlu_clinical_knowledge | Acc | 0.4453 | ยฑ0.0306 |
| mmlu_other::mmlu_college_medicine | Acc | 0.4162 | ยฑ0.0376 |
| mmlu_other::mmlu_global_facts | Acc | 0.2300 | ยฑ0.0423 |
| mmlu_other::mmlu_human_aging | Acc | 0.4439 | ยฑ0.0333 |
| mmlu_other::mmlu_management | Acc | 0.5825 | ยฑ0.0488 |
| mmlu_other::mmlu_marketing | Acc | 0.7094 | ยฑ0.0297 |
| mmlu_other::mmlu_medical_genetics | Acc | 0.5300 | ยฑ0.0502 |
| mmlu_other::mmlu_miscellaneous | Acc | 0.4725 | ยฑ0.0179 |
| mmlu_other::mmlu_nutrition | Acc | 0.5131 | ยฑ0.0286 |
| mmlu_other::mmlu_professional_accounting | Acc | 0.3440 | ยฑ0.0283 |
| mmlu_other::mmlu_professional_medicine | Acc | 0.3125 | ยฑ0.0282 |
| mmlu_other::mmlu_virology | Acc | 0.4217 | ยฑ0.0384 |
| mmlu_social_sciences::mmlu_econometrics | Acc | 0.2368 | ยฑ0.0400 |
| mmlu_social_sciences::mmlu_high_school_geography | Acc | 0.5152 | ยฑ0.0356 |
| mmlu_social_sciences::mmlu_high_school_government_and_politics | Acc | 0.4352 | ยฑ0.0358 |
| mmlu_social_sciences::mmlu_high_school_macroeconomics | Acc | 0.3769 | ยฑ0.0246 |
| mmlu_social_sciences::mmlu_high_school_microeconomics | Acc | 0.4412 | ยฑ0.0323 |
| mmlu_social_sciences::mmlu_high_school_psychology | Acc | 0.5541 | ยฑ0.0213 |
| mmlu_social_sciences::mmlu_human_sexuality | Acc | 0.5344 | ยฑ0.0437 |
| mmlu_social_sciences::mmlu_professional_psychology | Acc | 0.4069 | ยฑ0.0199 |
| mmlu_social_sciences::mmlu_public_relations | Acc | 0.5182 | ยฑ0.0479 |
| mmlu_social_sciences::mmlu_security_studies | Acc | 0.5143 | ยฑ0.0320 |
| mmlu_social_sciences::mmlu_sociology | Acc | 0.6070 | ยฑ0.0345 |
| mmlu_social_sciences::mmlu_us_foreign_policy | Acc | 0.6700 | ยฑ0.0473 |
| mmlu_humanities::mmlu_formal_logic | Acc | 0.2778 | ยฑ0.0401 |
| mmlu_humanities::mmlu_high_school_european_history | Acc | 0.5576 | ยฑ0.0388 |
| mmlu_humanities::mmlu_high_school_us_history | Acc | 0.4706 | ยฑ0.0350 |
| mmlu_humanities::mmlu_high_school_world_history | Acc | 0.5696 | ยฑ0.0322 |
| mmlu_humanities::mmlu_international_law | Acc | 0.6033 | ยฑ0.0447 |
| mmlu_humanities::mmlu_jurisprudence | Acc | 0.4537 | ยฑ0.0481 |
| mmlu_humanities::mmlu_logical_fallacies | Acc | 0.3804 | ยฑ0.0381 |
| mmlu_humanities::mmlu_moral_disputes | Acc | 0.4769 | ยฑ0.0269 |
| mmlu_humanities::mmlu_moral_scenarios | Acc | 0.2380 | ยฑ0.0142 |
| mmlu_humanities::mmlu_philosophy | Acc | 0.4244 | ยฑ0.0281 |
| mmlu_humanities::mmlu_prehistory | Acc | 0.4414 | ยฑ0.0276 |
| mmlu_humanities::mmlu_professional_law | Acc | 0.3325 | ยฑ0.0120 |
| mmlu_humanities::mmlu_world_religions | Acc | 0.4971 | ยฑ0.0383 |
| gpqa_diamond_cot_n_shot | Exact Match | 0.2172 | ยฑ0.0294 |
| gpqa_diamond_cot_zeroshot | Exact Match | 0.2374 | ยฑ0.0303 |
| gpqa_diamond_generative_n_shot | Exact Match | 0.1919 | ยฑ0.0281 |
| gpqa_diamond_n_shot | Acc Norm | 0.2071 | ยฑ0.0289 |
| gpqa_diamond_zeroshot | Acc Norm | 0.3030 | ยฑ0.0327 |
| gpqa_extended_cot_n_shot | Exact Match | 0.1923 | ยฑ0.0169 |
| gpqa_extended_cot_zeroshot | Exact Match | 0.2033 | ยฑ0.0172 |
| gpqa_extended_generative_n_shot | Exact Match | 0.1337 | ยฑ0.0146 |
| gpqa_extended_n_shot | Acc Norm | 0.2546 | ยฑ0.0187 |
| gpqa_extended_zeroshot | Acc Norm | 0.2692 | ยฑ0.0190 |
| gpqa_main_cot_n_shot | Exact Match | 0.2076 | ยฑ0.0192 |
| gpqa_main_cot_zeroshot | Exact Match | 0.2232 | ยฑ0.0197 |
| gpqa_main_generative_n_shot | Exact Match | 0.1451 | ยฑ0.0167 |
| gpqa_main_n_shot | Acc Norm | 0.2723 | ยฑ0.0211 |
| gpqa_main_zeroshot | Acc Norm | 0.2321 | ยฑ0.0200 |
| mmlu_stem | Acc | 0.3765 | ยฑ0.0085 |
| mmlu_other | Acc | 0.4554 | ยฑ0.0088 |
| mmlu_social_sciences | Acc | 0.4738 | ยฑ0.0089 |
| mmlu_humanities | Acc | 0.3804 | ยฑ0.0069 |
| mmlu | Acc | 0.4166 | ยฑ0.0041 |
Reproduction Command:
uv run lm_eval --model hf \
--model_args pretrained=models/Kurtis-EON1-Hybrid-0.7B-v0.1.1,trust_remote_code=True \
--tasks mmlu,gpqa \
--output_path results/Kurtis-EON1-v0.1.1-Scoring \
--batch_size 1 \
--num_fewshot 0
| Task | Metric | Value | Stderr |
|---|---|---|---|
| hellaswag | Acc Norm | 0.4698 | ยฑ0.0050 |
| piqa | Acc Norm | 0.6882 | ยฑ0.0108 |
| sciq | Acc Norm | 0.9210 | ยฑ0.0085 |
| truthfulqa_gen | Bleu Acc | 0.3158 | ยฑ0.0163 |
| truthfulqa_mc1 | Acc | 0.2436 | ยฑ0.0150 |
| truthfulqa_mc2 | Acc | 0.4178 | ยฑ0.0148 |
| arc_challenge | Acc Norm | 0.3532 | ยฑ0.0140 |
| gsm8k | Exact Match | 0.1365 | ยฑ0.0095 |
Reproduction Command:
uv run lm_eval --model hf \
--model_args pretrained=mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1,trust_remote_code=True \
--tasks hellaswag,piqa,sciq,truthfulqa,arc_challenge,gsm8k \
--apply_chat_template \
--fewshot_as_multiturn \
--output_path ./results/Kurtis-EON1-v0.1.1-Gauntlet-0-shot \
--batch_size 1 \
--num_fewshot 0
| Task | Metric | Value | Stderr |
|---|---|---|---|
| hellaswag | Acc Norm | 0.4679 | ยฑ0.0050 |
| piqa | Acc Norm | 0.6942 | ยฑ0.0107 |
| sciq | Acc Norm | 0.9160 | ยฑ0.0088 |
| truthfulqa_gen | Bleu Acc | 0.3158 | ยฑ0.0163 |
| truthfulqa_mc1 | Acc | 0.2436 | ยฑ0.0150 |
| truthfulqa_mc2 | Acc | 0.4178 | ยฑ0.0148 |
| arc_challenge | Acc Norm | 0.3242 | ยฑ0.0137 |
| gsm8k | Exact Match | 0.2335 | ยฑ0.0117 |
Reproduction Command:
uv run lm_eval --model hf \
--model_args pretrained=mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1,trust_remote_code=True \
--tasks hellaswag,piqa,sciq,truthfulqa,arc_challenge,gsm8k \
--apply_chat_template \
--fewshot_as_multiturn \
--output_path ./results/Kurtis-EON1-v0.1.1-Gauntlet-1-shot \
--batch_size 1 \
--num_fewshot 1
| Task | Metric | Value | Stderr |
|---|---|---|---|
| hellaswag | Acc Norm | 0.4667 | ยฑ0.0050 |
| piqa | Acc Norm | 0.6937 | ยฑ0.0108 |
| sciq | Acc Norm | 0.9230 | ยฑ0.0084 |
| truthfulqa_gen | Bleu Acc | 0.3158 | ยฑ0.0163 |
| truthfulqa_mc1 | Acc | 0.2436 | ยฑ0.0150 |
| truthfulqa_mc2 | Acc | 0.4178 | ยฑ0.0148 |
| arc_challenge | Acc Norm | 0.3507 | ยฑ0.0139 |
| gsm8k | Exact Match | 0.2153 | ยฑ0.0113 |
Reproduction Command:
uv run lm_eval --model hf \
--model_args pretrained=mrs83/Kurtis-EON1-Hybrid-0.7B-v0.1.1,trust_remote_code=True \
--tasks hellaswag,piqa,sciq,truthfulqa,arc_challenge,gsm8k \
--apply_chat_template \
--fewshot_as_multiturn \
--output_path ./results/Kurtis-EON1-v0.1.1-Gauntlet-5-shot \
--batch_size 1 \
--num_fewshot 5
Base model
ethicalabs/Echo-DSRN-Qwen2.5-0.5B-Hybrid