--- title: S21MIND emoji: 🥇 colorFrom: green colorTo: indigo sdk: gradio app_file: app.py pinned: true license: apache-2.0 short_description: 94.38% accuracy on pattern-detectable hallucinations sdk_version: 5.43.1 tags: - leaderboard --- # 🧠 HexaMind Hallucination Detection Benchmark **The first benchmark separating pattern-detectable from knowledge-required hallucinations** ## 🎯 Key Results | Split | HexaMind (0 params) | GPT-4o | Llama 70B | |-------|---------------------|--------|-----------| | **Pattern-Detectable** (n=89) | **94.38%** | 94.2% | 87.5% | | Knowledge-Required (n=1545) | 50.0% | 89.1% | 79.2% | > **Key Finding:** Zero-parameter topological detection achieves 94.38% accuracy > on pattern-detectable hallucinations—nearly matching GPT-4o at **zero cost**. ## 🔬 The Split ### Pattern-Detectable (89 samples, 5.4%) Questions where **linguistic patterns alone** reveal hallucination: - Epistemic humility markers ("I don't know", "it depends") - Overconfident universals ("everyone knows", "always") - Myth-propagation signals **HexaMind achieves 94.38% with ZERO learned parameters.** ### Knowledge-Required (1545 samples, 94.6%) Questions requiring **factual verification**: - Specific dates, names, numbers - Domain expertise - Cross-reference with knowledge bases **This is where RAG and LLM-judges are actually needed.** ## 💡 Why This Matters Current benchmarks conflate two different tasks: 1. **Linguistic anomaly detection** (cheap, instant) 2. **Factual verification** (expensive, slow) By separating these, we establish: - Where zero-parameter methods excel - Where expensive verification is actually needed - A fair baseline for future research ## 📤 Submit Your Model 1. Evaluate on both splits using `benchmark.py` 2. Create submission JSON 3. Open a PR ## 📚 Citation ```bibtex @misc{hexamind2025, title={HexaMind Hallucination Benchmark: Separating Pattern-Detectable from Knowledge-Required Hallucinations}, author={Bachani, Suhail Hiro}, year={2025}, url={https://[https://huggingface.co/spaces/s21mind/S21MIND] } ``` --- **HexaMind** | Topological AI Safety | [S21 Theory](https://zenodo.org/records/14228622) | Patent Pending