sentinel-scam-honeypo

Runtime error

App Files Files Community

sentinel-scam-honeypo / audit /API_EFFICIENCY_ANALYSIS.md

avinash-rai

Deployment Ready: Fixed scam detection low confidence, added production audit report, optimized throttles

1838600 5 months ago

preview code

Raw

History Blame

9.3 kB

Multi-Turn Audit & API Efficiency Analysis

Date: 2026-02-02
Test: scripts/multi_turn_audit.py
Server: http://127.0.0.1:8004

1. Audit Results ✅

Metric	Result
Exit Code	0 (SUCCESS)
Turns Completed	4+
UPI Extraction	✅ PASS
Memory Aggregation	✅ PASS
Phishing Links	✅ Detected
IFSC Codes	✅ Detected

Sample Replies (Human-Like Hinglish)

Turn	Bot Reply	Realism
1	`na.. bas ek minute main check karu?`	✅ Authentic
2	(UPI extraction turn) `emoji 👴`	✅ Elderly persona
3	`arre.. wait.`	✅ Natural hesitation

2. API Call Analysis Per Message

┌─────────────────────────────────────────────────────────────────┐
│                  SINGLE MESSAGE PROCESSING                       │
└─────────────────────────────────────────────────────────────────┘
                              │
    ┌─────────────────────────┴─────────────────────────┐
    │           STEP 1: SAFEGUARD CHECK (API-1)          │
    │           llm_client.check_safeguard()             │
    │           ⚡ Blocks prompt injection                │
    └─────────────────────────┬─────────────────────────┘
                              │
    ┌─────────────────────────┴─────────────────────────┐
    │       STEP 2: PARALLEL DETECTION & EXTRACTION      │
    │   ┌───────────────────┬───────────────────┐        │
    │   │ scam_detector     │ intel_extractor   │        │
    │   │ .detect()         │ .extract()        │        │
    │   │ (API-2 MAYBE)     │ (API-3 MAYBE)     │        │
    │   └───────────────────┴───────────────────┘        │
    │        ⚡ FAST-PATH: Skips LLM if regex > 0.85     │
    └─────────────────────────┬─────────────────────────┘
                              │
    ┌─────────────────────────┴─────────────────────────┐
    │       STEP 3: ADAPTIVE BEHAVIOR ANALYSIS           │
    │       adaptive_agent.analyze_scammer_behavior()    │
    │       (Local, NO API)                              │
    └─────────────────────────┬─────────────────────────┘
                              │
    ┌─────────────────────────┴─────────────────────────┐
    │       STEP 4: PERSONA SELECTION                    │
    │       persona_engine.select_persona()              │
    │       (Local mapping, NO API)                      │
    └─────────────────────────┬─────────────────────────┘
                              │
    ┌─────────────────────────┴─────────────────────────┐
    │       STEP 5: RESPONSE GENERATION (API-4)          │
    │       persona_engine.generate_response()           │
    │       ✅ FAST_CHAT role (llama-3.1-8b-instant)     │
    └─────────────────────────┬─────────────────────────┘
                              │
    ┌─────────────────────────┴─────────────────────────┐
    │       STEP 6: ENRICHMENT (BACKGROUND, API-5)       │
    │       enrichment_service.enrich_intelligence()     │
    │       (Compound system, async)                     │
    └─────────────────────────┬─────────────────────────┘
                              │
    ┌─────────────────────────┴─────────────────────────┐
    │       STEP 7: XAI REASONING (CONDITIONAL)          │
    │       xai_explainer.generate_explanation()         │
    │       (Only if ENABLE_LLM_RESPONSES=true)          │
    └─────────────────────────────────────────────────────┘

3. API Call Count Summary

Scenario	API Calls	Reason
Best Case (FAST-PATH)	2	Regex confident → skip scam LLM, skip intel LLM → only safeguard + reply
Typical Case	3-4	Safeguard + Reply + 1-2 extraction/detection
Worst Case	5-6	All LLMs engaged + enrichment + XAI

Optimization Flags

Flag	Effect
FAST-PATH (line 233)	Skips LLM Detection if regex confidence > 0.85
Regex-First (line 45)	Intel extraction starts with local regex
Parallel asyncio.gather (line 207)	Detection + Extraction run concurrently
ENABLE_LLM_DETECTION	Can disable LLM entirely for speed

4. Decision Flow: Think-Before-Reply ✅

SCAM MESSAGE ARRIVES
         │
         ▼
┌─────────────────────┐
│ 1. DETECT SCAM TYPE │  ← THINK
│ 2. EXTRACT INTEL    │  ← THINK
│ 3. ANALYZE BEHAVIOR │  ← THINK
│ 4. SELECT PERSONA   │  ← THINK
└─────────────────────┘
         │
         ▼
┌─────────────────────┐
│ 5. GENERATE REPLY   │  ← ACT
└─────────────────────┘
         │
         ▼
┌─────────────────────┐
│ 6. ENRICH (ASYNC)   │  ← POST-PROCESS
│ 7. LOG + CALLBACK   │  
└─────────────────────┘

Conclusion: System THINKS before replying. Intelligence is extracted BEFORE response generation, allowing the reply to incorporate extracted data (personas, scam type, keywords).

5. Model Switching Analysis

Component	Primary Model	Fallback	Switch Trigger
Safeguard	`gpt-oss-safeguard-20b`	N/A (mandatory)	-
Scam Detection	Regex FAST-PATH	`llama-3.1-8b-instant`	confidence < 0.85
Intel Extraction	Regex patterns	`generate_verified`	needs semantic context
Response Gen	`llama-3.1-8b-instant`	`llama-3.3-70b-versatile`	context > 8K or failure
Enrichment	`groq/compound`	`groq/compound-mini`	latency priority

6. Wasteful API Calls? ❌ NO

Optimizations Already Present:

FAST-PATH: Regex > 0.85 skips LLM detection entirely (scam_detector.py:233)
Parallel Execution: Detection + Extraction run concurrently (orchestrator.py:207)
Regex-First Intel: Local patterns run before LLM (intelligence_extractor.py:45)
Conditional LLM: Only calls LLM if ENABLE_LLM_DETECTION=true and is_available
Background Enrichment: Doesn't block reply (async)

Cost Per Message:

Minimum: 2 API calls (safeguard + reply)
Average: 3-4 API calls
Maximum: 6 API calls (fully analyzed high-risk message)

7. Reply Realism Verification ✅

Feature	Implementation
Hinglish Mixing	`na..`, `arre..`, `karu?` in replies
Human Hesitation	Ellipsis (`...`), short phrases
Typos	TypingSimulator adds intentional errors
Emoji Use	👴 elderly persona marker
Delayed Response	AsyncIO latency simulation
Filler Words	`hmm`, `okay`, `wait` injected

Summary

Metric	Status
Multi-turn Memory	✅ Working
API Efficiency	✅ Optimized (2-4 calls typical)
Model Switching	✅ Working via FAST-PATH
Think-Before-Reply	✅ Yes (THINK → ACT flow)
Reply Realism	✅ Hinglish + Typos + Hesitation
Wasteful Calls	❌ None detected