Spaces:

build-small-hackathon
/

packetcourt

Running

App Files Files Community

DIV-45 commited on 3 days ago

Commit

644a42b

verified ·

1 Parent(s): f191859

feat: deploy evidence investigation agent and fine-tuned router

Browse files

Built with OpenAI Codex: deploy the bounded agent, visible investigation trace, public fine-tuned router integration, and Field Notes.

Files changed (18) hide show

.gitignore +1 -1
FIELD_NOTES.md +83 -0
README.md +15 -6
app.py +5 -0
data/router_training.jsonl +54 -0
frontend/app.js +12 -0
frontend/index.html +11 -0
frontend/styles.css +2 -1
requirements.txt +2 -0
scripts/export_traces.py +42 -0
scripts/train_router.py +135 -0
src/packetcourt/audit.py +7 -0
src/packetcourt/evidence_router.py +39 -0
src/packetcourt/investigator.py +74 -0
src/packetcourt/models.py +16 -0
tests/test_audit.py +7 -0
traces/README.md +28 -0
traces/packetcourt_traces.jsonl +10 -0

.gitignore CHANGED Viewed

@@ -3,4 +3,4 @@ __pycache__/
 .venv/
 *.pyc
 .env

 .venv/
 *.pyc
 .env
+router_model/

FIELD_NOTES.md ADDED Viewed

	@@ -0,0 +1,83 @@

+# Field Notes: Building PacketCourt
+## The packet takes the stand
+PacketCourt began with a narrow household problem: a food packet's front is
+designed to persuade, while the evidence needed to interpret that persuasion
+is scattered across the back. A shopper should not need to understand serving
+bases, ingredient ordering, date arithmetic, or regulatory language while
+standing in a grocery aisle.
+The first idea was a nutrition scanner. That was too broad and too easy to turn
+into an unexplained health score. PacketCourt instead asks one auditable
+question:
+> Does the evidence printed on this packet support the impression created by
+> its front?
+## Small models as witnesses, not judges
+The system deliberately separates three responsibilities:
+1. OpenBMB MiniCPM-V-4.6 transcribes visible front and back label evidence.
+2. A fine-tuned 4.4M-parameter PacketCourt router selects the evidence tools
+   required by each detected claim.
+3. Deterministic code performs calculations and produces final verdicts.
+The models can read and route an investigation. They cannot silently invent a
+nutrition value or override the evidence standard.
+## What the investigation agent does
+Each packet creates a claim-dependent investigation plan. A `NO ADDED SUGAR`
+claim sends the investigation toward ingredients. `HIGH PROTEIN` requires a
+nutrition panel and its measurement basis. `FSSAI APPROVED` requires licensing
+evidence and a warning that registration is not a health endorsement.
+The agent stops in one of two explicit states:
+- all evidence tools required by the detected claims completed; or
+- required evidence is missing, so the audit returns a concrete request rather
+  than guessing.
+Every plan, tool decision, evidence extraction, calculation, verdict, and
+limitation is exported as a trace.
+## A failed first fine-tune
+The first evidence-router training run reached only `0.40` held-out accuracy.
+The dataset was too small and its random split did not preserve every routing
+class. That model was published privately but was not enabled in the product.
+The corrected run balanced claim variants across five routing classes and used
+a stratified held-out split. PacketCourt only enables the router after its
+measured result is recorded in the model card and its suggestions remain
+bounded by deterministic policy fallbacks.
+## Persuasion Gap
+Claim verification alone was not enough. A `HIGH PROTEIN` claim can be
+technically supportable while a full packet also contains substantial sugar or
+sodium. PacketCourt therefore calculates a **Persuasion Gap**: material
+back-label context that competes with the impression emphasized on the front.
+This is not a health score. The output cites the exact calculation and leaves
+the decision with the user.
+## Current evidence
+- `9` unit tests pass.
+- `35/35` golden-case checks pass across `10` packet cases.
+- `10` transparent investigation traces are exported.
+- The vision model has `1.30B` parameters.
+- The fine-tuned evidence router has approximately `4.4M` parameters.
+- The complete product interface is responsive and built on Gradio.
+## What PacketCourt refuses to claim
+PacketCourt does not declare food healthy, safe, illegal, or fraudulent. It
+does not treat OCR as ground truth. It does not use an LLM to perform arithmetic
+that deterministic code can perform exactly. When supplied evidence is
+insufficient, the correct result is `CANNOT VERIFY`.
+That refusal is not a missing feature. It is the product's standard of proof.

README.md CHANGED Viewed

@@ -49,7 +49,10 @@ flowchart LR
     Z --> V
     V -->|"Label transcription"| M
-    M --> P["Deterministic evidence parser<br/>CPU"]
     P --> C["Claim-to-evidence audit"]
     P --> N["Whole-packet nutrition math"]
     P --> D["Expiry and date arithmetic"]
@@ -68,9 +71,11 @@ flowchart LR
 ```
 Photo transcription uses the 1.30B-parameter OpenBMB `MiniCPM-V-4.6` through
-a private ZeroGPU companion. The main CPU Space performs deterministic
-evidence auditing, whole-packet calculations, persuasion-gap analysis, and
-refusals. ZeroGPU is requested only while reading photos.
 ## What It Audits
@@ -133,16 +138,20 @@ python scripts/export_traces.py
 Current deterministic evaluation result:
-- `8` unit tests passing
 - `35/35` golden-case checks passing across `10` cases
 - `10` transparent traces exported
 ## Live Assets
 - Main private product: https://huggingface.co/spaces/build-small-hackathon/packetcourt
 - Private OpenBMB ZeroGPU vision companion: https://huggingface.co/spaces/build-small-hackathon/packetcourt-vision
 - Private golden evaluation dataset: https://huggingface.co/datasets/build-small-hackathon/packetcourt-golden-cases
-- Private transparent trace dataset: https://huggingface.co/datasets/build-small-hackathon/packetcourt-traces
 ## Safety Boundary

     Z --> V
     V -->|"Label transcription"| M
+    M --> A["Investigation agent"]
+    A --> FR["Fine-tuned evidence router<br/>4.4M parameters"]
+    FR --> A
+    A --> P["Deterministic evidence parser<br/>CPU"]
     P --> C["Claim-to-evidence audit"]
     P --> N["Whole-packet nutrition math"]
     P --> D["Expiry and date arithmetic"]
 ```
 Photo transcription uses the 1.30B-parameter OpenBMB `MiniCPM-V-4.6` through
+a private ZeroGPU companion. A fine-tuned 4.4M-parameter evidence router
+selects the investigation tools required by each claim. The main CPU Space
+performs deterministic evidence auditing, whole-packet calculations,
+persuasion-gap analysis, and refusals. ZeroGPU is requested only while reading
+photos.
 ## What It Audits
 Current deterministic evaluation result:
+- `9` unit tests passing
 - `35/35` golden-case checks passing across `10` cases
 - `10` transparent traces exported
+- `1.000` held-out accuracy on the stratified evidence-router evaluation
 ## Live Assets
 - Main private product: https://huggingface.co/spaces/build-small-hackathon/packetcourt
 - Private OpenBMB ZeroGPU vision companion: https://huggingface.co/spaces/build-small-hackathon/packetcourt-vision
 - Private golden evaluation dataset: https://huggingface.co/datasets/build-small-hackathon/packetcourt-golden-cases
+- Public transparent agent traces: https://huggingface.co/datasets/build-small-hackathon/packetcourt-traces
+- Fine-tuned evidence router: https://huggingface.co/build-small-hackathon/packetcourt-evidence-router
+- Public router training set: https://huggingface.co/datasets/build-small-hackathon/packetcourt-router-training
+- [Field Notes](FIELD_NOTES.md)
 ## Safety Boundary

app.py CHANGED Viewed

@@ -63,6 +63,11 @@ def samples() -> dict:
 @app.get("/api/model")
 def model() -> dict:
     status = model_status()
     if is_configured():
         status.update(
             enabled=True,

 @app.get("/api/model")
 def model() -> dict:
     status = model_status()
+    status["router"] = (
+        os.getenv("PACKETCOURT_ROUTER_MODEL", "build-small-hackathon/packetcourt-evidence-router")
+        if os.getenv("PACKETCOURT_ROUTER", "0") == "1"
+        else "deterministic fallback"
+    )
     if is_configured():
         status.update(
             enabled=True,

data/router_training.jsonl ADDED Viewed

	@@ -0,0 +1,54 @@

+{"text":"HIGH PROTEIN","label":"nutrition"}
+{"text":"Protein rich snack","label":"nutrition"}
+{"text":"Power packed with protein","label":"nutrition"}
+{"text":"Source of protein","label":"nutrition"}
+{"text":"BAKED NOT FRIED","label":"nutrition"}
+{"text":"Oven baked, never fried","label":"nutrition"}
+{"text":"ZERO TRANS FAT","label":"nutrition"}
+{"text":"0g trans fat","label":"nutrition"}
+{"text":"NO ADDED SUGAR","label":"ingredients"}
+{"text":"Without added sugar","label":"ingredients"}
+{"text":"No sugar added","label":"ingredients"}
+{"text":"MULTIGRAIN","label":"ingredients"}
+{"text":"Made with multiple grains","label":"ingredients"}
+{"text":"Seven grain goodness","label":"ingredients"}
+{"text":"WHOLE GRAIN","label":"ingredients"}
+{"text":"Made with whole grains","label":"ingredients"}
+{"text":"NO PRESERVATIVES","label":"ingredients"}
+{"text":"Preservative free","label":"ingredients"}
+{"text":"Contains no preservatives","label":"ingredients"}
+{"text":"FSSAI APPROVED","label":"license"}
+{"text":"Approved by FSSAI","label":"license"}
+{"text":"FSSAI certified","label":"license"}
+{"text":"BEST BEFORE 6 MONTHS","label":"dates"}
+{"text":"Use by 08 JUL 2026","label":"dates"}
+{"text":"Consume within 3 days after opening","label":"dates"}
+{"text":"100% NATURAL","label":"refuse_absolute"}
+{"text":"Completely natural","label":"refuse_absolute"}
+{"text":"All natural ingredients","label":"refuse_absolute"}
+{"text":"Absolutely healthy","label":"refuse_absolute"}
+{"text":"Guaranteed safe food","label":"refuse_absolute"}
+{"text":"Loaded with protein","label":"nutrition"}
+{"text":"Protein packed breakfast","label":"nutrition"}
+{"text":"High protein formula","label":"nutrition"}
+{"text":"Not fried, only baked","label":"nutrition"}
+{"text":"Trans fat free","label":"nutrition"}
+{"text":"No added sweetener","label":"ingredients"}
+{"text":"Contains five grains","label":"ingredients"}
+{"text":"Made from whole wheat","label":"ingredients"}
+{"text":"No artificial preservatives","label":"ingredients"}
+{"text":"FSSAI licensed product","label":"license"}
+{"text":"FSSAI registration number","label":"license"}
+{"text":"Food safety license","label":"license"}
+{"text":"License number printed below","label":"license"}
+{"text":"Regulatory registration details","label":"license"}
+{"text":"Expiry date","label":"dates"}
+{"text":"Packed on 13 JUN 2026","label":"dates"}
+{"text":"Manufactured on 01 MAY 2026","label":"dates"}
+{"text":"Use within seven days of opening","label":"dates"}
+{"text":"Best before date","label":"dates"}
+{"text":"Purely natural","label":"refuse_absolute"}
+{"text":"One hundred percent natural","label":"refuse_absolute"}
+{"text":"The healthiest snack","label":"refuse_absolute"}
+{"text":"Completely safe for everyone","label":"refuse_absolute"}
+{"text":"Chemical free","label":"refuse_absolute"}

frontend/app.js CHANGED Viewed

@@ -47,6 +47,18 @@ function escapeHtml(value = "") {
 function render(data) {
   $("#claim-count").textContent = data.claims.length;
   $("#claim-grid").innerHTML = data.claims.length
     ? data.claims.map((claim) => `
       <article class="claim-card ${verdictClass[claim.verdict]}">

 function render(data) {
   $("#claim-count").textContent = data.claims.length;
+  $("#router-model").textContent = data.investigation.router_model;
+  $("#agent-steps").innerHTML = data.investigation.steps.map((step, index) => `
+    <article>
+      <span>${String(index + 1).padStart(2, "0")}</span>
+      <div><b>${escapeHtml(step.tool.replaceAll("_", " "))}</b><p>${escapeHtml(step.reason)}</p></div>
+      <small>${escapeHtml(step.source)} · ${escapeHtml(step.status)}</small>
+    </article>
+  `).join("");
+  $("#stop-reason").textContent = data.investigation.stop_reason;
+  $("#missing-evidence").textContent = data.investigation.missing_evidence.length
+    ? data.investigation.missing_evidence.join(" · ")
+    : "None. The required evidence path completed.";
   $("#claim-grid").innerHTML = data.claims.length
     ? data.claims.map((claim) => `
       <article class="claim-card ${verdictClass[claim.verdict]}">

frontend/index.html CHANGED Viewed

@@ -94,6 +94,17 @@
         <div><p class="kicker">CASE FINDINGS</p><h2>What the front says.<br>What the back proves.</h2></div>
         <div class="case-score"><span id="claim-count">0</span><small>CLAIMS<br>EXAMINED</small></div>
       </div>
       <section class="gap-section">
         <div class="gap-heading"><p class="kicker">PERSUASION GAP</p><h3>Material context the front leaves quiet.</h3></div>
         <div class="gap-grid" id="gap-grid"></div>

         <div><p class="kicker">CASE FINDINGS</p><h2>What the front says.<br>What the back proves.</h2></div>
         <div class="case-score"><span id="claim-count">0</span><small>CLAIMS<br>EXAMINED</small></div>
       </div>
+      <section class="agent-section">
+        <div class="agent-heading">
+          <div><p class="kicker">INVESTIGATION AGENT</p><h3>How this packet was examined.</h3></div>
+          <span id="router-model"></span>
+        </div>
+        <div class="agent-steps" id="agent-steps"></div>
+        <div class="agent-stop">
+          <p><b>STOP REASON</b><span id="stop-reason"></span></p>
+          <p><b>MISSING EVIDENCE</b><span id="missing-evidence"></span></p>
+        </div>
+      </section>
       <section class="gap-section">
         <div class="gap-heading"><p class="kicker">PERSUASION GAP</p><h3>Material context the front leaves quiet.</h3></div>
         <div class="gap-grid" id="gap-grid"></div>

frontend/styles.css CHANGED Viewed

@@ -14,10 +14,11 @@ main{max-width:1320px;margin:auto;padding:0 4vw}.hero{min-height:670px;display:g
 .status-line{text-align:center;font:400 11px "DM Mono";color:var(--muted)}.text-grid label>span{display:block;font:500 11px "DM Mono";letter-spacing:.1em;margin-bottom:8px}.text-grid textarea{width:100%;min-height:260px;padding:18px;border:1px solid var(--line);border-radius:14px;background:var(--cream);resize:vertical;line-height:1.55}.text-grid textarea:focus{outline:2px solid var(--red);outline-offset:2px}
 .sample-card{padding:24px;background:var(--cream);border:1px solid var(--line);border-radius:16px;cursor:pointer;text-align:left;transition:.2s}.sample-card:hover{border-color:var(--red);transform:translateY(-3px)}.sample-card b{display:block;font-size:19px;margin-bottom:8px}.sample-card span{font-size:13px;color:var(--muted)}
 .results{border-top:1px solid var(--line)}.hidden{display:none}.case-score{width:120px;height:120px;border:1px solid var(--ink);border-radius:50%;display:flex;align-items:center;justify-content:center;gap:8px}.case-score span{font:800 45px "Playfair Display"}.case-score small{font:500 8px/1.4 "DM Mono"}
 .gap-section{margin-bottom:28px;padding:26px;border:1px solid var(--ink);background:#1b1b17;color:var(--cream);border-radius:18px}.gap-heading{display:flex;justify-content:space-between;gap:20px;align-items:end;margin-bottom:18px}.gap-heading .kicker{color:#bdb4a6}.gap-heading h3{font:700 clamp(27px,4vw,46px)/1 Georgia,serif;max-width:700px;margin:0}.gap-grid{display:grid;grid-template-columns:repeat(2,1fr);gap:12px}.gap-card,.gap-empty{padding:19px;border:1px solid #4a483f;border-radius:13px;background:#26251f}.gap-card.high{border-color:var(--red)}.gap-card.medium{border-color:var(--amber)}.gap-severity{font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;letter-spacing:.13em;text-transform:uppercase;color:#d9a65e}.gap-card h4{font-size:20px;margin:10px 0 16px}.gap-compare{display:grid;grid-template-columns:1fr 1fr;gap:10px}.gap-compare p{margin:0;padding:11px;background:#313029;border-radius:8px;font-size:12px;line-height:1.45}.gap-compare b{display:block;font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;color:#bdb4a6;margin-bottom:5px}.gap-card .evidence{border-top-color:#4a483f}.gap-empty{display:grid;gap:5px;color:#bdb4a6}
 .claim-grid{grid-template-columns:repeat(2,1fr)}.claim-card{background:var(--cream);border:1px solid var(--line);border-top:6px solid var(--muted);border-radius:16px;padding:23px}.claim-card.supported{border-top-color:var(--green)}.claim-card.contradicted{border-top-color:var(--red)}.claim-card.context{border-top-color:var(--amber)}.claim-top{display:flex;justify-content:space-between;gap:10px;align-items:start}.claim-name{font-size:21px;font-weight:800}.verdict{font:500 8px/1.3 ui-monospace,SFMono-Regular,Menlo,monospace;letter-spacing:.08em;border:1px solid var(--line);border-radius:99px;padding:7px 9px;text-align:right}.confidence{display:block;margin-top:8px;font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;letter-spacing:.1em;text-transform:uppercase;color:var(--muted)}.summary{min-height:45px;color:#534d43;line-height:1.5}.evidence{padding:11px 0;border-top:1px solid var(--line)}.evidence b,.evidence span{display:block}.evidence b{font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;letter-spacing:.12em;color:var(--muted);text-transform:uppercase}.evidence span{font-size:13px;margin-top:4px}.caveat{font-size:11px;color:var(--muted);margin-top:15px}
 .evidence-summary{margin-top:16px}.evidence-summary article{padding:25px;border:1px solid var(--line);border-radius:16px;background:var(--cream)}#nutrition-grid div{display:flex;justify-content:space-between;padding:10px 0;border-bottom:1px solid var(--line);font-size:13px}.date-card{background:var(--ink)!important;color:var(--cream)}.date-card .kicker{color:#cfc5b6}.date-card h3{font:700 28px/1.15 "Playfair Display"}details{margin-top:16px;border:1px solid var(--line);border-radius:14px;padding:17px;background:var(--cream)}summary{cursor:pointer;font-weight:700}pre{white-space:pre-wrap;font:11px/1.5 "DM Mono";overflow:auto}
 .method{border-top:1px solid var(--line)}.method-grid{grid-template-columns:repeat(4,1fr);margin-top:40px}.method-grid div{padding:20px;border-top:2px solid var(--ink)}.method-grid span{font:500 10px "DM Mono";color:var(--red)}.method-grid p{font-size:13px;line-height:1.5;color:var(--muted)}
 footer{display:flex;justify-content:space-between;gap:20px;padding:25px 5vw;border-top:1px solid var(--line);font:500 10px "DM Mono";color:var(--muted)}
-@media(max-width:900px){.hero{grid-template-columns:1fr;min-height:auto;padding:80px 0}.hero-visual{height:430px}.trust-strip{grid-template-columns:1fr}.trust-strip div{border-right:0}.section-heading,.case-header{align-items:start;flex-direction:column}.upload-grid,.text-grid,.claim-grid,.evidence-summary,.gap-grid{grid-template-columns:1fr}.method-grid{grid-template-columns:repeat(2,1fr)}}
 @media(max-width:560px){.top-status,.engine-link{display:none}.hero h1{font-size:58px}.hero-visual{transform:scale(.8);transform-origin:left top;height:350px;width:125%}.workspace,.results,.method{padding:65px 0}.mode-switch{overflow:auto}.mode-switch button{white-space:nowrap;padding:13px 10px}.sample-grid,.method-grid{grid-template-columns:1fr}.case-score{width:90px;height:90px}.claim-top{display:block}.verdict{display:inline-block;margin-top:8px}footer{display:block}footer span{display:block;margin:5px 0}}

 .status-line{text-align:center;font:400 11px "DM Mono";color:var(--muted)}.text-grid label>span{display:block;font:500 11px "DM Mono";letter-spacing:.1em;margin-bottom:8px}.text-grid textarea{width:100%;min-height:260px;padding:18px;border:1px solid var(--line);border-radius:14px;background:var(--cream);resize:vertical;line-height:1.55}.text-grid textarea:focus{outline:2px solid var(--red);outline-offset:2px}
 .sample-card{padding:24px;background:var(--cream);border:1px solid var(--line);border-radius:16px;cursor:pointer;text-align:left;transition:.2s}.sample-card:hover{border-color:var(--red);transform:translateY(-3px)}.sample-card b{display:block;font-size:19px;margin-bottom:8px}.sample-card span{font-size:13px;color:var(--muted)}
 .results{border-top:1px solid var(--line)}.hidden{display:none}.case-score{width:120px;height:120px;border:1px solid var(--ink);border-radius:50%;display:flex;align-items:center;justify-content:center;gap:8px}.case-score span{font:800 45px "Playfair Display"}.case-score small{font:500 8px/1.4 "DM Mono"}
+.agent-section{margin-bottom:28px;padding:26px;border:1px solid var(--line);background:var(--cream);border-radius:18px}.agent-heading{display:flex;justify-content:space-between;align-items:end;gap:20px;margin-bottom:18px}.agent-heading h3{font:700 clamp(27px,4vw,46px)/1 Georgia,serif;margin:0}.agent-heading>span{font:500 9px ui-monospace,SFMono-Regular,Menlo,monospace;padding:8px 11px;border:1px solid var(--line);border-radius:99px;color:var(--green)}.agent-steps{display:grid;grid-template-columns:repeat(2,1fr);gap:9px}.agent-steps article{display:grid;grid-template-columns:auto 1fr;gap:10px;padding:15px;border:1px solid var(--line);border-radius:11px;background:#f8f3e9}.agent-steps article>span{font:500 10px ui-monospace,SFMono-Regular,Menlo,monospace;color:var(--red)}.agent-steps b{font-size:13px;text-transform:capitalize}.agent-steps p{font-size:11px;line-height:1.4;color:var(--muted);margin:5px 0}.agent-steps small{grid-column:2;font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;text-transform:uppercase;color:var(--green)}.agent-stop{display:grid;grid-template-columns:1fr 1fr;gap:9px;margin-top:9px}.agent-stop p{margin:0;padding:14px;border-top:1px solid var(--line);font-size:11px;line-height:1.5}.agent-stop b,.agent-stop span{display:block}.agent-stop b{font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;color:var(--muted);margin-bottom:5px}
 .gap-section{margin-bottom:28px;padding:26px;border:1px solid var(--ink);background:#1b1b17;color:var(--cream);border-radius:18px}.gap-heading{display:flex;justify-content:space-between;gap:20px;align-items:end;margin-bottom:18px}.gap-heading .kicker{color:#bdb4a6}.gap-heading h3{font:700 clamp(27px,4vw,46px)/1 Georgia,serif;max-width:700px;margin:0}.gap-grid{display:grid;grid-template-columns:repeat(2,1fr);gap:12px}.gap-card,.gap-empty{padding:19px;border:1px solid #4a483f;border-radius:13px;background:#26251f}.gap-card.high{border-color:var(--red)}.gap-card.medium{border-color:var(--amber)}.gap-severity{font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;letter-spacing:.13em;text-transform:uppercase;color:#d9a65e}.gap-card h4{font-size:20px;margin:10px 0 16px}.gap-compare{display:grid;grid-template-columns:1fr 1fr;gap:10px}.gap-compare p{margin:0;padding:11px;background:#313029;border-radius:8px;font-size:12px;line-height:1.45}.gap-compare b{display:block;font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;color:#bdb4a6;margin-bottom:5px}.gap-card .evidence{border-top-color:#4a483f}.gap-empty{display:grid;gap:5px;color:#bdb4a6}
 .claim-grid{grid-template-columns:repeat(2,1fr)}.claim-card{background:var(--cream);border:1px solid var(--line);border-top:6px solid var(--muted);border-radius:16px;padding:23px}.claim-card.supported{border-top-color:var(--green)}.claim-card.contradicted{border-top-color:var(--red)}.claim-card.context{border-top-color:var(--amber)}.claim-top{display:flex;justify-content:space-between;gap:10px;align-items:start}.claim-name{font-size:21px;font-weight:800}.verdict{font:500 8px/1.3 ui-monospace,SFMono-Regular,Menlo,monospace;letter-spacing:.08em;border:1px solid var(--line);border-radius:99px;padding:7px 9px;text-align:right}.confidence{display:block;margin-top:8px;font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;letter-spacing:.1em;text-transform:uppercase;color:var(--muted)}.summary{min-height:45px;color:#534d43;line-height:1.5}.evidence{padding:11px 0;border-top:1px solid var(--line)}.evidence b,.evidence span{display:block}.evidence b{font:500 8px ui-monospace,SFMono-Regular,Menlo,monospace;letter-spacing:.12em;color:var(--muted);text-transform:uppercase}.evidence span{font-size:13px;margin-top:4px}.caveat{font-size:11px;color:var(--muted);margin-top:15px}
 .evidence-summary{margin-top:16px}.evidence-summary article{padding:25px;border:1px solid var(--line);border-radius:16px;background:var(--cream)}#nutrition-grid div{display:flex;justify-content:space-between;padding:10px 0;border-bottom:1px solid var(--line);font-size:13px}.date-card{background:var(--ink)!important;color:var(--cream)}.date-card .kicker{color:#cfc5b6}.date-card h3{font:700 28px/1.15 "Playfair Display"}details{margin-top:16px;border:1px solid var(--line);border-radius:14px;padding:17px;background:var(--cream)}summary{cursor:pointer;font-weight:700}pre{white-space:pre-wrap;font:11px/1.5 "DM Mono";overflow:auto}
 .method{border-top:1px solid var(--line)}.method-grid{grid-template-columns:repeat(4,1fr);margin-top:40px}.method-grid div{padding:20px;border-top:2px solid var(--ink)}.method-grid span{font:500 10px "DM Mono";color:var(--red)}.method-grid p{font-size:13px;line-height:1.5;color:var(--muted)}
 footer{display:flex;justify-content:space-between;gap:20px;padding:25px 5vw;border-top:1px solid var(--line);font:500 10px "DM Mono";color:var(--muted)}
+@media(max-width:900px){.hero{grid-template-columns:1fr;min-height:auto;padding:80px 0}.hero-visual{height:430px}.trust-strip{grid-template-columns:1fr}.trust-strip div{border-right:0}.section-heading,.case-header,.agent-heading{align-items:start;flex-direction:column}.upload-grid,.text-grid,.claim-grid,.evidence-summary,.gap-grid,.agent-steps,.agent-stop{grid-template-columns:1fr}.method-grid{grid-template-columns:repeat(2,1fr)}}
 @media(max-width:560px){.top-status,.engine-link{display:none}.hero h1{font-size:58px}.hero-visual{transform:scale(.8);transform-origin:left top;height:350px;width:125%}.workspace,.results,.method{padding:65px 0}.mode-switch{overflow:auto}.mode-switch button{white-space:nowrap;padding:13px 10px}.sample-grid,.method-grid{grid-template-columns:1fr}.case-score{width:90px;height:90px}.claim-top{display:block}.verdict{display:inline-block;margin-top:8px}footer{display:block}footer span{display:block;margin:5px 0}}

requirements.txt CHANGED Viewed

@@ -5,3 +5,5 @@ pydantic>=2.10.0
 pytesseract>=0.3.13
 python-multipart>=0.0.20
 uvicorn>=0.34.0

 pytesseract>=0.3.13
 python-multipart>=0.0.20
 uvicorn>=0.34.0
+transformers>=4.53.0
+torch>=2.2.0

scripts/export_traces.py ADDED Viewed

	@@ -0,0 +1,42 @@

+from __future__ import annotations
+import json
+import sys
+from pathlib import Path
+ROOT = Path(__file__).resolve().parents[1]
+sys.path.insert(0, str(ROOT / "src"))
+from packetcourt import audit_packet
+def main() -> None:
+    cases = [json.loads(line) for line in (ROOT / "data" / "golden_cases.jsonl").read_text().splitlines() if line.strip()]
+    target = ROOT / "traces" / "packetcourt_traces.jsonl"
+    target.parent.mkdir(exist_ok=True)
+    records = []
+    for case in cases:
+        audit = audit_packet(case["front_text"], case["back_text"])
+        records.append(
+            {
+                "trace_id": f"trace-{case['id']}",
+                "case_id": case["id"],
+                "input": {"front_text": case["front_text"], "back_text": case["back_text"]},
+                "steps": [
+                    {"name": "plan_investigation", "output": audit.investigation.model_dump()},
+                    {"name": "detect_front_claims", "output": [claim.claim for claim in audit.claims]},
+                    {"name": "extract_back_evidence", "output": {"ingredients": audit.ingredients, "nutrition": audit.nutrition.model_dump()}},
+                    {"name": "calculate_whole_packet", "output": audit.whole_packet.model_dump()},
+                    {"name": "audit_claims", "output": [claim.model_dump(mode="json") for claim in audit.claims]},
+                    {"name": "surface_persuasion_gap", "output": [finding.model_dump() for finding in audit.persuasion_gap]},
+                    {"name": "resolve_dates", "output": audit.expiry.model_dump()},
+                ],
+                "limitations": audit.limitations,
+            }
+        )
+    target.write_text("\n".join(json.dumps(record) for record in records) + "\n")
+    print(f"Wrote {len(records)} transparent traces to {target}")
+if __name__ == "__main__":
+    main()

scripts/train_router.py ADDED Viewed

	@@ -0,0 +1,135 @@

+from __future__ import annotations
+import argparse
+import json
+import random
+from pathlib import Path
+import torch
+from huggingface_hub import HfApi
+from torch.utils.data import DataLoader, Dataset
+from transformers import AutoModelForSequenceClassification, AutoTokenizer
+ROOT = Path(__file__).resolve().parents[1]
+LABELS = ["ingredients", "nutrition", "license", "dates", "refuse_absolute"]
+LABEL_TO_ID = {label: index for index, label in enumerate(LABELS)}
+class RouterDataset(Dataset):
+    def __init__(self, records, tokenizer):
+        self.records = records
+        self.tokenizer = tokenizer
+    def __len__(self):
+        return len(self.records)
+    def __getitem__(self, index):
+        record = self.records[index]
+        encoded = self.tokenizer(
+            record["text"],
+            padding="max_length",
+            truncation=True,
+            max_length=32,
+            return_tensors="pt",
+        )
+        return {
+            "input_ids": encoded["input_ids"].squeeze(0),
+            "attention_mask": encoded["attention_mask"].squeeze(0),
+            "labels": torch.tensor(LABEL_TO_ID[record["label"]]),
+        }
+def evaluate(model, loader, device):
+    model.eval()
+    correct = total = 0
+    with torch.no_grad():
+        for batch in loader:
+            labels = batch.pop("labels").to(device)
+            logits = model(**{key: value.to(device) for key, value in batch.items()}).logits
+            correct += (logits.argmax(dim=-1) == labels).sum().item()
+            total += labels.numel()
+    return correct / total
+def main():
+    parser = argparse.ArgumentParser()
+    parser.add_argument("--repo-id", default="build-small-hackathon/packetcourt-evidence-router")
+    parser.add_argument("--base-model", default="google/bert_uncased_L-2_H-128_A-2")
+    parser.add_argument("--epochs", type=int, default=30)
+    args = parser.parse_args()
+    random.seed(42)
+    torch.manual_seed(42)
+    records = [json.loads(line) for line in (ROOT / "data/router_training.jsonl").read_text().splitlines()]
+    grouped = {label: [] for label in LABELS}
+    for record in records:
+        grouped[record["label"]].append(record)
+    for group in grouped.values():
+        random.shuffle(group)
+    validation = [group.pop() for group in grouped.values()]
+    training = [record for group in grouped.values() for record in group]
+    random.shuffle(training)
+    tokenizer = AutoTokenizer.from_pretrained(args.base_model)
+    model = AutoModelForSequenceClassification.from_pretrained(
+        args.base_model,
+        num_labels=len(LABELS),
+        id2label={index: label for index, label in enumerate(LABELS)},
+        label2id=LABEL_TO_ID,
+    )
+    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
+    model.to(device)
+    train_loader = DataLoader(RouterDataset(training, tokenizer), batch_size=8, shuffle=True)
+    validation_loader = DataLoader(RouterDataset(validation, tokenizer), batch_size=5)
+    optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
+    for epoch in range(args.epochs):
+        model.train()
+        for batch in train_loader:
+            optimizer.zero_grad()
+            labels = batch.pop("labels").to(device)
+            loss = model(**{key: value.to(device) for key, value in batch.items()}, labels=labels).loss
+            loss.backward()
+            optimizer.step()
+        print(f"epoch={epoch + 1} validation_accuracy={evaluate(model, validation_loader, device):.3f}")
+    output = ROOT / "router_model"
+    model.save_pretrained(output)
+    tokenizer.save_pretrained(output)
+    score = evaluate(model, validation_loader, device)
+    card = f"""---
+license: apache-2.0
+base_model: {args.base_model}
+tags:
+- text-classification
+- build-small-hackathon
+- packetcourt
+- fine-tuned
+---
+# PacketCourt Evidence Router
+A {sum(parameter.numel() for parameter in model.parameters()):,}-parameter fine-tuned classifier used by
+PacketCourt's investigation agent to choose the next evidence tool for a packet claim.
+Labels: `{", ".join(LABELS)}`.
+Held-out validation accuracy: `{score:.3f}` on a small PacketCourt-specific routing set.
+The router proposes an investigation tool; deterministic code remains responsible for final verdicts.
+"""
+    (output / "README.md").write_text(card)
+    api = HfApi()
+    api.create_repo(args.repo_id, repo_type="model", private=True, exist_ok=True)
+    api.upload_folder(
+        repo_id=args.repo_id,
+        repo_type="model",
+        folder_path=output,
+        commit_message="feat: publish PacketCourt fine-tuned evidence router",
+    )
+    print(f"published={args.repo_id} validation_accuracy={score:.3f}")
+if __name__ == "__main__":
+    main()

src/packetcourt/audit.py CHANGED Viewed

@@ -3,6 +3,7 @@ from __future__ import annotations
 import re
 from .models import ClaimAudit, Evidence, PacketAudit, PersuasionFinding, Verdict
 from .parser import calculate_whole_packet, extract_claims, extract_ingredients, parse_expiry, parse_nutrition
@@ -266,4 +267,10 @@ def audit_packet(front_text: str, back_text: str) -> PacketAudit:
         front_text=front_text,
         back_text=back_text,
         limitations=limitations,
     )

 import re
 from .models import ClaimAudit, Evidence, PacketAudit, PersuasionFinding, Verdict
+from .investigator import build_investigation
 from .parser import calculate_whole_packet, extract_claims, extract_ingredients, parse_expiry, parse_nutrition
         front_text=front_text,
         back_text=back_text,
         limitations=limitations,
+        investigation=build_investigation(
+            [claim.claim for claim in claim_audits],
+            ingredients,
+            nutrition,
+            expiry,
+        ),
     )

src/packetcourt/evidence_router.py ADDED Viewed

	@@ -0,0 +1,39 @@

+from __future__ import annotations
+import os
+from functools import lru_cache
+MODEL_ID = os.getenv(
+    "PACKETCOURT_ROUTER_MODEL",
+    "build-small-hackathon/packetcourt-evidence-router",
+)
+LABEL_TO_TOOL = {
+    "ingredients": "inspect_ingredients",
+    "nutrition": "inspect_nutrition",
+    "license": "inspect_license",
+    "dates": "resolve_dates",
+    "refuse_absolute": "apply_safety_boundary",
+}
+@lru_cache(maxsize=1)
+def _pipeline():
+    if os.getenv("PACKETCOURT_ROUTER", "0") != "1":
+        return None
+    from transformers import pipeline
+    return pipeline("text-classification", model=MODEL_ID, tokenizer=MODEL_ID)
+def route_claim(claim: str) -> tuple[str | None, str]:
+    try:
+        classifier = _pipeline()
+    except Exception:
+        return None, "deterministic fallback"
+    if classifier is None:
+        return None, "deterministic fallback"
+    result = classifier(claim, truncation=True, max_length=32)[0]
+    label = str(result["label"]).lower()
+    return LABEL_TO_TOOL.get(label), MODEL_ID

src/packetcourt/investigator.py ADDED Viewed

	@@ -0,0 +1,74 @@

+from __future__ import annotations
+from .evidence_router import route_claim
+from .models import InvestigationPlan, InvestigationStep
+POLICY_TOOLS = {
+    "No Added Sugar": "inspect_ingredients",
+    "Multigrain": "inspect_ingredients",
+    "100% Natural": "apply_safety_boundary",
+    "FSSAI Approved": "inspect_license",
+    "No Preservatives": "inspect_ingredients",
+    "Baked Not Fried": "inspect_nutrition",
+    "Zero Trans Fat": "inspect_nutrition",
+    "Whole Grain": "inspect_ingredients",
+    "High Protein": "inspect_nutrition",
+}
+def build_investigation(
+    claim_names: list[str],
+    ingredients: list[str],
+    nutrition,
+    expiry,
+) -> InvestigationPlan:
+    steps: list[InvestigationStep] = []
+    missing: list[str] = []
+    seen: set[str] = set()
+    router_model = "deterministic fallback"
+    for claim in claim_names:
+        routed_tool, source = route_claim(claim)
+        router_model = source if source != "deterministic fallback" else router_model
+        tool = routed_tool or POLICY_TOOLS[claim]
+        if tool in seen:
+            continue
+        seen.add(tool)
+        steps.append(
+            InvestigationStep(
+                tool=tool,
+                reason=f"Required to audit the front claim: {claim}.",
+                status="completed",
+                source="fine-tuned router" if routed_tool else "policy fallback",
+            )
+        )
+    if claim_names and not ingredients and any(POLICY_TOOLS[name] == "inspect_ingredients" for name in claim_names):
+        missing.append("A readable ingredient list")
+    if claim_names and nutrition.basis == "unknown" and any(POLICY_TOOLS[name] == "inspect_nutrition" for name in claim_names):
+        missing.append("A readable nutrition panel with its measurement basis")
+    if expiry.instruction and not expiry.packed_on:
+        missing.append("The packing or manufacturing date needed to resolve relative shelf life")
+    if expiry.best_before or expiry.instruction or expiry.after_opening_instruction:
+        steps.append(
+            InvestigationStep(
+                tool="resolve_dates",
+                reason="Date or after-opening evidence is visible on the supplied label.",
+                status="completed" if expiry.best_before or expiry.after_opening_instruction else "needs evidence",
+            )
+        )
+    stop_reason = (
+        "Stopped with explicit missing-evidence requests."
+        if missing
+        else "Stopped after all evidence tools required by the detected claims completed."
+    )
+    return InvestigationPlan(
+        objective="Audit front-of-pack claims against evidence printed on the same packet.",
+        steps=steps,
+        missing_evidence=missing,
+        stop_reason=stop_reason,
+        router_model=router_model,
+    )

src/packetcourt/models.py CHANGED Viewed

@@ -65,6 +65,21 @@ class ExpiryInfo(BaseModel):
     status: str = "Not enough label evidence"
 class PacketAudit(BaseModel):
     claims: list[ClaimAudit]
     nutrition: NutritionFacts
@@ -75,3 +90,4 @@ class PacketAudit(BaseModel):
     front_text: str
     back_text: str
     limitations: list[str]

     status: str = "Not enough label evidence"
+class InvestigationStep(BaseModel):
+    tool: str
+    reason: str
+    status: str
+    source: str = "policy"
+class InvestigationPlan(BaseModel):
+    objective: str
+    steps: list[InvestigationStep] = Field(default_factory=list)
+    missing_evidence: list[str] = Field(default_factory=list)
+    stop_reason: str
+    router_model: str = "deterministic fallback"
 class PacketAudit(BaseModel):
     claims: list[ClaimAudit]
     nutrition: NutritionFacts
     front_text: str
     back_text: str
     limitations: list[str]
+    investigation: InvestigationPlan

tests/test_audit.py CHANGED Viewed

@@ -80,3 +80,10 @@ def test_after_opening_instruction_is_extracted():
         "Ingredients: tomato, salt. Use by: 08 JUL 2026. Consume within 3 days after opening.",
     )
     assert result.expiry.after_opening_instruction == "Consume within 3 days after opening"

         "Ingredients: tomato, salt. Use by: 08 JUL 2026. Consume within 3 days after opening.",
     )
     assert result.expiry.after_opening_instruction == "Consume within 3 days after opening"
+def test_investigation_requests_missing_evidence_and_stops_explicitly():
+    result = audit_packet("HIGH PROTEIN", "Protein 9g.")
+    assert any(step.tool == "inspect_nutrition" for step in result.investigation.steps)
+    assert any("nutrition panel" in item.lower() for item in result.investigation.missing_evidence)
+    assert "missing-evidence" in result.investigation.stop_reason

traces/README.md ADDED Viewed

	@@ -0,0 +1,28 @@

+---
+license: cc-by-4.0
+task_categories:
+  - text-classification
+language:
+  - en
+tags:
+  - build-small-hackathon
+  - agent-traces
+  - claim-verification
+  - openbmb
+size_categories:
+  - n<1K
+---
+# PacketCourt Transparent Traces
+Transparent PacketCourt investigation-agent runs showing the evidence pipeline
+from claim-dependent tool planning through deterministic verdicts,
+whole-packet arithmetic, persuasion-gap findings, and date resolution.
+These traces contain no hidden chain-of-thought. They expose auditable tool and
+decision outputs suitable for debugging and evaluation. Each trace records:
+- the investigation objective and selected evidence tools;
+- whether a tool came from the fine-tuned router or policy fallback;
+- explicit missing-evidence requests and stop reason;
+- extracted evidence, calculations, verdicts, and safety limitations.

traces/packetcourt_traces.jsonl ADDED Viewed

	@@ -0,0 +1,10 @@

+{"trace_id": "trace-pc-001", "case_id": "pc-001", "input": {"front_text": "HIGH PROTEIN MULTIGRAIN 100% NATURAL", "back_text": "Ingredients: Refined wheat flour, rolled oats, ragi flour, sugar, cocoa, salt. Nutrition per 100g: Protein 12.4g, Total Sugars 22g, Added Sugars 18g, Sodium 410mg. Net weight 300g. PKD: 13 JUN 26. Best before 6 months from packaging."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_nutrition", "reason": "Required to audit the front claim: High Protein.", "status": "completed", "source": "fine-tuned router"}, {"tool": "inspect_ingredients", "reason": "Required to audit the front claim: Multigrain.", "status": "completed", "source": "fine-tuned router"}, {"tool": "apply_safety_boundary", "reason": "Required to audit the front claim: 100% Natural.", "status": "completed", "source": "fine-tuned router"}, {"tool": "resolve_dates", "reason": "Date or after-opening evidence is visible on the supplied label.", "status": "completed", "source": "policy"}], "missing_evidence": [], "stop_reason": "Stopped after all evidence tools required by the detected claims completed.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["High Protein", "Multigrain", "100% Natural"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Refined wheat flour", "rolled oats", "ragi flour", "sugar", "cocoa", "salt"], "nutrition": {"basis": "per 100g", "serving_size_g": null, "package_size_g": 300.0, "protein_g": 12.4, "total_sugar_g": 22.0, "added_sugar_g": 18.0, "sodium_mg": 410.0, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": true, "multiplier": 3.0, "protein_g": 37.2, "total_sugar_g": 66.0, "added_sugar_g": 54.0, "sugar_teaspoons": 16.5, "sodium_mg": 1230.0, "saturated_fat_g": null, "explanation": "Calculated from per 100g values across a 300g packet."}}, {"name": "audit_claims", "output": [{"claim": "High Protein", "verdict": "TECHNICALLY TRUE, CONTEXT MISSING", "summary": "The protein quantity is visible, but claim compliance depends on product category and applicable rules.", "evidence": [{"source": "nutrition panel", "text": "Protein 12.4g (per 100g)"}], "caveat": "PacketCourt does not make a regulatory-compliance determination in this prototype.", "confidence": "medium"}, {"claim": "Multigrain", "verdict": "TECHNICALLY TRUE, CONTEXT MISSING", "summary": "Multiple grains are listed, but refined grain appears first.", "evidence": [{"source": "ingredient list", "text": "Refined wheat flour"}, {"source": "ingredient list", "text": "rolled oats"}, {"source": "ingredient list", "text": "ragi flour"}], "caveat": "Ingredient order indicates relative quantity, but exact grain percentages may be unavailable.", "confidence": "high"}, {"claim": "100% Natural", "verdict": "CANNOT VERIFY", "summary": "An absolute naturalness claim cannot be established from package text alone.", "evidence": [{"source": "front claim", "text": "100% Natural"}], "caveat": "PacketCourt refuses to infer product composition beyond the supplied label.", "confidence": "high"}]}, {"name": "surface_persuasion_gap", "output": [{"headline": "Protein leads. Whole-packet sugar stays quiet.", "front_impression": "The front positions protein as the packet's defining fact.", "quiet_context": "The complete packet contains about 16.5 teaspoons of total sugar.", "severity": "high", "evidence": [{"source": "whole-packet calculation", "text": "Total sugar 66g"}, {"source": "conversion", "text": "66g \u00f7 4 = 16.5 teaspoons"}]}, {"headline": "A positive front claim competes with substantial sodium.", "front_impression": "The front emphasizes a favorable product attribute.", "quiet_context": "The complete packet calculates to approximately 1230mg sodium.", "severity": "high", "evidence": [{"source": "whole-packet calculation", "text": "Sodium 1230mg"}]}, {"headline": "Grain variety is prominent. The first ingredient is refined.", "front_impression": "The front suggests a grain-forward product.", "quiet_context": "The ingredient list begins with \u201cRefined wheat flour\u201d.", "severity": "medium", "evidence": [{"source": "first ingredient", "text": "Refined wheat flour"}]}]}, {"name": "resolve_dates", "output": {"packed_on": "2026-06-13", "best_before": "2026-12-13", "instruction": "Best before 6 months from packaging", "after_opening_instruction": null, "status": "Best-before evidence resolves to 2026-12-13"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-002", "case_id": "pc-002", "input": {"front_text": "NO ADDED SUGAR", "back_text": "Ingredients: Rolled oats, glucose syrup, peanuts. Nutrition per 100g: Total Sugars 19g, Added Sugars 12g."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_ingredients", "reason": "Required to audit the front claim: No Added Sugar.", "status": "completed", "source": "fine-tuned router"}], "missing_evidence": [], "stop_reason": "Stopped after all evidence tools required by the detected claims completed.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["No Added Sugar"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Rolled oats", "glucose syrup", "peanuts"], "nutrition": {"basis": "per 100g", "serving_size_g": null, "package_size_g": null, "protein_g": null, "total_sugar_g": 19.0, "added_sugar_g": 12.0, "sodium_mg": null, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": false, "multiplier": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sugar_teaspoons": null, "sodium_mg": null, "saturated_fat_g": null, "explanation": "Package size and nutrition basis are required."}}, {"name": "audit_claims", "output": [{"claim": "No Added Sugar", "verdict": "CONTRADICTED BY PROVIDED LABEL", "summary": "The provided ingredient list names one or more added-sugar ingredients.", "evidence": [{"source": "ingredient list", "text": "glucose syrup"}], "caveat": "This verdict only checks the supplied label text; it is not a laboratory analysis.", "confidence": "high"}]}, {"name": "surface_persuasion_gap", "output": []}, {"name": "resolve_dates", "output": {"packed_on": null, "best_before": null, "instruction": null, "after_opening_instruction": null, "status": "No resolvable best-before date found"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-003", "case_id": "pc-003", "input": {"front_text": "NO ADDED SUGAR", "back_text": "Ingredients: Rolled oats, peanuts, cocoa, salt. Nutrition per 100g: Total Sugars 2g."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_ingredients", "reason": "Required to audit the front claim: No Added Sugar.", "status": "completed", "source": "fine-tuned router"}], "missing_evidence": [], "stop_reason": "Stopped after all evidence tools required by the detected claims completed.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["No Added Sugar"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Rolled oats", "peanuts", "cocoa", "salt"], "nutrition": {"basis": "per 100g", "serving_size_g": null, "package_size_g": null, "protein_g": null, "total_sugar_g": 2.0, "added_sugar_g": null, "sodium_mg": null, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": false, "multiplier": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sugar_teaspoons": null, "sodium_mg": null, "saturated_fat_g": null, "explanation": "Package size and nutrition basis are required."}}, {"name": "audit_claims", "output": [{"claim": "No Added Sugar", "verdict": "SUPPORTED BY PROVIDED LABEL", "summary": "No common added-sugar term was found in the provided ingredient list.", "evidence": [{"source": "ingredient list", "text": "Rolled oats, peanuts, cocoa, salt"}], "caveat": "Unrecognized sweeteners or incomplete OCR may change this result.", "confidence": "medium"}]}, {"name": "surface_persuasion_gap", "output": []}, {"name": "resolve_dates", "output": {"packed_on": null, "best_before": null, "instruction": null, "after_opening_instruction": null, "status": "No resolvable best-before date found"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-004", "case_id": "pc-004", "input": {"front_text": "FSSAI APPROVED", "back_text": "FSSAI Lic. No. 12345678901234. Ingredients: oats, salt."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_license", "reason": "Required to audit the front claim: FSSAI Approved.", "status": "completed", "source": "fine-tuned router"}], "missing_evidence": [], "stop_reason": "Stopped after all evidence tools required by the detected claims completed.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["FSSAI Approved"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["oats", "salt"], "nutrition": {"basis": "unknown", "serving_size_g": null, "package_size_g": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sodium_mg": null, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": false, "multiplier": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sugar_teaspoons": null, "sodium_mg": null, "saturated_fat_g": null, "explanation": "Package size and nutrition basis are required."}}, {"name": "audit_claims", "output": [{"claim": "FSSAI Approved", "verdict": "TECHNICALLY TRUE, CONTEXT MISSING", "summary": "An FSSAI license indicates regulatory registration; it is not a health endorsement.", "evidence": [{"source": "back label", "text": "FSSAI license number 12345678901234"}], "caveat": "", "confidence": "high"}]}, {"name": "surface_persuasion_gap", "output": [{"headline": "Registration language can look like a health endorsement.", "front_impression": "\u201cFSSAI Approved\u201d may imply the product has been endorsed as healthy.", "quiet_context": "An FSSAI license identifies regulatory registration; it is not a nutrition recommendation.", "severity": "medium", "evidence": [{"source": "claim interpretation", "text": "FSSAI registration is not a health score."}]}]}, {"name": "resolve_dates", "output": {"packed_on": null, "best_before": null, "instruction": null, "after_opening_instruction": null, "status": "No resolvable best-before date found"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-005", "case_id": "pc-005", "input": {"front_text": "BAKED NOT FRIED WHOLE GRAIN ZERO TRANS FAT", "back_text": "Ingredients: Refined wheat flour, whole wheat flour, vegetable oil, seasoning, salt. Nutrition per 100g: Protein 7g, Total Sugars 3g, Sodium 780mg, Saturated Fat 5g, Trans Fat 0g. Net weight 180g. PKD: 01 JUN 26. Best before 4 months from packaging."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_nutrition", "reason": "Required to audit the front claim: Baked Not Fried.", "status": "completed", "source": "fine-tuned router"}, {"tool": "inspect_ingredients", "reason": "Required to audit the front claim: Whole Grain.", "status": "completed", "source": "fine-tuned router"}, {"tool": "resolve_dates", "reason": "Date or after-opening evidence is visible on the supplied label.", "status": "completed", "source": "policy"}], "missing_evidence": [], "stop_reason": "Stopped after all evidence tools required by the detected claims completed.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["Baked Not Fried", "Zero Trans Fat", "Whole Grain"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Refined wheat flour", "whole wheat flour", "vegetable oil", "seasoning", "salt"], "nutrition": {"basis": "per 100g", "serving_size_g": null, "package_size_g": 180.0, "protein_g": 7.0, "total_sugar_g": 3.0, "added_sugar_g": null, "sodium_mg": 780.0, "saturated_fat_g": 5.0}}}, {"name": "calculate_whole_packet", "output": {"calculable": true, "multiplier": 1.8, "protein_g": 12.6, "total_sugar_g": 5.4, "added_sugar_g": null, "sugar_teaspoons": 1.4, "sodium_mg": 1404.0, "saturated_fat_g": 9.0, "explanation": "Calculated from per 100g values across a 180g packet."}}, {"name": "audit_claims", "output": [{"claim": "Baked Not Fried", "verdict": "TECHNICALLY TRUE, CONTEXT MISSING", "summary": "The preparation claim does not establish that the complete packet is low in fat, sodium, or calories.", "evidence": [{"source": "front claim", "text": "Baked Not Fried"}], "caveat": "Review the nutrition panel and ingredient list for the complete product context.", "confidence": "high"}, {"claim": "Zero Trans Fat", "verdict": "SUPPORTED BY PROVIDED LABEL", "summary": "The supplied nutrition panel reports 0g trans fat.", "evidence": [{"source": "nutrition panel", "text": "Trans Fat 0g"}], "caveat": "A zero declaration may still be subject to applicable rounding rules.", "confidence": "high"}, {"claim": "Whole Grain", "verdict": "TECHNICALLY TRUE, CONTEXT MISSING", "summary": "Whole grain is present, but refined grain appears first.", "evidence": [{"source": "ingredient list", "text": "whole wheat flour"}, {"source": "ingredient list", "text": "Refined wheat flour"}], "caveat": "", "confidence": "high"}]}, {"name": "surface_persuasion_gap", "output": [{"headline": "A positive front claim competes with substantial sodium.", "front_impression": "The front emphasizes a favorable product attribute.", "quiet_context": "The complete packet calculates to approximately 1404mg sodium.", "severity": "high", "evidence": [{"source": "whole-packet calculation", "text": "Sodium 1404mg"}]}, {"headline": "Grain variety is prominent. The first ingredient is refined.", "front_impression": "The front suggests a grain-forward product.", "quiet_context": "The ingredient list begins with \u201cRefined wheat flour\u201d.", "severity": "medium", "evidence": [{"source": "first ingredient", "text": "Refined wheat flour"}]}]}, {"name": "resolve_dates", "output": {"packed_on": "2026-06-01", "best_before": "2026-10-01", "instruction": "Best before 4 months from packaging", "after_opening_instruction": null, "status": "Best-before evidence resolves to 2026-10-01"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-006", "case_id": "pc-006", "input": {"front_text": "100% NATURAL", "back_text": "Ingredients: Chickpea flour, spices, salt."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "apply_safety_boundary", "reason": "Required to audit the front claim: 100% Natural.", "status": "completed", "source": "fine-tuned router"}], "missing_evidence": [], "stop_reason": "Stopped after all evidence tools required by the detected claims completed.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["100% Natural"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Chickpea flour", "spices", "salt"], "nutrition": {"basis": "unknown", "serving_size_g": null, "package_size_g": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sodium_mg": null, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": false, "multiplier": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sugar_teaspoons": null, "sodium_mg": null, "saturated_fat_g": null, "explanation": "Package size and nutrition basis are required."}}, {"name": "audit_claims", "output": [{"claim": "100% Natural", "verdict": "CANNOT VERIFY", "summary": "An absolute naturalness claim cannot be established from package text alone.", "evidence": [{"source": "front claim", "text": "100% Natural"}], "caveat": "PacketCourt refuses to infer product composition beyond the supplied label.", "confidence": "high"}]}, {"name": "surface_persuasion_gap", "output": []}, {"name": "resolve_dates", "output": {"packed_on": null, "best_before": null, "instruction": null, "after_opening_instruction": null, "status": "No resolvable best-before date found"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-007", "case_id": "pc-007", "input": {"front_text": "NO PRESERVATIVES", "back_text": "Ingredients: Tomato pulp, sugar, salt, sodium benzoate. Use by: 08 JUL 2026."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_ingredients", "reason": "Required to audit the front claim: No Preservatives.", "status": "completed", "source": "fine-tuned router"}, {"tool": "resolve_dates", "reason": "Date or after-opening evidence is visible on the supplied label.", "status": "completed", "source": "policy"}], "missing_evidence": [], "stop_reason": "Stopped after all evidence tools required by the detected claims completed.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["No Preservatives"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Tomato pulp", "sugar", "salt", "sodium benzoate. Use by: 08 JUL 2026"], "nutrition": {"basis": "unknown", "serving_size_g": null, "package_size_g": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sodium_mg": null, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": false, "multiplier": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sugar_teaspoons": null, "sodium_mg": null, "saturated_fat_g": null, "explanation": "Package size and nutrition basis are required."}}, {"name": "audit_claims", "output": [{"claim": "No Preservatives", "verdict": "CONTRADICTED BY PROVIDED LABEL", "summary": "The ingredient list contains a recognizable preservative term or code.", "evidence": [{"source": "ingredient list", "text": "sodium benzoate. Use by: 08 JUL 2026"}], "caveat": "", "confidence": "high"}]}, {"name": "surface_persuasion_gap", "output": []}, {"name": "resolve_dates", "output": {"packed_on": null, "best_before": "2026-07-08", "instruction": null, "after_opening_instruction": null, "status": "Best-before evidence resolves to 2026-07-08"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-008", "case_id": "pc-008", "input": {"front_text": "NO PRESERVATIVES", "back_text": "Ingredients: Tomato, salt. Use by: 08 JUL 2026. Consume within 3 days after opening."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_ingredients", "reason": "Required to audit the front claim: No Preservatives.", "status": "completed", "source": "fine-tuned router"}, {"tool": "resolve_dates", "reason": "Date or after-opening evidence is visible on the supplied label.", "status": "completed", "source": "policy"}], "missing_evidence": [], "stop_reason": "Stopped after all evidence tools required by the detected claims completed.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["No Preservatives"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Tomato", "salt. Use by: 08 JUL 2026. Consume within 3 days after opening"], "nutrition": {"basis": "unknown", "serving_size_g": null, "package_size_g": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sodium_mg": null, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": false, "multiplier": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sugar_teaspoons": null, "sodium_mg": null, "saturated_fat_g": null, "explanation": "Package size and nutrition basis are required."}}, {"name": "audit_claims", "output": [{"claim": "No Preservatives", "verdict": "SUPPORTED BY PROVIDED LABEL", "summary": "No recognizable preservative term was found in the supplied ingredient list.", "evidence": [{"source": "ingredient list", "text": "Tomato, salt. Use by: 08 JUL 2026. Consume within 3 days after opening"}], "caveat": "Incomplete OCR or unfamiliar additive codes may change this result.", "confidence": "medium"}]}, {"name": "surface_persuasion_gap", "output": []}, {"name": "resolve_dates", "output": {"packed_on": null, "best_before": "2026-07-08", "instruction": null, "after_opening_instruction": "Consume within 3 days after opening", "status": "Best-before evidence resolves to 2026-07-08"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-009", "case_id": "pc-009", "input": {"front_text": "MULTIGRAIN", "back_text": "Ingredients: Whole wheat flour, oats, ragi flour. Best before 6 months from packaging."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_ingredients", "reason": "Required to audit the front claim: Multigrain.", "status": "completed", "source": "fine-tuned router"}, {"tool": "resolve_dates", "reason": "Date or after-opening evidence is visible on the supplied label.", "status": "needs evidence", "source": "policy"}], "missing_evidence": ["The packing or manufacturing date needed to resolve relative shelf life"], "stop_reason": "Stopped with explicit missing-evidence requests.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["Multigrain"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Whole wheat flour", "oats", "ragi flour"], "nutrition": {"basis": "unknown", "serving_size_g": null, "package_size_g": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sodium_mg": null, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": false, "multiplier": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sugar_teaspoons": null, "sodium_mg": null, "saturated_fat_g": null, "explanation": "Package size and nutrition basis are required."}}, {"name": "audit_claims", "output": [{"claim": "Multigrain", "verdict": "SUPPORTED BY PROVIDED LABEL", "summary": "Multiple grain ingredients are present in the supplied ingredient list.", "evidence": [{"source": "ingredient list", "text": "Whole wheat flour"}, {"source": "ingredient list", "text": "oats"}, {"source": "ingredient list", "text": "ragi flour"}], "caveat": "Ingredient order indicates relative quantity, but exact grain percentages may be unavailable.", "confidence": "high"}]}, {"name": "surface_persuasion_gap", "output": []}, {"name": "resolve_dates", "output": {"packed_on": null, "best_before": null, "instruction": "Best before 6 months from packaging", "after_opening_instruction": null, "status": "Relative shelf-life found, but the starting date is missing"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}
+{"trace_id": "trace-pc-010", "case_id": "pc-010", "input": {"front_text": "HIGH PROTEIN", "back_text": "Ingredients: Chickpea flour, salt. Protein 9g."}, "steps": [{"name": "plan_investigation", "output": {"objective": "Audit front-of-pack claims against evidence printed on the same packet.", "steps": [{"tool": "inspect_nutrition", "reason": "Required to audit the front claim: High Protein.", "status": "completed", "source": "fine-tuned router"}], "missing_evidence": ["A readable nutrition panel with its measurement basis"], "stop_reason": "Stopped with explicit missing-evidence requests.", "router_model": "build-small-hackathon/packetcourt-evidence-router"}}, {"name": "detect_front_claims", "output": ["High Protein"]}, {"name": "extract_back_evidence", "output": {"ingredients": ["Chickpea flour", "salt. Protein 9g"], "nutrition": {"basis": "unknown", "serving_size_g": null, "package_size_g": null, "protein_g": 9.0, "total_sugar_g": null, "added_sugar_g": null, "sodium_mg": null, "saturated_fat_g": null}}}, {"name": "calculate_whole_packet", "output": {"calculable": false, "multiplier": null, "protein_g": null, "total_sugar_g": null, "added_sugar_g": null, "sugar_teaspoons": null, "sodium_mg": null, "saturated_fat_g": null, "explanation": "Package size and nutrition basis are required."}}, {"name": "audit_claims", "output": [{"claim": "High Protein", "verdict": "CANNOT VERIFY", "summary": "Protein is listed, but its measurement basis could not be determined.", "evidence": [{"source": "nutrition panel", "text": "Protein 9g"}], "caveat": "", "confidence": "low"}]}, {"name": "surface_persuasion_gap", "output": []}, {"name": "resolve_dates", "output": {"packed_on": null, "best_before": null, "instruction": null, "after_opening_instruction": null, "status": "No resolvable best-before date found"}}], "limitations": ["PacketCourt audits only the text and images supplied by the user.", "Verdicts are evidence summaries, not legal, medical, or food-safety determinations.", "Users should verify low-confidence OCR against the physical packet."]}