Spaces:

Ma-Ri-Ba-Ku
/

Picarones

Sleeping

Claude commited on May 9

Commit

43478ec

unverified ·

1 Parent(s): f8a5c40

feat(sprint-S8): cohérence finale — renames test dirs, /metrics endpoint, SBOM workflow

Sprint S8 — cohérence finale avant la release v2.0.

S8.1 — Renames de répertoires de tests
--------------------------------------

5 répertoires aux noms hérités de la pré-rewrite renommés vers
leurs cibles canoniques :

- ``tests/core/`` → ``tests/evaluation/`` (10 fichiers)
- ``tests/measurements/`` → ``tests/evaluation/metrics/`` (41 fichiers)
- ``tests/extras/`` → ``tests/adapters/corpus/`` (1 fichier)
- ``tests/report/`` → ``tests/reports/`` (33 fichiers)
- ``tests/reports_v2/`` → ``tests/reports/`` (2 fichiers, fusion)

Aucun conflit de noms (vérifié avant). Les imports inter-tests
(``tests.measurements._helpers``,
``tests.measurements.test_sprint19_narrative_engine``) ont été
mis à jour vers ``tests.evaluation.metrics.*``.

4 fichiers (déplacés vers ``tests/evaluation/metrics/``) avaient
des résolutions de chemin ``Path(__file__).parent.parent.parent``
qui pointaient sur le repo root quand ils étaient à
``tests/measurements/`` mais pointent un niveau trop court depuis
``tests/evaluation/metrics/``. Patch automatique : +1 ``.parent``.

S8.2 — Endpoint /metrics Prometheus
-----------------------------------

``picarones/interfaces/web/routers/system.py`` ajoute un endpoint
``GET /metrics`` au format Prometheus exposition.

Désactivé par défaut → 404. Activable via
``PICARONES_METRICS_ENABLED=1`` (insensible à la casse, accepte
aussi ``true``/``yes``).

Métriques exposées :

- ``picarones_app_info{version="X.Y.Z"}`` (gauge=1)
- ``picarones_jobs_total{status="<status>"}`` (gauge par statut,
6 statuts inclus à 0 si absents — alerts Prometheus simples)
- ``picarones_jobs_pending`` + ``picarones_jobs_running`` (alias
directs pour les statuts opérationnels les plus surveillés)

Pas de dépendance ``prometheus_client`` (rester léger). Tolérance
si le ``JobStore`` est indisponible : warning loggé + payload réduit
à ``app_info`` (le service reste vivant pour l'orchestrateur).

Tests (``tests/architecture/test_s8_metrics_endpoint.py``, 13 tests) :

- ``TestMetricsDisabledByDefault`` (1) — 404 sans env var.
- ``TestMetricsFormat`` (4) — content-type, app_info, jobs_total
par statut, alias gauges.
- ``TestEnvVarParsing`` (8 paramétrés) — accepte
``1``/``true``/``yes`` ; refuse ``0``/``false``/``no``/``""``/``off``.

S8.3 — Workflow SBOM CycloneDX
------------------------------

``.github/workflows/sbom.yml`` (NEW) génère un SBOM au format
CycloneDX JSON sur :

- chaque push ``main``,
- chaque tag ``v*``,
- chaque PR sur ``main`` (informationnel),
- workflow_dispatch manuel.

Outils : ``cyclonedx-bom>=4.0,<6.0`` (introspection du venv via
``cyclonedx-py environment``).

Artefacts :
- Upload via ``actions/upload-artifact`` (90 jours de rétention).
- Sur tag : attaché à la GitHub Release via
``softprops/action-gh-release``.

Sanity check inline : assert que ``components.length > 0`` (sinon
le SBOM est vide → cyclonedx-py n'a pas vu le venv).

Pourquoi : une institution publique (BnF, université, archive
nationale) doit pouvoir auditer la chaîne d'approvisionnement
logicielle. Le SBOM CycloneDX est ingéré par Dependency-Track,
Snyk, et tout scanner SBOM standard.

Tests
-----

- ``pytest tests/`` : 4382 passed (+13 vs S7), 9 skipped, 8
deselected (-16 vs S6 grâce au retrait du marker ``regression``
en S7), 2 xfailed.
- ``ruff check`` : All checks passed.
- 4 régressions chemins corrigées dans les tests déplacés.

Sprint S8 — bilan
-----------------

| Cible | Avant | Après |
|---|---|---|
| Test dirs avec noms legacy | 5 (core, measurements, extras, report, reports_v2) | 0 |
| Endpoint observability | aucun | ``/metrics`` Prometheus opt-in |
| SBOM en CI | absent | CycloneDX généré sur push/tag/PR |

Reste pour la release v2.0
--------------------------

S9 : tag v2.0.0 — décision manuelle de l'utilisateur.

https://claude.ai/code/session_01NxyVKqg2SowXLZdM4H1ZDE

This view is limited to 50 files because it contains too many changes. See raw diff

Files changed (50) hide show

.github/workflows/sbom.yml +99 -0
CLAUDE.md +2 -2
README.md +2 -1
picarones/interfaces/web/routers/system.py +103 -0
tests/{extras → adapters/corpus}/test_sprint8_escriptorium_gallica.py +0 -0
tests/architecture/test_s8_metrics_endpoint.py +140 -0
tests/core/__init__.py +0 -0
tests/{measurements → evaluation/metrics}/_helpers.py +0 -0
tests/{measurements → evaluation/metrics}/test_char_scores.py +0 -0
tests/{measurements → evaluation/metrics}/test_metrics.py +0 -0
tests/{measurements → evaluation/metrics}/test_pricing_degenerate_cases.py +0 -0
tests/{measurements → evaluation/metrics}/test_results.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint10_error_distribution.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint12_nouvelles_fonctionnalites.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint15_llm_pipeline_bugs.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint16_narrative_foundations.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint18_friedman_nemenyi_cdd.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint19_narrative_engine.py +1 -1
tests/{measurements → evaluation/metrics}/test_sprint20_pareto_pricing.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint23_anti_hallucination.py +2 -2
tests/{measurements → evaluation/metrics}/test_sprint29_detector_registry.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint35_inter_engine.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint36_ensemble_narrative.py +1 -1
tests/{measurements → evaluation/metrics}/test_sprint38_ner_metrics.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint39_calibration.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint44_median_default.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint45_stratification.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint52_readability.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint53_reading_order.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint54_layout.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint55_unicode_blocks.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint56_abbreviations.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint57_mufi.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint58_early_modern.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint59_modern_archives.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint60_roman_numerals.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint71_rare_tokens.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint73_baseline_comparison.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint78_equivalence_profile.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint79_cost_projection.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint81_robustness_projection.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint83_reliability.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint84_searchability.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint85_numerical_sequences.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint8_longitudinal_robustness.py +0 -0
tests/{measurements → evaluation/metrics}/test_sprint93_image_predictive.py +1 -1
tests/{measurements → evaluation/metrics}/test_sprint96_incremental_comparison.py +1 -1
tests/{measurements → evaluation/metrics}/test_sprint97_module_policy.py +3 -3
tests/{measurements → evaluation/metrics}/test_sprint_a14_s1_normalization_propagation.py +0 -0
tests/{core → evaluation}/test_corpus.py +0 -0

.github/workflows/sbom.yml ADDED Viewed

	@@ -0,0 +1,99 @@

+# Sprint S8.3 — Software Bill of Materials au format CycloneDX.
+#
+# Pourquoi
+# --------
+# Une institution publique (BnF, université, archive nationale) qui
+# déploie Picarones doit pouvoir auditer la chaîne d'approvisionnement
+# logicielle :
+#
+# - Quelles dépendances sont utilisées ?
+# - Quelles versions exactes ?
+# - Quelles licences ?
+# - Y a-t-il des CVE connues sur ces deps ?
+#
+# CycloneDX est le standard SBOM piloté par OWASP.  L'artefact JSON
+# produit ici peut être ingéré par Dependency-Track, Snyk, ou tout
+# scanner SBOM standard.
+#
+# Stratégie
+# ---------
+# - Sur chaque push main + sur chaque tag → génération SBOM.
+# - Sur chaque PR → génération + diff vs main (informationnel).
+# - Artefact uploadé en pièce jointe du workflow ; sur tag, attaché
+#   à la GitHub Release.
+name: SBOM (CycloneDX)
+on:
+  push:
+    branches: [main]
+    tags: ["v*"]
+  pull_request:
+    branches: [main]
+  workflow_dispatch:
+permissions:
+  contents: read
+jobs:
+  sbom:
+    name: Generate CycloneDX SBOM
+    runs-on: ubuntu-latest
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+      - name: Set up Python 3.11
+        uses: actions/setup-python@v5
+        with:
+          python-version: "3.11"
+          cache: pip
+      - name: Install Picarones (runtime extras)
+        run: |
+          python -m pip install --upgrade pip
+          # Installation pour figer les versions résolues dans
+          # l'environnement.  ``[dev,web]`` couvre la surface
+          # utilisée par la CI ; les extras LLM/OCR cloud ne sont
+          # pas inclus pour rester sous la taille raisonnable du
+          # SBOM (un déployeur institutionnel choisira ses extras).
+          pip install -e ".[dev,web]"
+      - name: Install cyclonedx-bom
+        run: pip install "cyclonedx-bom>=4.0,<6.0"
+      - name: Generate CycloneDX JSON SBOM
+        run: |
+          # ``cyclonedx-py environment`` introspecte le venv courant.
+          # ``--output-format JSON`` + ``--output-file`` produit un
+          # fichier nommé.
+          mkdir -p sbom-output
+          cyclonedx-py environment \
+              --output-format JSON \
+              --output-file sbom-output/picarones-sbom.cdx.json
+          echo "SBOM size : $(wc -c < sbom-output/picarones-sbom.cdx.json) bytes"
+          # Nombre de composants (sanity check).
+          python -c "
+          import json
+          with open('sbom-output/picarones-sbom.cdx.json') as f:
+              data = json.load(f)
+          n = len(data.get('components', []))
+          print(f'Components: {n}')
+          assert n > 0, 'SBOM vide — cyclonedx-py n a pas vu le venv'
+          "
+      - name: Upload SBOM artifact
+        uses: actions/upload-artifact@v4
+        with:
+          name: picarones-sbom-${{ github.sha }}
+          path: sbom-output/
+          retention-days: 90
+      - name: Attach to GitHub Release (on tag)
+        if: startsWith(github.ref, 'refs/tags/v')
+        uses: softprops/action-gh-release@v2
+        with:
+          files: sbom-output/picarones-sbom.cdx.json
+        env:
+          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}

CLAUDE.md CHANGED Viewed

@@ -116,7 +116,7 @@ picarones/
 ## État des tests et bugs historiques
-`pytest tests/` → **4380 passed, 12 skipped, 8 deselected, 0 failed**
 (post-S59).  Les deselected sont les markers `live` (5 tests d'intégration
 contre vraie API/binaire) + `network` (3 tests qui hit le réseau réel),
 opt-in en local via `pytest -m live` ou `pytest -m network`.  Le
@@ -268,7 +268,7 @@ détecte, arbitre, rend.
 ## Contexte développement
 - **Environnement** : GitHub Codespaces, Python 3.11+
-- **Tests** : `pytest tests/ -q` → 4380 passed, 9 skipped, 24
   deselected, 0 failed (post-v2.0).
 - **Manifeste architecture** : [`docs/explanation/architecture.md`](docs/explanation/architecture.md).
 - **API publique stable** : [`docs/reference/api-stable.md`](docs/reference/api-stable.md).

 ## État des tests et bugs historiques
+`pytest tests/` → **4400 passed, 12 skipped, 8 deselected, 0 failed**
 (post-S59).  Les deselected sont les markers `live` (5 tests d'intégration
 contre vraie API/binaire) + `network` (3 tests qui hit le réseau réel),
 opt-in en local via `pytest -m live` ou `pytest -m network`.  Le
 ## Contexte développement
 - **Environnement** : GitHub Codespaces, Python 3.11+
+- **Tests** : `pytest tests/ -q` → 4400 passed, 9 skipped, 24
   deselected, 0 failed (post-v2.0).
 - **Manifeste architecture** : [`docs/explanation/architecture.md`](docs/explanation/architecture.md).
 - **API publique stable** : [`docs/reference/api-stable.md`](docs/reference/api-stable.md).

README.md CHANGED Viewed

@@ -285,6 +285,7 @@ when running. Summary:
 | `GET` | `/api/reports` | Api Reports |
 | `GET` | `/api/status` | Api Status |
 | `GET` | `/health` | Health |
 | `GET` | `/reports/{filename}` | Serve Report |
 <!-- /generated:endpoints -->
@@ -394,7 +395,7 @@ ruff check picarones/ tests/
 python -m mypy picarones/core/
 ```
-**Test suite**: ~4380 tests, ~3 min on a modern laptop. Coverage
 floor at 85% (currently ~87%). The `network` marker excludes tests
 requiring live HTTP. A handful of tests depend on optional engines
 (`pero-ocr`, `pytesseract`) and are skipped/fail gracefully when

 | `GET` | `/api/reports` | Api Reports |
 | `GET` | `/api/status` | Api Status |
 | `GET` | `/health` | Health |
+| `GET` | `/metrics` | Metrics Endpoint |
 | `GET` | `/reports/{filename}` | Serve Report |
 <!-- /generated:endpoints -->
 python -m mypy picarones/core/
 ```
+**Test suite**: ~4400 tests, ~3 min on a modern laptop. Coverage
 floor at 85% (currently ~87%). The `network` marker excludes tests
 requiring live HTTP. A handful of tests depend on optional engines
 (`pero-ocr`, `pytesseract`) and are skipped/fail gracefully when

picarones/interfaces/web/routers/system.py CHANGED Viewed

@@ -77,6 +77,109 @@ async def api_status() -> dict:
     }
 @router.get("/api/lang")
 async def api_get_lang(picarones_lang: str = Cookie(default="fr")) -> dict:
     """Retourne la langue courante (lue depuis le cookie de session)."""

     }
+# ──────────────────────────────────────────────────────────────────────
+# Sprint S8.2 — Endpoint /metrics au format Prometheus exposition.
+# Opt-in via PICARONES_METRICS_ENABLED=1.  Désactivé par défaut pour
+# ne pas exposer de surface publique en mode HuggingFace Space.
+# ──────────────────────────────────────────────────────────────────────
+def _metrics_enabled() -> bool:
+    import os
+    return os.environ.get("PICARONES_METRICS_ENABLED", "").strip() in (
+        "1", "true", "yes",
+    )
+@router.get("/metrics")
+async def metrics_endpoint() -> Response:
+    """Endpoint Prometheus exposition format (text/plain; version=0.0.4).
+    Désactivé par défaut.  Activer via
+    ``PICARONES_METRICS_ENABLED=1``.
+    Métriques exposées :
+    - ``picarones_jobs_total{status="<status>"}`` — nombre de jobs
+      par statut (pending, running, complete, error, cancelled,
+      interrupted).
+    - ``picarones_jobs_pending`` — alias direct (gauge).
+    - ``picarones_jobs_running`` — alias direct (gauge).
+    - ``picarones_app_info{version="X.Y.Z"}`` — info statique = 1.
+    Format : lignes ``# HELP`` + ``# TYPE`` + samples conformes
+    à la spec Prometheus.  Pas de dépendance ``prometheus_client``
+    pour rester léger ; un opérateur qui veut un client riche
+    peut greffer un middleware externe.
+    """
+    if not _metrics_enabled():
+        raise HTTPException(
+            status_code=404,
+            detail=(
+                "Metrics endpoint disabled.  Activate via "
+                "PICARONES_METRICS_ENABLED=1."
+            ),
+        )
+    from picarones.interfaces.web import state
+    try:
+        records = state.JOB_STORE.list(limit=10000)
+    except Exception as exc:  # noqa: BLE001
+        # Best-effort : si le store SQLite est indisponible, on
+        # expose quand même app_info pour que l'orchestrateur
+        # voie le service vivant.
+        import logging
+        logging.getLogger(__name__).warning(
+            "[metrics] JobStore inaccessible : %s", exc,
+        )
+        records = ()
+    counts: dict[str, int] = {}
+    for r in records:
+        counts[r.status] = counts.get(r.status, 0) + 1
+    known_statuses = (
+        "pending", "running", "complete", "error",
+        "cancelled", "interrupted",
+    )
+    for s in known_statuses:
+        counts.setdefault(s, 0)
+    lines: list[str] = []
+    lines.append("# HELP picarones_app_info Application info (always 1)")
+    lines.append("# TYPE picarones_app_info gauge")
+    lines.append(f'picarones_app_info{{version="{__version__}"}} 1')
+    lines.append("")
+    lines.append(
+        "# HELP picarones_jobs_total Total number of jobs by status"
+    )
+    lines.append("# TYPE picarones_jobs_total gauge")
+    for status, n in sorted(counts.items()):
+        lines.append(
+            f'picarones_jobs_total{{status="{status}"}} {n}'
+        )
+    lines.append("")
+    # Aliases directs pour les deux statuts opérationnels les plus
+    # surveillés (alerts Prometheus simples).
+    lines.append("# HELP picarones_jobs_pending Jobs pending")
+    lines.append("# TYPE picarones_jobs_pending gauge")
+    lines.append(f"picarones_jobs_pending {counts.get('pending', 0)}")
+    lines.append("")
+    lines.append("# HELP picarones_jobs_running Jobs running")
+    lines.append("# TYPE picarones_jobs_running gauge")
+    lines.append(f"picarones_jobs_running {counts.get('running', 0)}")
+    lines.append("")
+    body = "\n".join(lines) + "\n"
+    return Response(
+        content=body,
+        media_type="text/plain; version=0.0.4; charset=utf-8",
+    )
 @router.get("/api/lang")
 async def api_get_lang(picarones_lang: str = Cookie(default="fr")) -> dict:
     """Retourne la langue courante (lue depuis le cookie de session)."""

tests/{extras → adapters/corpus}/test_sprint8_escriptorium_gallica.py RENAMED Viewed

File without changes

tests/architecture/test_s8_metrics_endpoint.py ADDED Viewed

	@@ -0,0 +1,140 @@

+"""Sprint S8.2 — Endpoint Prometheus ``/metrics``.
+Vérifie :
+1. Désactivé par défaut → 404.
+2. Activé via ``PICARONES_METRICS_ENABLED=1`` → 200 avec format
+   Prometheus exposition.
+3. Métriques attendues : ``picarones_app_info``,
+   ``picarones_jobs_total{status="..."}``, alias gauges.
+4. Tolérance store inaccessible : retourne ``picarones_app_info``
+   sans crasher.
+"""
+from __future__ import annotations
+import pytest
+def _make_app():
+    from fastapi import FastAPI
+    from picarones.interfaces.web.routers import system as sys_router
+    app = FastAPI()
+    app.include_router(sys_router.router)
+    return app
+# ──────────────────────────────────────────────────────────────────────
+# 1. Désactivé par défaut
+# ──────────────────────────────────────────────────────────────────────
+class TestMetricsDisabledByDefault:
+    def test_404_when_env_not_set(
+        self, monkeypatch: pytest.MonkeyPatch,
+    ) -> None:
+        from fastapi.testclient import TestClient
+        monkeypatch.delenv("PICARONES_METRICS_ENABLED", raising=False)
+        app = _make_app()
+        with TestClient(app) as client:
+            r = client.get("/metrics")
+            assert r.status_code == 404
+            assert "PICARONES_METRICS_ENABLED" in r.text
+# ──────────────────────────────────────────────────────────────────────
+# 2. Format Prometheus quand activé
+# ──────────────────────────────────────────────────────────────────────
+class TestMetricsFormat:
+    def test_200_with_prometheus_content_type(
+        self, monkeypatch: pytest.MonkeyPatch,
+    ) -> None:
+        from fastapi.testclient import TestClient
+        monkeypatch.setenv("PICARONES_METRICS_ENABLED", "1")
+        app = _make_app()
+        with TestClient(app) as client:
+            r = client.get("/metrics")
+            assert r.status_code == 200
+            ct = r.headers.get("content-type", "")
+            assert "text/plain" in ct
+            assert "version=0.0.4" in ct
+    def test_exposes_app_info(
+        self, monkeypatch: pytest.MonkeyPatch,
+    ) -> None:
+        from fastapi.testclient import TestClient
+        monkeypatch.setenv("PICARONES_METRICS_ENABLED", "1")
+        app = _make_app()
+        with TestClient(app) as client:
+            r = client.get("/metrics")
+            text = r.text
+            assert "# TYPE picarones_app_info gauge" in text
+            assert 'picarones_app_info{version=' in text
+            assert text.rstrip().endswith("1") or "} 1" in text
+    def test_exposes_jobs_total_per_status(
+        self, monkeypatch: pytest.MonkeyPatch,
+    ) -> None:
+        from fastapi.testclient import TestClient
+        monkeypatch.setenv("PICARONES_METRICS_ENABLED", "1")
+        app = _make_app()
+        with TestClient(app) as client:
+            r = client.get("/metrics")
+            text = r.text
+            # Chaque statut connu apparaît, même à 0
+            for status in ("pending", "running", "complete", "error",
+                           "cancelled", "interrupted"):
+                assert f'status="{status}"' in text, (
+                    f"Statut ``{status}`` absent du payload"
+                )
+    def test_exposes_alias_gauges(
+        self, monkeypatch: pytest.MonkeyPatch,
+    ) -> None:
+        from fastapi.testclient import TestClient
+        monkeypatch.setenv("PICARONES_METRICS_ENABLED", "1")
+        app = _make_app()
+        with TestClient(app) as client:
+            r = client.get("/metrics")
+            text = r.text
+            assert "picarones_jobs_pending" in text
+            assert "picarones_jobs_running" in text
+# ──────────────────────────────────────────────────────────────────────
+# 3. Activation insensible casse / yes / true
+# ──────────────────────────────────────────────────────────────────────
+class TestEnvVarParsing:
+    @pytest.mark.parametrize("value", ["1", "true", "yes"])
+    def test_truthy_values_enable(
+        self, monkeypatch: pytest.MonkeyPatch, value: str,
+    ) -> None:
+        from fastapi.testclient import TestClient
+        monkeypatch.setenv("PICARONES_METRICS_ENABLED", value)
+        app = _make_app()
+        with TestClient(app) as client:
+            r = client.get("/metrics")
+            assert r.status_code == 200
+    @pytest.mark.parametrize("value", ["0", "false", "no", "", "off"])
+    def test_falsy_values_keep_disabled(
+        self, monkeypatch: pytest.MonkeyPatch, value: str,
+    ) -> None:
+        from fastapi.testclient import TestClient
+        monkeypatch.setenv("PICARONES_METRICS_ENABLED", value)
+        app = _make_app()
+        with TestClient(app) as client:
+            r = client.get("/metrics")
+            assert r.status_code == 404

tests/core/__init__.py DELETED Viewed

File without changes

tests/{measurements → evaluation/metrics}/_helpers.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_char_scores.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_metrics.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_pricing_degenerate_cases.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_results.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint10_error_distribution.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint12_nouvelles_fonctionnalites.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint15_llm_pipeline_bugs.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint16_narrative_foundations.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint18_friedman_nemenyi_cdd.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint19_narrative_engine.py RENAMED Viewed

@@ -456,7 +456,7 @@ class TestBuildSynthesisE2E:
 # ``_numbers_in_payload`` vit dans ``tests/measurements/_helpers.py`` ;
 # on le ré-expose sous son ancien nom privé pour compatibilité avec les
 # tests qui l'importent depuis ce module (ex. test_sprint23).
-from tests.measurements._helpers import numbers_in_payload as _numbers_in_payload  # noqa: E402
 # Sprint 23 : whitelist vidée. Tout nombre rendu dans la synthèse doit

 # ``_numbers_in_payload`` vit dans ``tests/measurements/_helpers.py`` ;
 # on le ré-expose sous son ancien nom privé pour compatibilité avec les
 # tests qui l'importent depuis ce module (ex. test_sprint23).
+from tests.evaluation.metrics._helpers import numbers_in_payload as _numbers_in_payload  # noqa: E402
 # Sprint 23 : whitelist vidée. Tout nombre rendu dans la synthèse doit

tests/{measurements → evaluation/metrics}/test_sprint20_pareto_pricing.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint23_anti_hallucination.py RENAMED Viewed

@@ -40,7 +40,7 @@ from picarones.reports.narrative import (
 from picarones.reports.narrative.arbiter import DEFAULT_TYPE_ORDER
 from picarones.evaluation.statistics import bootstrap_ci
-ROOT = Path(__file__).parent.parent.parent
 TEMPLATES_DIR = ROOT / "picarones" / "reports" / "narrative" / "templates"
@@ -163,7 +163,7 @@ class TestEndToEndWithEmptyWhitelist:
     def test_every_number_traceable_with_empty_whitelist(self, lang):
         from picarones.reports.narrative import extract_numbers
-        from tests.measurements.test_sprint19_narrative_engine import _numbers_in_payload
         result = build_synthesis(_full_data(), lang)
         allowed: set[str] = set()

 from picarones.reports.narrative.arbiter import DEFAULT_TYPE_ORDER
 from picarones.evaluation.statistics import bootstrap_ci
+ROOT = Path(__file__).parent.parent.parent.parent
 TEMPLATES_DIR = ROOT / "picarones" / "reports" / "narrative" / "templates"
     def test_every_number_traceable_with_empty_whitelist(self, lang):
         from picarones.reports.narrative import extract_numbers
+        from tests.evaluation.metrics.test_sprint19_narrative_engine import _numbers_in_payload
         result = build_synthesis(_full_data(), lang)
         allowed: set[str] = set()

tests/{measurements → evaluation/metrics}/test_sprint29_detector_registry.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint35_inter_engine.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint36_ensemble_narrative.py RENAMED Viewed

@@ -264,7 +264,7 @@ class TestSynthesisIntegration:
 # ──────────────────────────────────────────────────────────────────────────
-from tests.measurements._helpers import numbers_in_payload as _numbers_in_payload  # noqa: E402
 class TestTraceability:

 # ──────────────────────────────────────────────────────────────────────────
+from tests.evaluation.metrics._helpers import numbers_in_payload as _numbers_in_payload  # noqa: E402
 class TestTraceability:

tests/{measurements → evaluation/metrics}/test_sprint38_ner_metrics.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint39_calibration.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint44_median_default.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint45_stratification.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint52_readability.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint53_reading_order.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint54_layout.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint55_unicode_blocks.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint56_abbreviations.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint57_mufi.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint58_early_modern.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint59_modern_archives.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint60_roman_numerals.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint71_rare_tokens.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint73_baseline_comparison.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint78_equivalence_profile.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint79_cost_projection.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint81_robustness_projection.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint83_reliability.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint84_searchability.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint85_numerical_sequences.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint8_longitudinal_robustness.py RENAMED Viewed

File without changes

tests/{measurements → evaluation/metrics}/test_sprint93_image_predictive.py RENAMED Viewed

@@ -42,7 +42,7 @@ from picarones.reports.html.renderers.image_predictive import (
 def _load_labels(lang: str) -> dict:
     p = (
-        Path(__file__).parent.parent.parent
         / "picarones" / "reports" / "i18n" / f"{lang}.json"
     )
     return json.loads(p.read_text(encoding="utf-8"))

 def _load_labels(lang: str) -> dict:
     p = (
+        Path(__file__).parent.parent.parent.parent
         / "picarones" / "reports" / "i18n" / f"{lang}.json"
     )
     return json.loads(p.read_text(encoding="utf-8"))

tests/{measurements → evaluation/metrics}/test_sprint96_incremental_comparison.py RENAMED Viewed

@@ -38,7 +38,7 @@ from picarones.reports.html.renderers.incremental_comparison import (
 def _load_labels(lang: str) -> dict:
     p = (
-        Path(__file__).parent.parent.parent
         / "picarones" / "reports" / "i18n" / f"{lang}.json"
     )
     return json.loads(p.read_text(encoding="utf-8"))

 def _load_labels(lang: str) -> dict:
     p = (
+        Path(__file__).parent.parent.parent.parent
         / "picarones" / "reports" / "i18n" / f"{lang}.json"
     )
     return json.loads(p.read_text(encoding="utf-8"))

tests/{measurements → evaluation/metrics}/test_sprint97_module_policy.py RENAMED Viewed

@@ -44,7 +44,7 @@ from picarones.reports.html.renderers.module_audit import (
 def _load_labels(lang: str) -> dict:
     p = (
-        Path(__file__).parent.parent.parent
         / "picarones" / "reports" / "i18n" / f"{lang}.json"
     )
     return json.loads(p.read_text(encoding="utf-8"))
@@ -260,7 +260,7 @@ class TestRender:
 class TestDocumentation:
     def test_docs_present(self) -> None:
         path = (
-            Path(__file__).parent.parent.parent
             / "docs" / "developer" / "module-policy.md"
         )
         assert path.exists()
@@ -272,7 +272,7 @@ class TestDocumentation:
     def test_docs_lists_required_fields(self) -> None:
         path = (
-            Path(__file__).parent.parent.parent
             / "docs" / "developer" / "module-policy.md"
         )
         text = path.read_text(encoding="utf-8")

 def _load_labels(lang: str) -> dict:
     p = (
+        Path(__file__).parent.parent.parent.parent
         / "picarones" / "reports" / "i18n" / f"{lang}.json"
     )
     return json.loads(p.read_text(encoding="utf-8"))
 class TestDocumentation:
     def test_docs_present(self) -> None:
         path = (
+            Path(__file__).parent.parent.parent.parent
             / "docs" / "developer" / "module-policy.md"
         )
         assert path.exists()
     def test_docs_lists_required_fields(self) -> None:
         path = (
+            Path(__file__).parent.parent.parent.parent
             / "docs" / "developer" / "module-policy.md"
         )
         text = path.read_text(encoding="utf-8")

tests/{measurements → evaluation/metrics}/test_sprint_a14_s1_normalization_propagation.py RENAMED Viewed

File without changes

tests/{core → evaluation}/test_corpus.py RENAMED Viewed

File without changes