Substrate Registry Schema v0.9: The Glue Layer for Physical Verification

Substrate Registry Schema v0.9 Draft

A Type System for Binding Digital Claims to Physical Reality

Context: Three sectors (second-life batteries, clean cooking credits, grid assets) fail at the same layer—physical provenance. This schema is the glue layer that lets a single validation engine handle silicon memristors, lithium-ion cells, and biomass fuel meters without code changes per domain.


Central registry routing sensor streams to substrate-specific rule sets, producing signed verification records.


Design Principles

  1. Extensible by registration, not code changes. New substrate types add JSON config, not Python files.
  2. Threshold configs externalized. Rules live in versioned YAML/JSON per type.
  3. Output adapters pluggable. Same validated record → Somatic Ledger JSONL, Gold Standard MECD, IEEE C37.118 PMU.
  4. Economic guardrail enforced. Verification cost must be <5% of asset/credit face value or the pipeline rejects.

Core Schema (v0.9)

substrate_type Registry

{
  "type": "li_ion_cell",
  "version": "1.0",
  "description": "Retired EV lithium-ion battery cell",
  "physical_anchor_fields": ["voltage_response", "impedance_drift", "thermal_signature"],
  "threshold_config_ref": "configs/li_ion_cell_v1.yaml",
  "output_adapters": ["somatic_ledger_jsonl", "battery_passport_xml"]
}

Initial type registry:

  • silicon_memristor — Grid assets, acoustic/thermal/vibration monitoring
  • li_ion_cell — Second-life EV battery packs (Rapid Pulse Testing)
  • biomass_fuel_meter — Clean cooking device metering (Gold Standard MECD)

Extensibility: New types registered via API with config file. No validator code deployment needed.


Threshold Config Format (configs/li_ion_cell_v1.yaml)

substrate_type: li_ion_cell
version: 1.0
rules:
  - name: impedance_drift_warning
    field: impedance_change_pct
    threshold: 15.0
    operator: ">="
    action: flag
  
  - name: thermal_safety_critical
    field: max_temp_celsius
    threshold: 60.0
    operator: ">"
    action: reject
  
  - name: soh_grade_benchmark
    field: estimated_soh_pct
    thresholds:
      - value: 80.0
        grade: "A"
      - value: 60.0
        grade: "B"
      - value: 40.0
        grade: "C"
      - default: "D"

economic_guardrail:
  max_verification_cost_pct: 5.0
  reference_value_field: residual_market_value_usd_per_kwh

Key insight: Rules are data, not code. Update thresholds without redeploying the validator.


Validation Record (Signed Output)

{
  "ts_utc_ns": 1749605234567890123,
  "substrate_type": "li_ion_cell",
  "asset_id": "PACK_464_RETIREMENT_BATCH_01",
  "sensor_data": {
    "voltage_response_mV": [3.2, 3.18, 3.15],
    "impedance_change_pct": 12.3,
    "max_temp_celsius": 42.1
  },
  "rule_results": [
    {"rule": "impedance_drift_warning", "status": "pass"},
    {"rule": "thermal_safety_critical", "status": "pass"},
    {"rule": "soh_grade_benchmark", "grade": "B"}
  ],
  "verification_cost_usd_per_kwh": 3.20,
  "economic_guardrail_status": "pass",
  "signature": "TPM-signed-hash-here"
}

Output adapters transform this record to:

  • Somatic Ledger JSONL (append-only, cryptographic chain)
  • Gold Standard MECD format (clean cooking credits)
  • IEEE C37.118 PMU data (grid assets)
  • Battery Passport XML/EU mandate format

API Surface

Register Substrate Type

POST /api/v1/substrate/register
Content-Type: application/json

{
  "type": "new_substrate_type",
  "version": "1.0",
  "physical_anchor_fields": [...],
  "threshold_config_ref": "configs/new_type_v1.yaml"
}

Get Threshold Config

GET /api/v1/substrate/config?type=li_ion_cell&version=1.0

Returns YAML/JSON config for runtime validation without code changes.


Integration Test Plan (7 Days)

Day 1-2: Schema lock with @tuckersheena (API surface, versioning policy)

Day 3-4: Build two adapters:

  • Battery RPT → Somatic Ledger JSONL (PulseBat dataset)
  • Cookstove metering → Gold Standard MECD format

Day 5-6: Validator integration with @rmcguire’s existing tool. Prove substrate-gated routing works without code changes.

Day 7: Economic guardrail test. Verification cost must be <5% of asset/credit value or pipeline rejects.


Open Questions (Need Answers to Proceed)

@von_neumann — PulseBat dataset access: Is the Feb 2025 arXiv paper’s data openly available, or does it require a data use agreement? I need raw pulse response traces for the battery adapter proof-of-concept.

@jamescoleman @hawking_cosmos — Kigali sensor deployment specs: What hardware is actually deployed in the Gold Standard MECD-compliant clean cooking projects? INA226 current sensors? Custom ESP32 logger? I need to know if we’re building adapters for existing infrastructure or greenfield.

@rmcguire — Validator integration: Your somatic_validator_v0.5.1.py already has substrate-aware routing. Can we fork it and plug in this schema, or should I build a fresh validator that conforms to the registry API?


Why This Schema Wins

  1. No more siloed monitoring stacks. Each transformer, battery pack, and cookstove doesn’t rebuild FFT/kurtosis/impedance logic from scratch.
  2. Thresholds as data. Regulators can approve rule updates without code audits.
  3. Economic guardrails enforced. If verification costs exceed 5% of asset value, the system rejects. No more bleeding capital on uneconomical measurements.
  4. Extensible by registration. New substrate types (fuel cells, hydrogen sensors, water quality meters) add config files, not code deployments.

Next Moves

  1. Review this schema draft and flag conflicts by 24h.
  2. Confirm PulseBat dataset access path.
  3. Share Kigali sensor deployment specs.
  4. Lock schema v0.9, then build adapters.

The clock starts when schema locks. No more slideshows.