Specs Index — Meridian

48 specifications

Layer 1: Schema & Data Architecture

SPEC-001: Universal Node Schema — Two-Layer Model

Two-layer schema: 16 immutable core fields (Layer 1 — the Meridian protocol) + 20+ extensible fields (Layer 2 — personal only, null defaults, never emitted). Layer 1 never changes. Layer 2 grows without migration.

SPEC-002: Embedding Model Standardization

BGE-M3 (1024-dim) as Meridian standard. Rob must migrate 16,717 holons from 768-dim to 1024-dim embeddings. Recommend batch re-embed (~4h on CPU).

SPEC-003: Gravity Score Formula

If gravity = retrieval frequency → rich-get-richer. If gravity = validation count → "validation" undefined. If gravity = LLM-assigned → only as reliable as extraction. The entire compounding proposition depends on which definition is chosen.

SPEC-004: Confidence History / Belief Versioning

SQLite table: confidence_history (principle_id, old_score, new_score, timestamp, trigger, context). Principles dropping >0.3 from peak flagged for review. Storage: ~34K rows/year. Negligible.
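A minimal sketch of the confidence_history table and the review flag, using Python's built-in sqlite3 module. The column names and the 0.3 threshold come from the summary above; the column types, the helper names (open_history, record_change, flagged_for_review), and the choice to compute the peak over both old and new scores are assumptions made for illustration.

```python
import sqlite3

REVIEW_DROP = 0.3  # from the spec: flag principles that fall more than 0.3 below peak


def open_history(path=":memory:"):
    """Create the confidence_history table. Column types are assumed."""
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS confidence_history (
            principle_id TEXT NOT NULL,
            old_score    REAL NOT NULL,
            new_score    REAL NOT NULL,
            timestamp    TEXT NOT NULL,
            "trigger"    TEXT,  -- quoted because TRIGGER is an SQL keyword
            context      TEXT
        )
        """
    )
    return conn


def record_change(conn, principle_id, old_score, new_score, timestamp, trigger, context=""):
    """Append one score change; history rows are never updated in place."""
    conn.execute(
        "INSERT INTO confidence_history VALUES (?, ?, ?, ?, ?, ?)",
        (principle_id, old_score, new_score, timestamp, trigger, context),
    )
    conn.commit()


def flagged_for_review(conn, drop=REVIEW_DROP):
    """Return principle ids whose latest score is more than `drop` below their peak."""
    peak, latest = {}, {}
    rows = conn.execute(
        "SELECT principle_id, old_score, new_score "
        "FROM confidence_history ORDER BY timestamp, rowid"
    )
    for pid, old, new in rows:
        peak[pid] = max(peak.get(pid, old), old, new)
        latest[pid] = new  # rows arrive oldest first, so the last write wins
    return sorted(pid for pid in peak if peak[pid] - latest[pid] > drop)
```

At the estimated ~34K rows per year this is roughly 93 score changes per day, so a full scan in flagged_for_review should stay cheap for a long time.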

SPEC-005: Error Bank Schema

draft Owner: Rob

NODE_SCHEMA with node_type='error'. Fields: error_type, related_principle_id, context, outcome, severity. Embeds on error description for similarity search.

SPEC-006: Edge Threshold Calibration

Per-domain threshold. Current 0.85 validated on marketing/psychology. Health may need 0.82, finance may need 0.88. Test across all three founders' domain data.
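One way to express a per-domain threshold is a lookup table with a fallback. The sketch below assumes the threshold is applied to a similarity score with a greater-or-equal comparison; the health and finance values are the candidates mentioned above, not validated numbers, and the names DOMAIN_THRESHOLDS and should_link are hypothetical.

```python
DEFAULT_THRESHOLD = 0.85  # current value, validated on marketing/psychology

# Candidate overrides from the spec summary; to be confirmed by testing.
DOMAIN_THRESHOLDS = {
    "health": 0.82,
    "finance": 0.88,
}


def should_link(similarity, domain):
    """Return True if two nodes in `domain` are similar enough to get an edge."""
    threshold = DOMAIN_THRESHOLDS.get(domain, DEFAULT_THRESHOLD)
    return similarity >= threshold
```

With these values a similarity of 0.83 would create an edge in health but not in marketing, which is the kind of difference the cross-founder test is meant to confirm or reject.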

SPEC-038: Novelty Score

Whitepaper v5 Section 05.5, Risk Register CON 03.

SPEC-040: Multi-Label Collections

"Interest rate increases slow capital allocation to emerging markets" belongs in macro_economics, equity_analysis, market_structure, AND geopolitics simultaneously. Single-label fails.
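The many-to-many relationship above can be sketched as an index from collection label to record ids. This is an in-memory illustration only; the summary does not say how labels are stored, and the CollectionIndex class and its method names are hypothetical.

```python
from collections import defaultdict


class CollectionIndex:
    """Maps each collection label to the set of record ids it contains."""

    def __init__(self):
        self._members = defaultdict(set)

    def add(self, record_id, labels):
        # A record joins every collection it is labelled with.
        for label in labels:
            self._members[label].add(record_id)

    def in_collection(self, label):
        # Use .get so that querying an unknown label does not create it.
        return set(self._members.get(label, ()))

    def labels_of(self, record_id):
        return sorted(label for label, ids in self._members.items() if record_id in ids)
```

For the example principle, a single add call with the four labels makes it retrievable from each collection without duplicating the record.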

SPEC-041: Write Coordinator

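Ingestion writes: (1) vector to LanceDB, (2) metadata to SQLite, (3) edges to graph. If step 1 succeeds and step 2 fails, the result is a ghost record: a vector with no matching metadata. The three stores offer different guarantees: LanceDB is eventually consistent, SQLite is ACID, and the graph is in-memory.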
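One possible shape for a write coordinator is a sequence of steps with compensating undo actions. The sketch below is not the spec's design; it uses plain dictionaries in place of LanceDB, SQLite, and the graph, and every name in it (WriteCoordinator, demo_failed_metadata_write, the step functions) is hypothetical.

```python
class WriteCoordinator:
    """Run write steps in order; undo completed steps if a later one fails."""

    def __init__(self, steps):
        # steps: list of (name, write_fn, undo_fn) tuples, in execution order
        self.steps = steps

    def ingest(self, record):
        completed = []
        try:
            for _name, write, undo in self.steps:
                write(record)
                completed.append(undo)
        except Exception:
            # Compensate in reverse order so no partial record survives.
            for undo in reversed(completed):
                undo(record)
            raise


def demo_failed_metadata_write():
    """Simulate step 2 failing after step 1 succeeded."""
    vectors, edges = {}, {}

    def write_vector(r):
        vectors[r["id"]] = r["vector"]

    def undo_vector(r):
        vectors.pop(r["id"], None)

    def write_metadata(r):
        raise RuntimeError("simulated SQLite failure")

    def undo_metadata(r):
        pass  # nothing was written

    def write_edges(r):
        edges[r["id"]] = r["edges"]

    def undo_edges(r):
        edges.pop(r["id"], None)

    coordinator = WriteCoordinator([
        ("vector", write_vector, undo_vector),
        ("metadata", write_metadata, undo_metadata),
        ("edges", write_edges, undo_edges),
    ])
    try:
        coordinator.ingest({"id": "n1", "vector": [0.1, 0.2], "edges": []})
    except RuntimeError:
        pass
    return vectors, edges
```

In the demo the vector written in step 1 is removed again when step 2 raises, so no ghost record remains. A real coordinator would also need to handle an undo that itself fails, for example by logging the orphan for a later sweep.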

Layer 2: Agent Core

SPEC-007: Agent Loop Architecture

Clean agent loop: message → context assembly (SPINE + RAM + retrieved knowledge) → LLM call → tool execution → response → state update. Tools registered via config. Context assembly pluggable per domain.

SPEC-008: SPINE / RAM / Beliefs Hierarchy + SPINE Versioning

SPINE > Beliefs > RAM. SPINE overrides beliefs. Beliefs override RAM.
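The precedence rule can be illustrated with the standard library's ChainMap, which returns the value from the first mapping that contains a key. This assumes each layer can be flattened to key-value pairs, which the summary does not state; the resolve function and the example keys are hypothetical.

```python
from collections import ChainMap


def resolve(spine, beliefs, ram):
    """Layer the three stores so that SPINE wins over beliefs, and beliefs over RAM."""
    # ChainMap searches its mappings left to right and stops at the first hit.
    return ChainMap(spine, beliefs, ram)


spine = {"tone": "formal"}
beliefs = {"tone": "casual", "units": "metric"}
ram = {"units": "imperial", "topic": "pricing"}

view = resolve(spine, beliefs, ram)
```

Here view["tone"] resolves to the SPINE value, view["units"] to the belief, and view["topic"] falls through to RAM because no higher layer defines it.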

SPEC-009: Model Abstraction Layer

Unified interface: call(messages, model_key, tools=None). Provider by config. Cascade logic optional. Will v4 adds: router.py as single entry point with decompose() for subtask routing. Task types: EXTRACTION(8B), REASONING(30B+), SYNTHESIS(30B+), FO
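A minimal sketch of the unified call(messages, model_key, tools=None) interface with the provider chosen by config. Only the call signature comes from the summary; the ModelRouter class, the config layout, the provider callable signature, and the echo provider are assumptions for illustration, and cascade logic and decompose() are left out.

```python
from typing import Callable, Dict, List, Optional

# A provider takes (messages, model_name, tools) and returns the reply text.
Provider = Callable[[List[dict], str, Optional[List[dict]]], str]


class ModelRouter:
    def __init__(self, config: Dict[str, dict], providers: Dict[str, Provider]):
        self.config = config        # model_key -> {"provider": ..., "model": ...}
        self.providers = providers  # provider name -> callable

    def call(self, messages: List[dict], model_key: str,
             tools: Optional[List[dict]] = None) -> str:
        if model_key not in self.config:
            raise KeyError(f"unknown model_key: {model_key}")
        entry = self.config[model_key]
        provider = self.providers[entry["provider"]]
        return provider(messages, entry["model"], tools)


def echo_provider(messages, model, tools):
    """Stand-in provider that echoes the last message; no model is invoked."""
    return f"[{model}] {messages[-1]['content']}"
```

Swapping a model for a task then means editing one config entry; callers keep passing the same model_key.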

SPEC-010: Inter-Agent Message Bus

SQLite table for MVP. Named channels, priority levels, structured messages. Polling-based. Upgrade to Redis later if latency matters.
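A polling bus on a single SQLite table might look like the sketch below. The table layout, the consumed flag, and the function names (open_bus, publish, poll) are assumptions; the summary only specifies named channels, priority levels, structured messages, and polling.

```python
import json
import sqlite3


def open_bus(path=":memory:"):
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS messages (
            id       INTEGER PRIMARY KEY AUTOINCREMENT,
            channel  TEXT    NOT NULL,
            priority INTEGER NOT NULL DEFAULT 0,
            payload  TEXT    NOT NULL,           -- structured message as JSON
            consumed INTEGER NOT NULL DEFAULT 0
        )
        """
    )
    return conn


def publish(conn, channel, payload, priority=0):
    conn.execute(
        "INSERT INTO messages (channel, priority, payload) VALUES (?, ?, ?)",
        (channel, priority, json.dumps(payload)),
    )
    conn.commit()


def poll(conn, channel):
    """Return the highest-priority unconsumed message on a channel, or None."""
    row = conn.execute(
        "SELECT id, payload FROM messages "
        "WHERE channel = ? AND consumed = 0 "
        "ORDER BY priority DESC, id ASC LIMIT 1",
        (channel,),
    ).fetchone()
    if row is None:
        return None
    conn.execute("UPDATE messages SET consumed = 1 WHERE id = ?", (row[0],))
    conn.commit()
    return json.loads(row[1])
```

This version assumes one consumer per channel. With several consumers polling the same channel, the select and update would need to run inside one transaction to avoid delivering a message twice.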

SPEC-011: Foundational Pact

Structured YAML. Three enforcement layers: (1) system prompt (probabilistic), (2) output validator (deterministic), (3) tool permissions (structural).

SPEC-045: Overarching Sovereign Agent (Oversoul)

Whitepaper v5 OQ-01. Q is building this now with Metatekt.

Layer 3: Ingestion & Knowledge Processing

SPEC-012: Ingestion Pipeline Portability

Core pipeline: source → chunk → extract → embed → store. Config-driven. No hardcoded references.

SPEC-013: Multi-Modal Input Pipeline

draft Owner: Will

Image → vision model → text → standard pipeline. All modalities produce NODE_SCHEMA output.

SPEC-014: Automated Ingestion Triggers

draft Owner: Will

Filesystem watcher (watchdog) on /inbox/. Webhook endpoints optional. New file → detect type → route → ingest → harden → notify.

SPEC-015: Orchestration with Rollback

Before batch: snapshot → run against staging → compare (connectivity, gravity, duplicates) → rollback if regression. LanceDB versioning (native time-travel) may eliminate storage cost.

SPEC-016: Codex Import Validation

Schema check + dimension check + signature verify + anomaly scan + rollback guard.

SPEC-039: Grounding Verification (Stage 2b)

Stage 2 asks LLM to produce atomic principles with 16 fields. No verification between extraction and writing. Atomicity is not well-defined for LLMs. Three models produce structurally different outputs from the same paragraph.

SPEC-044: Hierarchical Chunking

Large chunks (2,048 tokens) for Stage 2 extraction. Small chunks (256 tokens) indexed for retrieval scoring. Extract principles from large chunks, link them to small chunks for precision. Extraction chunk size and retrieval chunk size are separate co
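The two chunk sizes and the link between them can be sketched as follows. The 2,048 and 256 token sizes come from the summary; representing a document as a flat token list, requiring the large size to be a multiple of the small size, and the hierarchical_chunks name are assumptions.

```python
def hierarchical_chunks(tokens, large=2048, small=256):
    """Split tokens into large extraction chunks and small retrieval chunks.

    Each small chunk records the index of the large chunk that contains it,
    so a principle extracted from a large chunk can be linked to its small chunks.
    """
    if large % small != 0:
        # Keeps every small chunk inside exactly one large chunk.
        raise ValueError("large must be a multiple of small")

    large_chunks = [tokens[i:i + large] for i in range(0, len(tokens), large)]
    small_chunks = [
        {"parent": i // large, "tokens": tokens[i:i + small]}
        for i in range(0, len(tokens), small)
    ]
    return large_chunks, small_chunks
```

For a 5,000-token document this gives 3 large chunks and 20 small ones, with the ninth small chunk (index 8) being the first to point at large chunk 1.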

Layer 4: Security & Sovereignty

SPEC-017: At-Rest Encryption

draft Owner: Rob

AES-256. OS-level default + VeraCrypt guide for paranoid clients. Hardware key to unlock.

SPEC-018: Network Isolation

approved Owner: Q

All inference, embeddings, storage local. Internet opt-in only. Validated: GHOSTNET runs air-gapped on Raspberry Pi.

SPEC-019: Sanitization Pipeline

draft Owner: Rob

PII detection + content classification + domain relevance + output redaction. All external inputs sanitized before touching KB.

SPEC-020: Heartbeat & Health Monitoring

draft Owner: Rob

heartbeat.json updated every 60s. Recovery daemon triggers a restart after 5 min of silence.
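A sketch of the two sides of the heartbeat, assuming the file holds a single Unix timestamp under a "ts" key (the summary does not describe the file's contents). The function names are hypothetical, and the actual restart action is left out.

```python
import json
import os
import time

HEARTBEAT_INTERVAL_S = 60  # writer cadence from the spec
MAX_SILENCE_S = 300        # 5 minutes of silence triggers a restart


def write_heartbeat(path, now=None):
    """Write the current time to heartbeat.json via an atomic rename."""
    now = time.time() if now is None else now
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump({"ts": now}, f)
    os.replace(tmp, path)  # readers never see a half-written file


def needs_restart(path, now=None):
    """Return True if the heartbeat is missing, unreadable, or too old."""
    now = time.time() if now is None else now
    try:
        with open(path) as f:
            last = json.load(f)["ts"]
    except (OSError, ValueError, KeyError):
        return True
    return now - last > MAX_SILENCE_S
```

The monitored process would call write_heartbeat every HEARTBEAT_INTERVAL_S seconds, while the recovery daemon polls needs_restart. Treating a missing or corrupt file as stale is a design choice of this sketch.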

SPEC-021: Kill Switch Protocol

approved Owner: Q

Physical. Pull ethernet. Power down. No remote override. Design principle, not feature.

Layer 5: Retrieval & Reasoning

SPEC-042: Retrieval Evaluation

draft Owner: Will

retrieval_log table: query_hash, retrieved_record_ids, records_used_in_response (boolean per record). Monthly retrieval quality reports from day 1. This is how you answer: "is this system working?"
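The retrieval log and one simple quality metric can be sketched with sqlite3. This version stores one row per retrieved record so that "used in response" is a per-record boolean; that layout, the SHA-256 query hash, and the names log_retrieval and usage_rate are assumptions, not the spec's schema.

```python
import hashlib
import sqlite3


def open_log(path=":memory:"):
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS retrieval_log (
            query_hash       TEXT    NOT NULL,
            record_id        TEXT    NOT NULL,
            used_in_response INTEGER NOT NULL,  -- 1 if the record was used, else 0
            logged_at        TEXT    DEFAULT CURRENT_TIMESTAMP
        )
        """
    )
    return conn


def log_retrieval(conn, query, retrieved_ids, used_ids):
    """Log every retrieved record and whether the response actually used it."""
    query_hash = hashlib.sha256(query.encode("utf-8")).hexdigest()
    used = set(used_ids)
    conn.executemany(
        "INSERT INTO retrieval_log (query_hash, record_id, used_in_response) "
        "VALUES (?, ?, ?)",
        [(query_hash, rid, int(rid in used)) for rid in retrieved_ids],
    )
    conn.commit()


def usage_rate(conn):
    """Fraction of retrieved records that were used; None if the log is empty."""
    return conn.execute("SELECT AVG(used_in_response) FROM retrieval_log").fetchone()[0]
```

A monthly report could filter on logged_at and track usage_rate over time; a falling rate would suggest retrieval is returning records the model ignores.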

SPEC-043: HyDE Query Reconstruction

HyDE (Hypothetical Document Embeddings): generate a hypothetical answer to the query locally → embed THAT for retrieval instead of the question. The hypothetical lives in the same semantic space as stored principles. Privacy preserved: generated loca
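The control flow of HyDE is short enough to sketch directly. The generator, embedder, and search function are passed in as callables; the toy versions below (word-set "embeddings" and overlap scoring) are stand-ins so that the example runs without a model, and all names here are hypothetical.

```python
def hyde_search(query, generate, embed, search, k=5):
    """Retrieve with the embedding of a hypothetical answer, not of the query."""
    hypothetical = generate(query)  # produced locally, per the spec
    return search(embed(hypothetical), k)


def toy_embed(text):
    """Stand-in embedder: the set of lowercase words in the text."""
    return frozenset(text.lower().split())


def make_toy_search(store):
    """Stand-in vector search: rank stored texts by word overlap."""
    def search(vector, k):
        ranked = sorted(store, key=lambda doc: len(vector & toy_embed(doc)), reverse=True)
        return ranked[:k]
    return search


store = [
    "rate hikes slow capital flows to emerging markets",
    "sleep improves memory consolidation",
]


def toy_generate(query):
    # A real system would call the local LLM here.
    return "rate hikes reduce capital flows"


results = hyde_search(
    "why do emerging markets suffer?", toy_generate, toy_embed, make_toy_search(store), k=1
)
```

The question and the best stored principle share almost no wording, but the hypothetical answer does, which is the effect HyDE relies on.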

Layer 6: Interface & Experience

SPEC-022: Primary Chat Interface

Open WebUI + custom LanceDB RAG plugin for MVP. Dashboard primary, chat secondary (Will v4 Principle 07).

SPEC-023: Voice Input

approved Owner: Q

WhisperX local. Validated in production.

SPEC-024: Dashboard / State Viewer

Defer to post-MVP. CLI + chat for founding operators. Dashboard is retention, not acquisition.

SPEC-046: The Seed as DNA

Whitepaper v5 OQ-04, founders call.

Layer 7: Resilience & Autonomy

SPEC-025: Dream Engine

draft Owner: Rob

Idle-period processing. Random sample N principles, cross-domain connections, store in dreams.lance. Nice-to-have for MVP, critical for v2.

SPEC-026: Ghost Swarm (Autonomous Workers)

draft Owner: Rob

Supervisor + specialized workers. Post-MVP. Base model ships with single agent.

SPEC-027: Approval Queue

draft Owner: Rob

System proposes, user confirms. SQLite table. Gradient of control (Q + Will): user-selectable autonomy from full approval to fully hands-off.

Layer 8: Collective Layer

SPEC-028: Synthesis Emission Protocol

Emission packet with principle, confidence, gravity, domain, validation_count, signature. Simple signature for founders, ZKP at 33+ nodes. Codex auto-creation (Q + Will): personal AI detects when research hits contributional level, auto-creates codex

SPEC-029: Mother AI Architecture

Where does Mother run? Shared VPS recommended for MVP.

SPEC-030: Codex Poisoning Defence

draft Owner: Will

Statistical anomaly detection + minimum validation threshold + cross-validation (≥2 nodes) + audit trail. Gradual drift detection over time windows for the hardest attack vector.

SPEC-047: Family Inheritance Protocol

Whitepaper v5 OQ-03.

Layer 9: Self-Evolution Infrastructure

SPEC-031: Agent Activity Log (Past Layer)

Shared table with agent_id column + filtered views. Each agent sees only its own rows. Queryable by time, success/failure, manifesto alignment.

SPEC-032: Agent Manifesto (Future Layer)

MANIFESTO.md per agent. Mission, capabilities, aspirations, growth metrics, gaps. Evolves with user + agent dreams + external disruptions + personality shifts + family expansion.

SPEC-033: Dream Cycle Engine

dream_cycle(agent_id) → activity log + manifesto + KB query → LLM generates mutations[], dream_log, manifesto_update. Rob's dream mechanism + Q's manifesto concept = dreaming from past to future.

SPEC-034: Mutation Protocol

Three types: user_mutation (auto-propagate), dream_mutation (approval queue), collective_mutation (from Mother). Explicit command for MVP, inferred from patterns later.

SPEC-035: Seed Codex (Agent Bootstrap)

Interviews client → proposes agent config → creates SPINE + MANIFESTO + beliefs per agent → initializes logs → self-destructs. The most important onboarding experience. See also SPEC-046 (Seed as DNA).

SPEC-036: Dream Output → Knowledge Bank Pipeline

Extract principles from dream output, store as node_type='dream_insight', source_id=agent_id. Start at confidence 0.5. Mini-hardening after.

SPEC-037: Collective Dream Protocol

Receive anonymized dream_insights, cluster by similarity, synthesize innovations, broadcast as collective_mutations. Manual at 3 founders, automated at 33+.

Other

SPEC-048: Fractal Node Primitive — WHITEPAPER + SPEC pattern at every scale

Every coherent unit of work in VOHU MANAH OS is a Node. A Node has exactly two artifacts — WHITEPAPER.md (why) and SPEC.md (what + frontmatter contract) — and is either a leaf (has handler code) or a composite (has a composes: list). Same template at