Specifications
Specs Index
47 specifications

Layer 1: Schema & Data Architecture

SPEC-001: SPEC-001: Universal Node Schema — Two-Layer Model
draft Owner: Q
Two-layer schema: 16 immutable core fields (Layer 1 — the Meridian protocol) + 20+ extensible fields (Layer 2 — personal only, null defaults, never emitted). Layer 1 never changes. Layer 2 grows without migration.
SPEC-002: SPEC-002: Embedding Model Standardization
draft Owner: Q
BGE-M3 (1024-dim) as Meridian standard. Rob must migrate 16,717 holons from 768 → 1024-dim. Recommend batch re-embed (~4h on CPU).
SPEC-003: SPEC-003: Gravity Score Formula
draft Owner: Q
If gravity = retrieval frequency → rich-get-richer. If gravity = validation count → "validation" undefined. If gravity = LLM-assigned → only as reliable as extraction. The entire compounding proposition depends on this.
SPEC-004: SPEC-004: Confidence History / Belief Versioning
draft Owner: Q
SQLite table: confidence_history (principle_id, old_score, new_score, timestamp, trigger, context). Principles dropping >0.3 from peak flagged for review. Storage: ~34K rows/year. Negligible.
SPEC-005: SPEC-005: Error Bank Schema
draft Owner: Rob
NODE_SCHEMA with node_type='error'. Fields: error_type, related_principle_id, context, outcome, severity. Embeds on error description for similarity search.
SPEC-006: SPEC-006: Edge Threshold Calibration
draft Owner: Q
Per-domain threshold. Current 0.85 validated on marketing/psychology. Health may need 0.82, finance may need 0.88. Test across all three founders' domain data.
SPEC-038: SPEC-038: Novelty Score
draft Owner: Q
Whitepaper v5 Section 05.5, Risk Register CON 03.
SPEC-040: SPEC-040: Multi-Label Collections
draft Owner: Q
"Interest rate increases slow capital allocation to emerging markets" belongs in macro_economics, equity_analysis, market_structure, AND geopolitics simultaneously. Single-label fails.
SPEC-041: SPEC-041: Write Coordinator
draft Owner: Q
Ingestion writes: (1) vector to LanceDB, (2) metadata to SQLite, (3) edges to graph. If step 1 succeeds and step 2 fails = ghost record. LanceDB is eventually consistent, SQLite is ACID, graph is in-memory.

Layer 2: Agent Core

SPEC-007: SPEC-007: Agent Loop Architecture
draft Owner: Q
Clean agent loop: message → context assembly (SPINE + RAM + retrieved knowledge) → LLM call → tool execution → response → state update. Tools registered via config. Context assembly pluggable per domain.
SPEC-008: SPEC-008: SPINE / RAM / Beliefs Hierarchy + SPINE Versioning
draft Owner: Q
SPINE > Beliefs > RAM. SPINE overrides beliefs. Beliefs override RAM.
SPEC-009: SPEC-009: Model Abstraction Layer
draft Owner: Q
Unified interface: call(messages, model_key, tools=None). Provider by config. Cascade logic optional. Will v4 adds: router.py as single entry point with decompose() for subtask routing. Task types: EXTRACTION(8B), REASONING(30B+), SYNTHESIS(30B+), FO
SPEC-010: SPEC-010: Inter-Agent Message Bus
draft Owner: Q
SQLite table for MVP. Named channels, priority levels, structured messages. Polling-based. Upgrade to Redis later if latency matters.
SPEC-011: SPEC-011: Foundational Pact
draft Owner: Q
Structured YAML. Three enforcement layers: (1) system prompt (probabilistic), (2) output validator (deterministic), (3) tool permissions (structural).
SPEC-045: SPEC-045: Overarching Sovereign Agent (Oversoul)
draft Owner: Q
Whitepaper v5 OQ-01. Q is building this now with Metatekt.

Layer 3: Ingestion & Knowledge Processing

SPEC-012: SPEC-012: Ingestion Pipeline Portability
draft Owner: Q
Core pipeline: source → chunk → extract → embed → store. Config-driven. No hardcoded references.
SPEC-013: SPEC-013: Multi-Modal Input Pipeline
draft Owner: Will
Image → vision model → text → standard pipeline. All modalities produce NODE_SCHEMA output.
SPEC-014: SPEC-014: Automated Ingestion Triggers
draft Owner: Will
Filesystem watcher (watchdog) on /inbox/. Webhook endpoints optional. New file → detect type → route → ingest → harden → notify.
SPEC-015: SPEC-015: Orchestration with Rollback
draft Owner: Q
Before batch: snapshot → run against staging → compare (connectivity, gravity, duplicates) → rollback if regression. LanceDB versioning (native time-travel) may eliminate storage cost.
SPEC-016: SPEC-016: Codex Import Validation
draft Owner: Q
Schema check + dimension check + signature verify + anomaly scan + rollback guard.
SPEC-039: SPEC-039: Grounding Verification (Stage 2b)
draft Owner: Q
Stage 2 asks LLM to produce atomic principles with 16 fields. No verification between extraction and writing. Atomicity is not well-defined for LLMs. Three models produce structurally different outputs from the same paragraph.
SPEC-044: SPEC-044: Hierarchical Chunking
draft Owner: Q
Large chunks (2,048 tokens) for Stage 2 extraction. Small chunks (256 tokens) indexed for retrieval scoring. Extract principles from large chunks, link them to small chunks for precision. Extraction chunk size and retrieval chunk size are separate co

Layer 4: Security & Sovereignty

SPEC-017: SPEC-017: At-Rest Encryption
draft Owner: Rob
AES-256. OS-level default + VeraCrypt guide for paranoid clients. Hardware key to unlock.
SPEC-018: SPEC-018: Network Isolation
approved Owner: Q
All inference, embeddings, storage local. Internet opt-in only. Validated: GHOSTNET runs air-gapped on Raspberry Pi.
SPEC-019: SPEC-019: Sanitization Pipeline
draft Owner: Rob
PII detection + content classification + domain relevance + output redaction. All external inputs sanitized before touching KB.
SPEC-020: SPEC-020: Heartbeat & Health Monitoring
draft Owner: Rob
heartbeat.json updated every 60s. Recovery daemon triggers restart on 5min silence.
SPEC-021: SPEC-021: Kill Switch Protocol
approved Owner: Q
Physical. Pull ethernet. Power down. No remote override. Design principle, not feature.

Layer 5: Retrieval & Reasoning

SPEC-042: SPEC-042: Retrieval Evaluation
draft Owner: Will
retrieval_log table: query_hash, retrieved_record_ids, records_used_in_response (boolean per record). Monthly retrieval quality reports from day 1. This is how you answer: "is this system working?"
SPEC-043: SPEC-043: HyDE Query Reconstruction
draft Owner: Q
HyDE (Hypothetical Document Embeddings): generate a hypothetical answer to the query locally → embed THAT for retrieval instead of the question. The hypothetical lives in the same semantic space as stored principles. Privacy preserved: generated loca

Layer 6: Interface & Experience

SPEC-022: SPEC-022: Primary Chat Interface
draft Owner: Q
Open WebUI + custom LanceDB RAG plugin for MVP. Dashboard primary, chat secondary (Will v4 Principle 07).
SPEC-023: SPEC-023: Voice Input
approved Owner: Q
WhisperX local. Validated in production.
SPEC-024: SPEC-024: Dashboard / State Viewer
draft Owner: Q
Defer to post-MVP. CLI + chat for founding operators. Dashboard is retention, not acquisition.
SPEC-046: SPEC-046: The Seed as DNA
draft Owner: Q
Whitepaper v5 OQ-04, founders call.

Layer 7: Resilience & Autonomy

SPEC-025: SPEC-025: Dream Engine
draft Owner: Rob
Idle-period processing. Random sample N principles, cross-domain connections, store in dreams.lance. Nice-to-have for MVP, critical for v2.
SPEC-026: SPEC-026: Ghost Swarm (Autonomous Workers)
draft Owner: Rob
Supervisor + specialized workers. Post-MVP. Base model ships with single agent.
SPEC-027: SPEC-027: Approval Queue
draft Owner: Rob
System proposes, user confirms. SQLite table. Gradient of control (Q + Will): user-selectable autonomy from full approval to fully hands-off.

Layer 8: Collective Layer

SPEC-028: SPEC-028: Synthesis Emission Protocol
draft Owner: Q
Emission packet with principle, confidence, gravity, domain, validation_count, signature. Simple signature for founders, ZKP at 33+ nodes. Codex auto-creation (Q + Will): personal AI detects when research hits contributional level, auto-creates codex
SPEC-029: SPEC-029: Mother AI Architecture
draft Owner: Q
Where does Mother run? Shared VPS recommended for MVP.
SPEC-030: SPEC-030: Codex Poisoning Defence
draft Owner: Will
Statistical anomaly detection + minimum validation threshold + cross-validation (≥2 nodes) + audit trail. Gradual drift detection over time windows for the hardest attack vector.
SPEC-047: SPEC-047: Family Inheritance Protocol
draft Owner: Q
Whitepaper v5 OQ-03.

Layer 9: Self-Evolution Infrastructure

SPEC-031: SPEC-031: Agent Activity Log (Past Layer)
draft Owner: Q
Shared table with agent_id column + filtered views. Each agent sees only its own rows. Queryable by time, success/failure, manifesto alignment.
SPEC-032: SPEC-032: Agent Manifesto (Future Layer)
draft Owner: Q
MANIFESTO.md per agent. Mission, capabilities, aspirations, growth metrics, gaps. Evolves with user + agent dreams + external disruptions + personality shifts + family expansion.
SPEC-033: SPEC-033: Dream Cycle Engine
draft Owner: Q
dream_cycle(agent_id) → activity log + manifesto + KB query → LLM generates mutations[], dream_log, manifesto_update. Rob's dream mechanism + Q's manifesto concept = dreaming from past to future.
SPEC-034: SPEC-034: Mutation Protocol
draft Owner: Q
Three types: user_mutation (auto-propagate), dream_mutation (approval queue), collective_mutation (from Mother). Explicit command for MVP, inferred from patterns later.
SPEC-035: SPEC-035: Seed Codex (Agent Bootstrap)
draft Owner: Q
Interviews client → proposes agent config → creates SPINE + MANIFESTO + beliefs per agent → initializes logs → self-destructs. The most important onboarding experience. See also SPEC-046 (Seed as DNA).
SPEC-036: SPEC-036: Dream Output → Knowledge Bank Pipeline
draft Owner: Q
Extract principles from dream output, store as node_type='dream_insight', source_id=agent_id. Start at confidence 0.5. Mini-hardening after.
SPEC-037: SPEC-037: Collective Dream Protocol
draft Owner: Q
Receive anonymized dream_insights, cluster by similarity, synthesize innovations, broadcast as collective_mutations. Manual at 3 founders, automated at 33+.