Forge Platform

Forge Platform Product Roadmap

Outcome-driven plan from platform foundation through cross-product MVP, governed reasoning, controlled execution, optional workcells and channels, and dogfood hardening. Sequencing follows: contracts before adapters…

Inputs used: docs/platform-charter.md, docs/product-boundaries.md, docs/intersection-map.md, docs/roadmap.md (milestone labels), docs/milestone-plan.md, docs/sprint-structure.md, existing schemas under schemas/, sprints/M*.


1. Product vision

Forge Platform is the governed delivery control plane for agentic software delivery: humans see intent move to evidence through a shared ForgeRun spine, with Blueprints shaping policy and discipline, LCDL governing reasoning, Fleet executing bounded work, Lenses as system of record for runs/approvals/evidence, and agents (Hermes, Factory, OpenClaw) as optional workcells or channels—never the center of truth.


2. Product thesis

We are not building generic “agent orchestration.” We are building an evidence-producing, human-governed path from intent to decision: Blueprints define what must be judged, Versonas structure discipline reviews, LCDL verifies and traces reasoning, Fleet runs approved templates, workcells return reviewable results, and Lenses holds approvals and visibility.


3. Primary users

Segment Need Platform promise
Platform / product engineer Coherent contracts across repos Single spine: ForgeRun, EvidencePacket, samples, ADRs
Operator / SRE Safe execution, auditability Fleet templates, correlation IDs, telemetry via summaries
Reviewer / EM / governance Human approval, evidence Lenses control plane, explicit approval boundaries
Methodology maintainer Policy without runtime sprawl Blueprints + Versona artifacts consumed, not duplicated
Agent/tool author Scoped tools, no shadow state WorkcellRequest/Result, MCP lanes after narrow surfaces

4. Strategic objectives

  1. S1 — Credible spine: Normative contracts and samples so every product can attach without inventing parallel run models.
  2. S2 — Visible control plane: Lenses renders the spine end-to-end (static first, live later).
  3. S3 — Governed reasoning in the loop: LCDL trace and verification are first-class evidence, not chat.
  4. S4 — Approved execution: Fleet jobs are template-bound, correlated to ForgeRun/approval, reviewable in Lenses.
  5. S5 — Optional automation: Hermes/Factory/OpenClaw augment; they do not own governance or final approval.
  6. S6 — Dogfood quality: Platform improves platform with assays, docs-health, and release evidence.

5. Releases overview

Release Codename Primary value
R0 Foundation Roadmap, boundaries, contract gap register, Sprint 1 charge ready
R1 Forge Run Spine MVP Static ForgeRun chain + hardened v1 contracts
R2 Lenses Control Plane MVP Human-visible run, evidence, approval, placeholders
R3 Governed Reasoning MVP LCDL traces and key reasoning flows on the spine
R4 Controlled Execution MVP Fleet template job + classification + EvidencePacket
R5 Agent Workcell MVP Hermes + Factory paths as evidence-returning workcells
R6 Channel Ingress MVP OpenClaw intent + status/evidence relay; Lenses remains authority
R7 Dogfood & hardening Cross-product smoke, assays, public alignment, limitations

6. Release roadmap

Release 0 — Foundation and Roadmap Readiness

Phase P0.1 — Charter & boundaries · P0.2 — Readiness & planning

Outcomes

  • Platform charter, boundaries, and intersection map are treated as normative for roadmap rows.
  • Roadmap rows and milestones pass Definition of Ready or are explicitly flagged.
  • Contract gap register links each gap to owner product and evidence (no silent schema drift).
  • Sprint 1 (M1) charge can be drafted without Hermes/Factory/OpenClaw as hard dependencies.

Milestones

ID Outcome
R0.M0 Platform charter “close” — mission, loop, MVP objective agreed for planning
R0.M1 Roadmap Definition of Ready applied to R0–R2 rows
R0.M2 Product boundary assay — no role collapse (Lenses SoR, etc.)
R0.M3 Contract gap register — schemas/samples vs intersection map
R0.M4 Sprint 1 charge prepared — M1 scope, stories, Sparks listed

Epics

ID Epic
R0.E1 EPIC-CORE-ROADMAP-READINESS — align docs/roadmap.md with release-model outcomes
R0.E2 EPIC-CORE-BOUNDARY-ASSAY — boundary violations logged with evidence
R0.E3 EPIC-CORE-CONTRACT-GAP-REGISTER — schema/API/prompt gaps with owners

Evidence required

  • Updated docs/product-roadmap.md, docs/roadmap-rows.md, Versona §5 output, roadmap gate playbook, Ember Log.
  • Gap register section in playbook or linked artifact under forge-logs/ or sprints/.

Non-goals

  • No production Lenses/UI implementation in R0.
  • No new agent runtime requirements for M1.

Release 1 — Forge Run Spine MVP

Phase P1.1 — Contract hardening · P1.2 — Static validation chain

Outcomes

  • A static chain can represent: intent linkage, ApprovalRequest, AgentRun / WorkcellRequest / WorkcellResult, LCDLTraceSummary placeholder, FleetJobSummary placeholder, EvidencePacket, decision state — without requiring live services.
  • Lenses can later consume the model without forking a parallel schema.

Milestones

ID Outcome
R1.M1 ForgeRun v1 hardened (identifiers, nesting, extension points)
R1.M2 EvidencePacket v1 hardened
R1.M3 ApprovalRequest v1 hardened
R1.M4 AgentRun / WorkcellRequest / WorkcellResult hardened
R1.M5 Static sample chain validates (CI or documented checklist)
R1.M6 M2 Lenses handoff sample complete — fixture set for UI/spec work

Epics

ID Epic
R1.E1 EPIC-CORE-FORGE-RUN-SPINE — cross-product identifiers & correlation
R1.E2 EPIC-CORE-EVIDENCE-PACKET — evidence shape for human decision
R1.E3 EPIC-LENSES-BLUEPRINTS-CONTEXT-PANEL — defer implementation; prepare BlueprintContextPack contract alignment
R1.E4 EPIC-BLUEPRINTS-VERSONA-DEFINITIONS — trace references for session reports (stub acceptable in R1)

Candidate stories (R1 only)

ID Story
R1.S1 Author end-to-end static JSON bundle: ForgeRun + nested refs matching samples/
R1.S2 EvidencePacket references every nested artifact type required by intersection map
R1.S3 ApprovalRequest sample encodes explicit human approval boundary per docs/approval-model.md
R1.S4 AgentRun + Workcell correlation: same forge_run_id / approval lineage
R1.S5 M2 handoff doc: fixtures, field glossary, Lenses panel mapping

Candidate Sparks (R1 immediate execution only)

ID Spark
R1.SP1 Audit schemas/*.json vs docs/intersection-map.md — gap list
R1.SP2 Normalize sample schema identifiers and filenames under samples/
R1.SP3 Add or refresh narrative in docs/agent-workcell-model.md for workcell correlation
R1.SP4 Record M1 evidence in sprints/M1-forge-run-spine/evidence/

Evidence required

  • Passing scripts/check_foundation.py (or successor check), schema/sample parity notes, M1 charter evidence.

Non-goals

  • Hermes, Factory, OpenClaw not required for R1 closure.
  • Live Fleet or LCDL execution not required.

Release 2 — Lenses Control Plane MVP

Phase P2.1 — IA & views · P2.2 — Boundary-validated UX copy

Outcomes

  • Lenses renders static ForgeRun + EvidencePacket (fixtures from R1.M6).
  • Approval, agent/workcell activity, LCDL trace placeholder, Fleet job placeholder, and decision state are visible in one coherent narrative.
  • UI copy passes product boundary review (Lenses ≠ reasoning runtime).

Milestones

ID Outcome
R2.M1 ForgeRun list + detail view (spec + stub implementation per Lenses repo)
R2.M2 EvidencePacket panel
R2.M3 Approval card
R2.M4 AgentRun / Workcell panel
R2.M5 LCDL trace panel placeholder wired to LCDLTraceSummary shape
R2.M6 Fleet job panel placeholder wired to FleetJobSummary shape
R2.M7 Product boundary validation in UI copy — SoR, approval authority

Epics

ID Epic
R2.E1 EPIC-LENSES-EVIDENCE-PACKET
R2.E2 EPIC-LENSES-LCDL-TRACE-PANEL
R2.E3 EPIC-LENSES-FLEET-JOB-CARD
R2.E4 EPIC-LENSES-BLUEPRINTS-CONTEXT-PANEL — read-only context surfacing

Candidate stories (R2 only)

ID Story
R2.S1 ForgeRun list/detail IA + empty/error states
R2.S2 EvidencePacket panel maps nested sections to charter loop
R2.S3 Approval card state machine copy aligned with docs/approval-model.md
R2.S4 Placeholder panels: LCDL trace + Fleet job from R1 fixtures

Evidence required

  • Screenshots or recorded demo from Lenses using forge-platform fixtures; platform assay answers “Lenses still control plane?”

Non-goals

  • Full live LCDL/Fleet integration (deferred to R3/R4).

Self-Host Alpha — Local platform improvement loop (SELFHOST-ALPHA)

Placement: after M2-lite (Lenses static/control slice) and before Release 3 — Governed Reasoning MVP. Hermes, Factory, OpenClaw, and full Fleet execution are not prerequisites.

Outcomes

  • A maintainer can open a ForgeRun backed by files under .forge/runs/, see approval + evidence, and (where enabled) transition decision state from Lenses with local JSON writes only.
  • scripts/selfhost_runner.py executes allowlisted actions after an ApprovalRequest in approved state; results merge into EvidencePacket via scripts/selfhost_import_evidence.py.
  • Static sample chain under sprints/selfhost-alpha/samples/ validates in CI/manual checks (scripts/validate_selfhost_chain.py).

Stories

ID Story
SELFHOST.S1 Static selfhost ForgeRun sample validates
SELFHOST.S2 Lenses can render/import selfhost run
SELFHOST.S3 ApprovalRequest controls runner execution
SELFHOST.S4 Local runner executes check_foundation
SELFHOST.S5 EvidencePacket is generated/imported
SELFHOST.S6 Follow-on Sparks are created from evidence

Sparks

ID Spark
SPARK-SELFHOST-STATIC-RUN-SAMPLE Correlated JSON bundle + validator
SPARK-SELFHOST-APPROVAL-SAMPLE Approval sample + model alignment
SPARK-SELFHOST-LOCAL-RUNNER-CONTRACT LocalRunnerAction/Result + runner
SPARK-SELFHOST-RUN-STORE .forge/runs/ + documentation
SPARK-SELFHOST-EVIDENCE-IMPORTER Importer merges runner → packet
SPARK-LENSES-SELFHOST-HANDOFF Lenses API + Studio route
SPARK-SELFHOST-CLOSE-ASSAY Close assay + demo recording

Product principle

Lenses + ForgeRun + EvidencePacket before adaptive agents: Hermes accelerates later; it does not gate the first self-improvement loop.

Evidence required

  • sprints/selfhost-alpha/evidence/selfhost-gap-assay.md
  • Passing scripts/check_foundation.py and manual runner/importer smoke on a seeded .forge/runs/ directory

Non-goals

  • Raw shell from JSON payloads, chat-only approval, cross-host evidence merge, autonomous code write from the runner.

Release 3 — Governed Reasoning MVP

Phase P3.1 — Trace emission · P3.2 — Run spine mapping · P3.3 — Lenses truthy rendering

Outcomes

  • LCDL emits LCDLTraceSummary tied to ForgeRun.
  • Key flows (decompose, plan DecisionPack-style structure, contradiction scan, evidence-class answer) produce reviewable artifacts in EvidencePacket path.
  • Lenses shows real trace payloads (not only placeholders).

Milestones

ID Outcome
R3.M1 LCDLTraceSummary emitted from at least one canonical task path
R3.M2 decompose_problem (or equivalent) mapped to ForgeRun metadata
R3.M3 plan_decision_pack (or equivalent) mapped to ForgeRun metadata
R3.M4 contradiction_scan (or equivalent) feeds EvidencePacket
R3.M5 Answer-from-evidence / failure-reduction mapped with explicit verification status
R3.M6 Lenses trace rendering with non-placeholder LCDL output

Epics (intersection-first)

ID Epic
R3.E1 EPIC-LCDL-TRACE-SUMMARY
R3.E2 EPIC-VERSONA-LCDL-CONTRACT-MAPPING
R3.E3 EPIC-LENSES-LCDL-DECISIONPACK-VIEW
R3.E4 EPIC-LCDL-STANDARDS-TRACEABILITY

Evidence required

  • Trace JSON + rerun script; EvidencePacket containing trace; assay “LCDL still governed reasoning, not orchestrator?”

Non-goals

  • Full recipe coverage of all Blueprints discipline surfaces.

Release 4 — Controlled Execution MVP

Phase P4.1 — Template & correlation · P4.2 — Visibility & classification

Outcomes

  • Fleet runs one approved template path (e.g. docs-health) with ForgeRun / approval correlation IDs.
  • Lenses shows job state, logs summary, exit code, artifact links via FleetJobSummary.
  • LCDL classifies Fleet output into pass / warning / fail / needs_decision for evidence closure.

Milestones

ID Outcome
R4.M1 FleetTemplate v1 hardened
R4.M2 docs_health (or agreed) template implemented
R4.M3 FleetJobSummary emitted and consumed in EvidencePacket
R4.M4 forge_run_id / approval_id propagation end-to-end
R4.M5 Lenses Fleet job card live
R4.M6 LCDL Fleet output classifier
R4.M7 EvidencePacket includes Fleet result as first-class evidence

Epics

ID Epic
R4.E1 EPIC-FLEET-TEMPLATE-EVIDENCE
R4.E2 EPIC-FLEET-DOCS-HEALTH-TEMPLATE
R4.E3 EPIC-LCDL-FLEET-OUTPUT-CLASSIFIER
R4.E4 EPIC-BLUEPRINTS-FLEET-TASKLET-MAPPING

Evidence required

  • Runnable job demo + captured FleetJobSummary + human approval audit trail.

Non-goals

  • Arbitrary Docker argv for agents by default (template-first guardrail stands).

Release 5 — Agent Workcell MVP

Phase P5.1 — Hermes surfaces · P5.2 — Factory lane · P5.3 — Verification

Outcomes

  • Hermes operates via MCP with Lenses as system of record; readonly perception + governed actions only.
  • Factory runs as WorkcellRequestWorkcellResult through Fleet template; output independently reviewable.
  • LCDL can verify Factory output where stakes require it.

Milestones

ID Outcome
R5.M1 Hermes Forge skill pack (platform-scoped)
R5.M2 Hermes ↔ Lenses MCP readonly tools
R5.M3 Hermes ↔ LCDL MCP reasoning tools (contract-bound)
R5.M4 Hermes ↔ Fleet template-only tools
R5.M5 Factory WorkcellRequest / WorkcellResult path E2E
R5.M6 Factory through Fleet template
R5.M7 LCDL verification of Factory output (selected flows)
R5.M8 Lenses agent/workcell evidence review

Epics (families)

ID Epic
R5.E1 EPIC-HERMES-LENSES-MCP
R5.E2 EPIC-HERMES-LCDL-VERIFY
R5.E3 EPIC-HERMES-FLEET-TEMPLATE-LIST
R5.E4 EPIC-FACTORY-WORKCELL-RESULT
R5.E5 EPIC-LCDL-FACTORY-PATCH-VERIFIER

Evidence requests (R5 — milestone level only)

  • Security review of MCP tools (Security Versona) before widening tool surface.
  • UX review for agent activity semantics in Lenses.

Non-goals

  • Hermes as autonomous approver or canonical policy author.

Release 6 — Channel Ingress MVP

Phase R6.1 — ChatIntent · R6.2 — Safe relay

Outcomes

  • OpenClaw creates ChatIntent; Lenses creates/updates ForgeRun.
  • Status and evidence summaries relay to channel; approval policy remains Lenses.

Milestones

ID Outcome
R6.M1 ChatIntent v1 hardened
R6.M2 OpenClaw → Lenses bridge (documented + working prototype)
R6.M3 ForgeRunStatusSummary projection
R6.M4 EvidenceSummary projection
R6.M5 Safe approval relay policy
R6.M6 Channel ingress demo

Epics

ID Epic
R6.E1 EPIC-OPENCLAW-CHATINTENT
R6.E2 EPIC-OPENCLAW-RUN-STATUS
R6.E3 EPIC-OPENCLAW-EVIDENCE-SUMMARY

Non-goals

  • Chat-native implicit approval for destructive execution.

Release 7 — Dogfood, Assay, and Hardening

Phase R7.1 — Loops · R7.2 — Release readiness

Outcomes

  • Cross-product smoke, docs-health dogfood, roadmap gate, release readiness produce EvidencePackets.
  • Drift and boundary violations are visible and tracked.

Milestones

ID Outcome
R7.M1 Cross-product smoke test
R7.M2 Docs-health dogfood loop
R7.M3 Roadmap-gate dogfood loop
R7.M4 Schema hardening loop
R7.M5 Release readiness assay
R7.M6 Public docs alignment
R7.M7 Known limitations + next-cycle planning

7. Product-intersection epic map

Family Example epic IDs
Lenses ↔ Blueprints EPIC-LENSES-BLUEPRINTS-CONTEXT-PANEL, EPIC-LENSES-BLUEPRINTS-VERSONA-INDEX
Blueprints ↔ Versonas EPIC-BLUEPRINTS-VERSONA-DEFINITIONS, EPIC-BLUEPRINTS-VERSONA-SESSION-REPORT
Blueprints/Versonas ↔ LCDL EPIC-VERSONA-LCDL-CONTRACT-MAPPING, EPIC-LCDL-STANDARDS-TRACEABILITY
Lenses ↔ LCDL EPIC-LENSES-LCDL-TRACE-PANEL, EPIC-LENSES-LCDL-DECISIONPACK-VIEW
Lenses ↔ Fleet EPIC-LENSES-FLEET-JOB-CARD, EPIC-LENSES-FLEET-LOG-SUMMARY
Fleet ↔ Blueprints EPIC-BLUEPRINTS-FLEET-TASKLET-MAPPING, EPIC-FLEET-DOCS-HEALTH-TEMPLATE
Fleet ↔ LCDL EPIC-LCDL-FLEET-OUTPUT-CLASSIFIER, EPIC-LCDL-FLEET-EVIDENCE-VERIFIER
Hermes ↔ Lenses EPIC-HERMES-LENSES-WORKSPACE-STATE, EPIC-HERMES-LENSES-APPROVALREQUEST
Hermes ↔ Blueprints/Versonas EPIC-HERMES-BLUEPRINT-CONTEXT-PACK, EPIC-HERMES-VERSONA-INVOCATION
Hermes ↔ LCDL EPIC-HERMES-LCDL-DECOMPOSE, EPIC-HERMES-LCDL-VERIFY
Hermes ↔ Fleet EPIC-HERMES-FLEET-TEMPLATE-LIST, EPIC-HERMES-FLEET-APPROVED-SUBMIT
Factory ↔ Blueprints EPIC-FACTORY-BLUEPRINT-CONTEXT-PACK, EPIC-FACTORY-FORGE-SKILL-PACK
Factory ↔ Fleet EPIC-FLEET-FACTORY-READONLY-TEMPLATE, EPIC-FACTORY-WORKCELL-RESULT
Factory ↔ LCDL EPIC-LCDL-FACTORY-PATCH-VERIFIER, EPIC-LCDL-FACTORY-EVIDENCE-REVIEW
OpenClaw ↔ Lenses EPIC-OPENCLAW-CHATINTENT, EPIC-OPENCLAW-APPROVAL-RELAY-POLICY

8. Roadmap row Definition of Ready

Each row (milestone, epic, or story) must state:

  1. Outcome — observable user/operator/business result.
  2. Evidence — why now; known vs assumed.
  3. Horizon — Release / Phase / NOW–NEXT–LATER / target milestone ID.
  4. OKR / strategy fit — link to S1–S6.
  5. Dependencies — product, technical, external.
  6. Non-goals — explicit exclusions.
  7. Next gate — what unlocks the next lifecycle step (e.g. Architecture review, BA flow, PM gate).
  8. Owner product — DRI product for the outcome.
  9. Consumer product — primary consumer of the artifact/API.
  10. Contract touched — schema, API, prompt, skill, template, ADR.
  11. Human approval boundary — what requires human decision.
  12. Evidence expected — artifacts proving done.

If incomplete: mark Hold for evidence, Needs discovery, Needs Versona routing, or Too broad; split before WBS.


9. Sequencing rationale

  1. R0 prevents role collapse and silent contract debt before spend on Lenses/LCDL/Fleet.
  2. R1 establishes the lingua franca; all later releases attach, not redefine.
  3. R2 validates human understanding before automating reasoning or jobs.
  4. R3 adds LCDL only when the spine can carry trace evidence.
  5. R4 adds Fleet only after approval model is expressible in artifacts.
  6. R5 constrains agents to template-first execution and evidence return.
  7. R6 keeps channels downstream of Lenses authority.
  8. R7 institutionalizes quality and narrative honesty.

Rejected patterns: OpenClaw before Lenses spine; Hermes before readonly Lenses + template Fleet; Factory verification before LCDL classifier patterns exist; generic agent orchestration as “platform center.”


10. Risks and assumptions

Risk Mitigation
Schema drift across repos R0 contract gap register + CI checks; single forge-platform source
Lenses becomes shadow orchestrator Boundary copy + assay; no hidden LLM runs in Lenses
Fleet gains policy ownership Blueprints tasklet mapping stays recipe; Fleet stays execution
Agents bypass approval WorkcellResult + ApprovalRequest linkage; Security review at R5
Over-planning early R5–R7 stay epic/milestone until R4 closes

Assumption: M1 scope excludes mandatory Hermes/Factory/OpenClaw per charter MVP.


11. Evidence requests

  1. Architecture Versona: validate correlation-ID scheme across ForgeRun, ApprovalRequest, FleetJobSummary for R1–R4.
  2. Security Versona: threat model for MCP tool surfaces (R5) and OpenClaw relay (R6).
  3. Testing Versona: cross-product smoke criteria and schema validation strategy (R7.M1, R7.M4).
  4. UX Versona: Lenses IA for R2 panels — single narrative vs tab hell.
  5. BA Versona: detailed flows for approval edge cases when M1 stories split for implementation.

12. Suggested next Versonas

Versona When Why
Architecture Before R1.SP1 deepens schemas Correlation, MCP seams, extension points
BA When M1 stories become implementable Approval edge cases, acceptance language
Project Management When R1 charge is capacity-bound RAID, sequencing, critical path
Security R4–R6 planning Execution + channel attack surfaces
Testing R1 sample chain + R7 loops Evidence closure automation
UX R2 planning Control-plane IA
Marketing R7.M6 Public MVP narrative only when spine credible

13. Immediate next gates

  1. Roadmap gate (PM): run playbook in forge-logs/versona/product-management/roadmap-initiation/outputs/roadmap-gate-playbook.md.
  2. R0.M3: publish contract gap register (can live in docs/ or sprint evidence).
  3. M1: execute sprints/M1-forge-run-spine/roadmap-handoff.md — Sparks R1.SP1–SP4.
  4. Ember Log: capture prioritization trade-offs for any deferred milestone.

Product Management Versona validation

Definition of Ready coverage

R0–R2 rows in docs/roadmap-rows.md include full readiness fields; R3–R7 milestones include outcomes and evidence at milestone/epic depth with explicit evidence requests where detail is intentionally deferred.

Product-boundary risks

No deliberate boundary violations in sequence. Watch: Lenses feature creep into reasoning; Fleet storing policy; chat as approval.

Overplanned areas

Release 5–7 intentionally avoid Spark-level detail until R4 evidence exists.

Under-evidenced assumptions

LCDL task names (decompose_problem, etc.) treated as placeholders for contract-mapped flows — actual LCDL catalog names must be reconciled (Hold for evidence on exact task IDs until LCDL owners confirm).

Must-run next Versonas

Architecture (correlation scheme) and BA (approval flows) before expanding R1 beyond the four Sparks listed.

Recommendation

Proceed with conditions — roadmap is executable for M1 and R2 planning, provided the contract gap register (R0.M3) and correlation-ID architecture note are closed or explicitly waived with recorded rationale.