Standout Builder Studio · planning Index / Roadmap review Updated 13 Jun 2026

The roadmap: from today's pipeline to zero-engineering

The rest of this suite maps the territory — the eight-step flow, the edge and agent architecture, the real GCP pipeline, the generative-AI touchpoints, the UI bets. This page sequences the crossing. Five phases, each with the dependencies that order them, exit criteria you can measure, and the decisions that gate them. The reader who finishes here should know exactly what to do next, and in what order. The phases and decisions are CONCEPT · proposed; everything they build on is grounded in the pages they cite. The audience includes external Game Builders — terms are glossed on first use across the suite.

The five phases

The plan is not a feature list — it is a dependency graph with gates. P0 · Decide unblocks everything by recording the choices the build pages keep deferring. P1 · Foundations makes a spec scaffold to a previewable build with no hand edits. P2 · Reskin and P3 · Tune then take engineers out of the two highest-volume paths, in parallel. P4 moves the sandbox to the real edge and opens the external portal. The diagram places the phases on a timeline with their hard dependencies; the critical path P0→P1→P2 is drawn in flame.

Diagram · Phase timeline with dependency edgesCONCEPT · proposed

Read the flame edges as the critical path: nothing in P2 can start until P1 ships scaffold v2 and template extraction, and nothing in P1 starts until P0 records the UI-treatment and import decisions it depends on. P3 needs only P1's simulation-harness foundations, so it overlaps P2. P4 needs both P2 and P3 plus the builder-auth decision from P0 — which is why that one P0 decision has a long edge all the way to the end.

P0 · Decide

Record the choices the build pages keep deferring

now · days

Depends on: nothing — this is the unblocker. Everything downstream waits on it.

Four decisions are blocking the build, and all are framed elsewhere in the suite but not yet made. P0 is the cheapest, highest-leverage phase: it is conversation and a written record, not code.

UI treatment per surface. Pick the path-picker, mechanics-config, and validation treatments — the recommendations are reasoned but unconfirmed. ui-variants · deciding frames each choice and the cheapest validation (a 15-minute builder interview).
Builder auth model. Cloudflare Access vs builder accounts on the D1 token system — the tenancy seam (ownerPrincipal) is already in the data model, but the auth choice gates the external portal in P4. architecture · two audiences, one surface.
Import path — info sheet up front? Does Import require a PDF before the builder proceeds, or can they start with screenshots only and defer it? The extraction-confidence trade-off is material (flow/01 open questions).
Mechanic-prototype surfacing. Sixth card, New-game variant, or advanced toggle? Today it is not one of the five named paths (flow/01 open questions).

Exit criteria

Each decision is recorded in the decision log (see §decisions) with a where-recorded link, and the flow/01–06 proposals are updated to the chosen UI treatment per the cadence. Be honest: until these four are recorded, P0 is not done — and P1 cannot start, because it builds the surfaces P0 chooses.

P1 · Foundations

A spec scaffolds to a previewable build with zero manual edits

weeks

Depends on: P0's UI-treatment and import decisions (the surfaces these tools feed). Unblocks: P2 (needs scaffold v2 + extraction) and P3 (needs the sim-harness foundations laid here).

The plumbing phase. It closes the manual gaps traced in variant → live · gaps and makes the Scaffolder agent buildable on a real CLI rather than a hypothetical one.

Template auto-extraction Effort S — generate GameTemplates from each engine DEFAULT_CONFIG at build time. Opens all 15 engine games to the Builder (not three) and kills template drift at the source. build-changes #2.
scaffold-game v2 Effort M — runs sync-games.ts, patches the CI GAMES input, tsc --noEmits the variant, validates the registry entry, writes the action to audit_log, and exposes itself over MCP. build-changes #3.
Spec ↔ game linkage in D1 — versions[] / baseGameRef on the grown spec record the PR and Worker, so provenance lives in the dashboard, not only in variant.meta.json. architecture · data model.
Per-variant preview deploys Effort M — a CI stage deploys each variant branch to a throwaway preview before merge. build-changes #4.

Exit criteria

A spec scaffolds to a previewable build with ZERO manual file edits (theme aside — that is P2's keystone), and every scaffold is visible in the dashboard audit trail. If a human still has to hand-run sync-games.ts or hand-edit the CI GAMES arg, P1 is not done.

P2 · Reskin without engineers

A reskin ships to sandbox with no engineer in the loop

1–2 months

Depends on: P1's scaffold v2 + template extraction. On the critical path — this is the first phase that delivers the zero-engineering promise to a real path.

The keystone phase. The single highest-leverage change in the whole plan — the shared client shell — finally makes the captured theme read instead of dead data in variant.meta.json.

Shared client shell + theme tokens Effort L — one shell all variants share, driven by a per-variant token file. A reskin becomes a data file, not a forked client. build-changes #1.
Theme-application codegen — generated tokens become the client skin via the shell; a small, reviewable token-file write rather than hand-edited assets.ts (generative-ai · map).
Asset agent v1 — image candidates + lobby safe-area checks, behind the art-owner gate (architecture · roster).
Copy / Localisation agent v1 — help text and operator-facing copy, each locale gated by a reviewer.

Exit criteria

A reskin ships to sandbox with no engineer in the loop — the 2:05 benchmark (reskin · the clock) achieved for real, not on paper — and engineering hours per reskin = 0.

P3 · Tune with proof

A config variant goes spec → sim-proof → approved → sandbox in a day

1–2 months · overlaps P2

Depends on: P1's simulation-harness foundations. Overlaps P2 — different people, no shared keystone, so the two run in parallel.

Where tuning earns its proof. The Math-Sim agent's tool turns "reviewers eyeball numbers" into "reviewers approve a number a simulation established."

RTP simulation harness Effort L — replay the certified, locked RNG against a config to produce an RTP/volatility/max-win report. build-changes #5. Raise the recertification question with compliance early — whether simulating against the certified RNG carries recert implications is flagged open at architecture · open questions and must be answered before the harness is trusted.
Config impact gauges — the live RTP / volatility / max-win cockpit, ui-variants Surface 2 · C, advisory until a proof confirms them.
Math-owner gate workflow — the approve-the-proof handoff wired into the dashboard on the comment-as-handoff channel.
flow/02 proposal built — the config-intent surface elevated to full depth per the cadence.

Exit criteria

A config variant goes spec → sim-proof → approved → sandbox inside a day — the 3:10 benchmark where the math gate dominated — and model config-drafting enters at assistive on the autonomy ladder (generative-ai · map).

P4 · Real sandbox on the edge + the portal

One production game on the edge; one external builder unassisted

quarter+

Depends on: P2 + P3 (the agent workforce and proof loop must exist) and the builder-auth decision from P0. The longest phase, and the one the whole vision points at.

Where the edge target stops being a beachhead and the second audience arrives. Incremental, never a big bang.

Launch Lab, real — stub-wallet Worker, HMAC-signed launch URLs, captured session evidence (the things flow/04 mocks today, wired for real).
Grow the edge beachhead — take trunk-raider-edge, edge-engine, edge-do, edge-game-sdk into the Workers-for-Platforms namespaces game by game. The migration is credible because the experiments already run the target runtime — see variant → live · today vs target.
External builder portal — behind the chosen auth, scoped by ownerPrincipal so a builder sees only their own games and no revenue or pipeline surfaces.

Exit criteria

One production game serving from Workers for Platforms, and one external builder completes a reskin → sandbox unassisted. Until a real game serves from the edge and a real external partner gets through alone, P4 is in progress.

The decision log

A roadmap is only as honest as its open questions. Some choices the suite has already decided and built on; others are pending and genuinely block P0's exit. The pending rows below are not rhetorical — four of them carry a "decide by P0" hint precisely because P1 cannot start until they are recorded. Marketplace is marked an open direction: it is documented, not scheduled.

Decision	Where it is recorded / framed	Status / decide by
Decided — built on across the suite
Workers for Platforms is the target runtime	Specified at architecture · edge delivery; today-vs-target contrast at variant → live.	decided
Builder Validation folded into Builder Studio	The validation surface is step 5 of the one flow — ui-variants · validation; not a separate product.	decided
Mocks carry honesty badges	The honesty rule on the index: every mock is GROUNDED or CONCEPT.	decided
Proposal cadence — one step per proposal	The seven-part proposal shape on the index; flow/01 and flow/04 are the exemplars.	decided
Pending — these block P0 exit
UI treatments per surface	Framed with recommendations and cheapest validation at ui-variants · deciding.	pending · decide by P0
Builder auth model	Two options weighed at architecture · two audiences; restated as an open question at open questions.	pending · decide by P0
Import — PDF info sheet up front?	Framed in flow/01 open questions; the import benchmark (0:25-to-spec) assumes a PDF was supplied.	pending · decide by P0
Mechanic-prototype surfacing	Framed in flow/01 open questions — sixth card vs New-game variant vs advanced toggle.	pending · decide by P0
Marketplace direction	Documented as an open direction at architecture · open questions (mechanics as licensable products) and the mind map's Marketplace branch.	open direction · no commitment

The four "decide by P0" rows are the literal content of P0's exit criteria. They are not blockers on the architecture (the suite is careful to keep the rest un-gated on them), but they are blockers on starting to build: you cannot write scaffold v2's audited-MCP surface without knowing which validation treatment it reports into, and you cannot finish the data model's portal index without the auth choice.

What we measure

The north star is "little to no engineering per game." That is only real if it is instrumented. This is the panel — and the honest starting point is that none of it is measurable today because there is no telemetry on the build pipeline. The benchmarks below come from the worked examples (a wall-clock trace), not from a live dashboard; the job of P1 is to make them live.

Measure	Target / benchmark	Today
Time-to-playable per path	Reskin 2:05 · Tune 3:10 (math gate dominated) · Import 0:25-to-spec — from the worked examples.	Not measurable — no build telemetry exists.
Engineering hours per game	0 for reskin and tune (the P2 and P3 exit bars).	Hours per reskin (hand-edited `assets.ts` + Tailwind + art); not tracked.
Builds started per week	Trending up as the Builder opens to all 15 games and external builders.	Not tracked.
Gate first-pass rate	Per gate — rising as agents and proof loops mature.	Not tracked; gates are not yet wired as runs.
Agent autonomy promotions	Per touchpoint, up the autonomy ladder (assistive → autonomous·gated → autonomous), earned on track record.	N/A — no agents wired; every touchpoint starts assistive.
Sandbox sessions per build	A signal of iteration depth before handoff.	Not tracked — Launch Lab is mocked, not wired.

Instrument these in the dashboard from P1, so every later phase proves itself with numbers, not vibes — the P2 and P3 exit bars (hours = 0, a-day-with-proof) are only credible if the panel above is live to read them off.

Literally next

The immediate actions — most of them P0, one of them the first piece of P1 you can start in parallel because it depends on no decision.

Run the ui-variants decision session — put each surface's variants in front of 2–3 real Game Builders for 15 minutes each and record the chosen treatment per ui-variants · deciding.
Answer the auth and import questions — make the builder-auth call (Access vs D1 tokens) and the import PDF-upfront call (flow/01), and write both into the decision log.
Raise the recertification question with compliance — get an expert answer on whether simulating against the certified RNG carries recert implications (architecture · open questions) before the P3 harness is trusted.
Write template auto-extraction first — it is the Effort-S win (build-changes #2), depends on no pending decision, and opens all 15 games. Start it now, in parallel with P0.
Promote-or-close the two prototype branches — give origin/feat/builder-studio-path-picker-mock and origin/feat/builder-studio-readiness-grounded a disposition once the UI treatments are chosen, so neither lingers as an ambiguous half-state.
Decide the mechanic-prototype surfacing — pick sixth-card vs New-game-variant vs advanced-toggle (flow/01) and record it; it shapes the path picker P1 builds.
Schedule the flow/02 full proposal — slot the mechanic-setup proposal into the daily cadence so it reaches full depth ahead of P3.

That is the whole crossing: the suite mapped the territory, this page sequenced it, and the cadence keeps it moving — one decision recorded, one proposal elevated, one phase exited at a time.