From 5b00379f2c37a976f5bf0d239b0192d47fd0aa5d Mon Sep 17 00:00:00 2001 From: Nexus Dev Date: Thu, 2 Apr 2026 22:37:33 +0000 Subject: [PATCH] docs: create milestone v1.5 roadmap (6 phases) --- .planning/REQUIREMENTS.md | 48 ++++---- .planning/ROADMAP.md | 239 ++++++++++++++++++++++++++++---------- .planning/STATE.md | 172 ++++----------------------- 3 files changed, 226 insertions(+), 233 deletions(-) diff --git a/.planning/REQUIREMENTS.md b/.planning/REQUIREMENTS.md index 6cea3a5a..e3fc0145 100644 --- a/.planning/REQUIREMENTS.md +++ b/.planning/REQUIREMENTS.md @@ -69,33 +69,33 @@ | Requirement | Phase | Status | |-------------|-------|--------| -| ONBD-01 | — | Pending | -| ONBD-02 | — | Pending | -| ONBD-03 | — | Pending | -| ONBD-04 | — | Pending | -| ONBD-05 | — | Pending | -| ONBD-06 | — | Pending | -| ONBD-07 | — | Pending | -| CLOUD-01 | — | Pending | -| CLOUD-02 | — | Pending | -| CLOUD-03 | — | Pending | -| CLOUD-04 | — | Pending | -| CLOUD-05 | — | Pending | -| VOICE-01 | — | Pending | -| VOICE-02 | — | Pending | -| VOICE-03 | — | Pending | -| ASST-01 | — | Pending | -| ASST-02 | — | Pending | -| ASST-03 | — | Pending | -| ASST-04 | — | Pending | -| CLI-01 | — | Pending | -| CLI-02 | — | Pending | +| ONBD-01 | Phase 30 | Pending | +| ONBD-02 | Phase 30 | Pending | +| ONBD-03 | Phase 30 | Pending | +| ONBD-07 | Phase 30 | Pending | +| CLOUD-01 | Phase 31 | Pending | +| CLOUD-02 | Phase 31 | Pending | +| CLOUD-03 | Phase 31 | Pending | +| CLOUD-04 | Phase 31 | Pending | +| CLOUD-05 | Phase 31 | Pending | +| ONBD-04 | Phase 32 | Pending | +| ONBD-05 | Phase 32 | Pending | +| ONBD-06 | Phase 32 | Pending | +| ASST-01 | Phase 33 | Pending | +| ASST-02 | Phase 33 | Pending | +| ASST-03 | Phase 33 | Pending | +| ASST-04 | Phase 33 | Pending | +| VOICE-01 | Phase 34 | Pending | +| VOICE-02 | Phase 34 | Pending | +| VOICE-03 | Phase 34 | Pending | +| CLI-01 | Phase 35 | Pending | +| CLI-02 | Phase 35 | Pending | **Coverage:** - v1.5 requirements: 21 total -- Mapped to phases: 0 -- Unmapped: 21 ⚠️ +- Mapped to phases: 21 +- Unmapped: 0 ✓ --- *Requirements defined: 2026-04-02* -*Last updated: 2026-04-02 after initial definition* +*Last updated: 2026-04-02 after roadmap created (phases 30-35)* diff --git a/.planning/ROADMAP.md b/.planning/ROADMAP.md index 9a24c722..df8bff16 100644 --- a/.planning/ROADMAP.md +++ b/.planning/ROADMAP.md @@ -1,93 +1,199 @@ -# Roadmap: v1.4 Hermes Default Provider +# Roadmap: Nexus -**Milestone:** v1.4 -**Status:** Active -**Phases:** 27-29 (3 phases) -**Granularity:** Coarse -**Coverage:** 16/16 requirements mapped +## Milestones + +- ✅ **v1.2.1 Universal Skill Management** - Phase 1 (shipped 2026-04-01) +- ✅ **v1.3 Chat & PWA** - Phases 21-26 (shipped 2026-04-02) +- ✅ **v1.4 Hermes Default Provider** - Phases 27-29 (shipped 2026-04-02) +- 🚧 **v1.5 Smart Onboarding + Personal AI Assistant** - Phases 30-35 (in progress) --- -## Phases +
+✅ v1.2.1 Universal Skill Management (Phase 1) - SHIPPED 2026-04-01 -- [x] **Phase 27: Hermes Adapter** — Install and enable the Hermes adapter, expose it in the Add Agent dialog, and deliver working heartbeat execution with session persistence (completed 2026-04-02) -- [x] **Phase 28: Ollama Integration & Agent Surface** — Detect Ollama, list and recommend models, expose skill/cost/dashboard data for Hermes agents (completed 2026-04-02) -- [x] **Phase 29: Default Provider & End-to-End** — Onboarding fallback to Hermes, agent template compatibility, GSD workflow validation, full end-to-end smoke test (completed 2026-04-02) +### Phase 1: Foundation +**Goal**: Establish the display-layer rename infrastructure, git hygiene tooling, and rebase safety primitives that all subsequent phases depend on +**Plans**: 2/2 plans complete ---- +Plans: +- [x] 01-01-PLAN.md — Branding package, VOCAB constants, commit-msg hook +- [x] 01-02-PLAN.md — Zone taxonomy, rerere config, rebase safety infrastructure -## Phase Details +
+ +
+✅ v1.3 Chat & PWA (Phases 21-26) - SHIPPED 2026-04-02 + +### Phase 21: Chat Foundation +**Goal**: Users can have real-time chat conversations with agents +**Plans**: 7/7 plans complete + +### Phase 22: Agent Streaming +**Goal**: Agent responses stream in real-time with identity, edit, retry, and stop controls +**Plans**: 5/5 plans complete + +### Phase 23: Brainstormer Flow +**Goal**: Users can turn a chat conversation into a tracked project with one handoff action +**Plans**: 4/4 plans complete + +### Phase 24: Search, History & Branching +**Goal**: Users can find, bookmark, branch, and export any conversation +**Plans**: 4/4 plans complete + +### Phase 25: File System +**Goal**: Users can upload, preview, and version files within chat; voice input transcribes speech to text +**Plans**: 9/9 plans complete + +### Phase 26: PWA & Performance +**Goal**: Nexus installs as a PWA, works offline, and loads fast on mobile +**Plans**: 5/5 plans complete + +
+ +
+✅ v1.4 Hermes Default Provider (Phases 27-29) - SHIPPED 2026-04-02 ### Phase 27: Hermes Adapter **Goal**: Users can create a Hermes agent in Nexus, configure it, and have it execute heartbeats that spawn `hermes chat -q`, return a result, and persist the session across runs -**Depends on**: Nothing (first phase of v1.4; depends on v1.3 milestone being shipped) -**Requirements**: HERM-01, HERM-02, HERM-03, HERM-04 -**Success Criteria** (what must be TRUE): - 1. "Hermes" appears in the "Add Agent" dropdown and can be selected to create a new agent - 2. When creating a Hermes agent, the user can pick a model and toggle tool permissions; the agent is saved and appears in the agent list - 3. When a heartbeat fires for a Hermes agent, Nexus spawns `hermes chat -q` with the task, the process completes, and the result is written back as a task update - 4. A second heartbeat on the same Hermes agent resumes the prior session via `--resume`; context from the previous run is accessible -**Plans:** 1/1 plans complete +**Plans**: 1/1 plans complete + Plans: - [x] 27-01-PLAN.md — Close four integration gaps: SESSIONED_LOCAL_ADAPTERS, create-mode toolsets bug, duplicate constant, session codec test ### Phase 28: Ollama Integration & Agent Surface **Goal**: Users can see which Ollama models are available, get a recommendation for their hardware, configure any Hermes agent to use a local model, and see Hermes-specific runtime data in the dashboard and agent config -**Depends on**: Phase 27 -**Requirements**: OLLA-01, OLLA-02, OLLA-03, OLLA-04, OLLA-05, HERM-05, HERM-06, HERM-07 -**Success Criteria** (what must be TRUE): - 1. When Ollama is installed, a status indicator in the Hermes agent config shows "Ollama detected" with the version; when Ollama is absent, the config shows installation instructions with a link - 2. A Hermes agent's model selector lists all locally available Ollama models pulled from `ollama list` - 3. User can save a Hermes agent with any Ollama model from the list; heartbeats use the selected model - 4. The model selector shows a recommended model (highlighted) based on detected RAM/VRAM, derived from a shipped catalog - 5. The agent config page shows Nexus-managed skills alongside Hermes native skills in a single unified list - 6. The dashboard agent card for a Hermes agent shows model name, memory usage, and native skill count - 7. Token usage and estimated model cost are recorded per heartbeat and surfaced in the cost tracking view -**Plans:** 3/3 plans complete +**Plans**: 3/3 plans complete + Plans: - [x] 28-01-PLAN.md — Ollama service, routes, model catalog, and unit tests - [x] 28-02-PLAN.md — UI model selector dropdown, install callout, Hermes skill badge - [x] 28-03-PLAN.md — Hermes stateJson runtime data and dashboard HermesRuntimeCard -**UI hint**: yes ### Phase 29: Default Provider & End-to-End **Goal**: A fresh Nexus install with only Hermes and Ollama works end-to-end — onboarding offers Hermes as the default, PM and Engineer templates run correctly on the Hermes runtime, and GSD workflow tasks complete successfully -**Depends on**: Phase 28 -**Requirements**: DFLT-01, DFLT-02, DFLT-03, DFLT-04 -**Success Criteria** (what must be TRUE): - 1. During onboarding, if no cloud provider API key is detected, the setup flow offers "Use Hermes (local, free)" as the default provider with a one-click path to configure it - 2. Creating a PM agent or Engineer agent from the predefined templates and assigning it to a Hermes runtime produces a working agent — heartbeats execute and return meaningful results - 3. Running a full GSD workflow task (create issue, assign to Engineer, execute heartbeat, mark complete) with Hermes as the sole runtime completes without errors - 4. A machine with only Nexus, Hermes, and Ollama installed (no Anthropic/OpenAI key) can complete the entire onboarding-to-first-task flow with no paywalls or missing-key errors -**Plans:** 2/2 plans complete +**Plans**: 2/2 plans complete + Plans: - [x] 29-01-PLAN.md — Adapter probe route, onboarding wizard Hermes fallback, adapter-neutral templates - [x] 29-02-PLAN.md — Hermes skill injection via promptTemplate, integration tests +
+ +--- + +### 🚧 v1.5 Smart Onboarding + Personal AI Assistant (In Progress) + +**Milestone Goal:** The definitive onboarding experience — hardware detection, tiered provider setup (local/free cloud/paid), and a Personal AI Assistant mode that coexists with the Project Builder. + +## Phases + +- [ ] **Phase 30: Hardware Detection + Mode Selection** — Unauthenticated hardware probe, Apple Silicon unified memory handling, model recommendation database, and mode selector that gates all assistant-specific features +- [ ] **Phase 31: Puter.js Zero-Config Cloud** — Server-proxied Puter.js adapter with full cost tracking, Google OAuth PKCE tier, and subscription auto-detection; no API keys required for zero-config path +- [ ] **Phase 32: Multi-Step Onboarding Wizard** — Assemble all provider tiers and hardware data into a skippable multi-step wizard; summary screen routes directly into chat +- [ ] **Phase 33: Persistent Memory + Personal Assistant Mode** — File-backed memory with write-time sanitization, PersonalAssistantPage, conversation handoff to PM agent +- [ ] **Phase 34: Voice** — Piper TTS with pre-warm progress, Whisper STT wired into voice service, onboarding voice step activated +- [ ] **Phase 35: npx buildthis CLI** — Standalone bootstrapper package with hardware detection and provider tiering parity with web onboarding + +--- + +## Phase Details + +### Phase 30: Hardware Detection + Mode Selection +**Goal**: Users see accurate hardware information during onboarding, get a model recommendation matched to their machine, and choose a mode that correctly gates all downstream features — with the probe working before board auth exists +**Depends on**: Phase 29 (v1.4 shipped) +**Requirements**: ONBD-01, ONBD-02, ONBD-03, ONBD-07 +**Success Criteria** (what must be TRUE): + 1. On a fresh install (before any board auth token exists), the hardware probe returns GPU, RAM, and Apple Silicon unified memory data within 5 seconds + 2. A Mac Mini M4 reports "unified memory" (not VRAM) with the 0.75 multiplier applied and copy that says "runs entirely on your machine" + 3. The mode selector (Personal AI Assistant / Project Builder / Both) is visible during onboarding and the selected mode is persisted; assistant-specific UI is hidden when Project Builder-only is chosen + 4. The model recommendation shown to the user matches an entry in the pre-built JSON catalog for the detected hardware tier (GPU / Apple Silicon / CPU-only) +**Plans**: TBD + +### Phase 31: Puter.js Zero-Config Cloud +**Goal**: Users without Ollama installed can reach working AI in one click via Puter.js — all calls server-proxied, tokens server-stored, cost tracked; Google OAuth and subscription auto-detection round out the provider tier +**Depends on**: Phase 30 +**Requirements**: CLOUD-01, CLOUD-02, CLOUD-03, CLOUD-04, CLOUD-05 +**Success Criteria** (what must be TRUE): + 1. A user with no Ollama and no API keys clicks "Continue with Puter" in onboarding, completes the Puter auth popup, and immediately gets a working chat response — no API key entry required + 2. All Puter AI calls flow through `POST /api/puter-proxy/chat` (verifiable in server logs); the Puter auth token is stored server-side via secretService, not in localStorage + 3. Token cost for Puter responses appears in the cost tracking view, attributed correctly per conversation + 4. A user with Hermes, Claude Code, or OpenClaw already installed sees those tools pre-filled in the provider configuration step with no manual entry + 5. A user clicking "Sign in with Google" for Gemini completes PKCE OAuth and gets a Gemini-backed chat response; the UI displays a policy-risk note that Google OAuth may trigger abuse detection +**Plans**: TBD +**UI hint**: yes + +### Phase 32: Multi-Step Onboarding Wizard +**Goal**: Users move through a complete, skippable onboarding flow that assembles hardware data, provider selection, and voice options into a summary screen — and can jump straight into chat from there +**Depends on**: Phase 31 +**Requirements**: ONBD-04, ONBD-05, ONBD-06 +**Success Criteria** (what must be TRUE): + 1. A user can click "Skip" on every onboarding step (hardware, provider, voice) and reach the summary screen; the resulting workspace has at least one working agent with a valid provider + 2. The summary screen shows the configured providers and agent-model pairings for the selected mode; no corporate language ("company", "CEO", "mission") appears anywhere in the flow + 3. From the summary screen, one click navigates directly to the Personal Assistant chat or the project dashboard (depending on chosen mode) with no additional prompts +**Plans**: TBD +**UI hint**: yes + +### Phase 33: Persistent Memory + Personal Assistant Mode +**Goal**: Users in Personal AI Assistant mode accumulate memory across sessions that shapes future responses — with no risk of credentials leaking into prompts — and can hand off any conversation to a PM agent with context intact +**Depends on**: Phase 32 +**Requirements**: ASST-01, ASST-02, ASST-03, ASST-04 +**Success Criteria** (what must be TRUE): + 1. A fact stated in one chat session ("I prefer TypeScript") is referenced correctly by the assistant in a new session started after closing and reopening the browser + 2. Pasting an API key or token into chat and then starting a new session results in the assistant having no knowledge of that credential — the sanitization blocklist prevented it from being stored + 3. A user clicks "Turn this into a project" in an assistant conversation; a PM agent is created with a system message containing the conversation summary and they land on the project dashboard + 4. A user with mode set to "Both" can switch between Personal Assistant chat and the project dashboard without losing context or cross-contaminating assistant memory with project agent messages +**Plans**: TBD +**UI hint**: yes + +### Phase 34: Voice +**Goal**: Users can speak to the assistant (Whisper STT) and hear responses read aloud (Piper TTS) — Piper pre-warms visibly so the first synthesis call does not appear broken, and voice is offered during onboarding based on hardware capability +**Depends on**: Phase 32 +**Requirements**: VOICE-01, VOICE-02, VOICE-03 +**Success Criteria** (what must be TRUE): + 1. On a CPU-only machine (no GPU), enabling Piper TTS in the assistant produces audible speech output within a reasonable time after the first synthesis (not a silent hang) + 2. When Piper's WASM voice model is downloading for the first time, a visible progress indicator is shown before the TTS toggle is enabled; the download completes and TTS works without a page reload + 3. The onboarding voice step offers Whisper STT and Piper TTS toggles only when the hardware detection step has confirmed sufficient capability; on hardware below the threshold, the step is skipped or shows a capability warning +**Plans**: TBD + +### Phase 35: npx buildthis CLI +**Goal**: A developer can run `npx buildthis` on a fresh machine and either open an already-running Nexus or be guided through install — with the same hardware detection and provider tiering as the web onboarding +**Depends on**: Phase 30 (hardware detection service must exist) +**Requirements**: CLI-01, CLI-02 +**Success Criteria** (what must be TRUE): + 1. Running `npx buildthis` on a machine where Nexus is already running opens the Nexus UI in the default browser; running it on a machine with no Nexus guides the user through installation steps + 2. The CLI bootstrapper detects the same hardware tier (GPU / Apple Silicon / CPU-only) as the web onboarding and presents the matching provider tier recommendations in the terminal prompt +**Plans**: TBD + --- ## Coverage Validation -All 16 v1 requirements are mapped to exactly one phase. No orphans. +All 21 v1.5 requirements are mapped to exactly one phase. No orphans. | Requirement | Phase | |-------------|-------| -| HERM-01 | 27 | -| HERM-02 | 27 | -| HERM-03 | 27 | -| HERM-04 | 27 | -| HERM-05 | 28 | -| HERM-06 | 28 | -| HERM-07 | 28 | -| OLLA-01 | 28 | -| OLLA-02 | 28 | -| OLLA-03 | 28 | -| OLLA-04 | 28 | -| OLLA-05 | 28 | -| DFLT-01 | 29 | -| DFLT-02 | 29 | -| DFLT-03 | 29 | -| DFLT-04 | 29 | +| ONBD-01 | 30 | +| ONBD-02 | 30 | +| ONBD-03 | 30 | +| ONBD-07 | 30 | +| CLOUD-01 | 31 | +| CLOUD-02 | 31 | +| CLOUD-03 | 31 | +| CLOUD-04 | 31 | +| CLOUD-05 | 31 | +| ONBD-04 | 32 | +| ONBD-05 | 32 | +| ONBD-06 | 32 | +| ASST-01 | 33 | +| ASST-02 | 33 | +| ASST-03 | 33 | +| ASST-04 | 33 | +| VOICE-01 | 34 | +| VOICE-02 | 34 | +| VOICE-03 | 34 | +| CLI-01 | 35 | +| CLI-02 | 35 | --- @@ -95,6 +201,19 @@ All 16 v1 requirements are mapped to exactly one phase. No orphans. | Phase | Milestone | Plans Complete | Status | Completed | |-------|-----------|----------------|--------|-----------| -| 27. Hermes Adapter | v1.4 | 1/1 | Complete | 2026-04-02 | -| 28. Ollama Integration & Agent Surface | v1.4 | 3/3 | Complete | 2026-04-02 | -| 29. Default Provider & End-to-End | v1.4 | 2/2 | Complete | 2026-04-02 | +| 1. Foundation | v1.2.1 | 2/2 | Complete | 2026-04-01 | +| 21. Chat Foundation | v1.3 | 7/7 | Complete | 2026-04-02 | +| 22. Agent Streaming | v1.3 | 5/5 | Complete | 2026-04-02 | +| 23. Brainstormer Flow | v1.3 | 4/4 | Complete | 2026-04-02 | +| 24. Search, History & Branching | v1.3 | 4/4 | Complete | 2026-04-02 | +| 25. File System | v1.3 | 9/9 | Complete | 2026-04-02 | +| 26. PWA & Performance | v1.3 | 5/5 | Complete | 2026-04-02 | +| 27. Hermes Adapter | v1.4 | 1/1 | Complete | 2026-04-02 | +| 28. Ollama Integration & Agent Surface | v1.4 | 3/3 | Complete | 2026-04-02 | +| 29. Default Provider & End-to-End | v1.4 | 2/2 | Complete | 2026-04-02 | +| 30. Hardware Detection + Mode Selection | v1.5 | 0/TBD | Not started | - | +| 31. Puter.js Zero-Config Cloud | v1.5 | 0/TBD | Not started | - | +| 32. Multi-Step Onboarding Wizard | v1.5 | 0/TBD | Not started | - | +| 33. Persistent Memory + Personal Assistant Mode | v1.5 | 0/TBD | Not started | - | +| 34. Voice | v1.5 | 0/TBD | Not started | - | +| 35. npx buildthis CLI | v1.5 | 0/TBD | Not started | - | diff --git a/.planning/STATE.md b/.planning/STATE.md index 443109bf..7f61883b 100644 --- a/.planning/STATE.md +++ b/.planning/STATE.md @@ -7,7 +7,7 @@ stopped_at: null last_updated: "2026-04-02T18:00:00.000Z" last_activity: 2026-04-02 progress: - total_phases: 0 + total_phases: 6 completed_phases: 0 total_plans: 0 completed_plans: 0 @@ -21,28 +21,22 @@ progress: See: .planning/PROJECT.md (updated 2026-04-02) **Core value:** A fresh onboard asks for ONE thing (root directory), auto-creates PM + Engineer agents, and drops you in the dashboard. -**Current focus:** Defining requirements for v1.5 +**Current focus:** Phase 30 — Hardware Detection + Mode Selection (ready to plan) ## Current Position -Phase: Not started (defining requirements) -Plan: — -Status: Defining requirements -Last activity: 2026-04-02 — Milestone v1.5 started +Phase: 30 of 35 (Hardware Detection + Mode Selection) +Plan: — (not yet planned) +Status: Ready to plan +Last activity: 2026-04-02 — v1.5 roadmap created; 21 requirements mapped across 6 phases (30-35) Progress: [__________] 0% -### Upstream Rebase Log - -| Date | Commits Behind | Conflicts | Build | Notes | -|------|---------------|-----------|-------|-------| -| 2026-04-01 | 0 | 0 | OK | Already rebased from 120+ commits session; upstream hasn't moved | - ## Performance Metrics **Velocity:** -- Total plans completed: 0 +- Total plans completed: 0 (v1.5) - Average duration: - - Total execution time: 0 hours @@ -54,149 +48,26 @@ Progress: [__________] 0% **Recent Trend:** -- Last 5 plans: none yet +- Last 5 plans: none yet (v1.5) - Trend: - *Updated after each plan completion* -| Phase 01-foundation P01 | 2 | 2 tasks | 7 files | -| Phase 21-chat-foundation P02 | ~15min | 2 tasks | 5 files | -| Phase 21-chat-foundation P01 | 2 | 2 tasks | 8 files | -| Phase 21-chat-foundation P00 | 2 | 2 tasks | 4 files | -| Phase 21-chat-foundation P04 | 4min | 2 tasks | 7 files | -| Phase 21-chat-foundation P03 | 6 | 2 tasks | 6 files | -| Phase 21-chat-foundation P05 | 4 | 3 tasks | 8 files | -| Phase 21-chat-foundation P06 | 10min | 2 tasks | 7 files | -| Phase 22-agent-streaming P01 | 6min | 2 tasks | 6 files | -| Phase 22-agent-streaming P03 | 3 | 2 tasks | 4 files | -| Phase 22-agent-streaming P04 | 4min | 2 tasks | 5 files | -| Phase 22-agent-streaming P05 | 20min | 3 tasks | 6 files | -| Phase 23-brainstormer-flow P00 | 3min | 2 tasks | 11 files | -| Phase 23-brainstormer-flow P02 | 5min | 2 tasks | 6 files | -| Phase 23-brainstormer-flow P01 | 10min | 2 tasks | 2 files | -| Phase 23-brainstormer-flow P03 | 5 | 2 tasks | 4 files | -| Phase 24-search-history-branching P00 | 5 | 2 tasks | 9 files | -| Phase 24-search-history-branching P01 | 12 | 2 tasks | 2 files | -| Phase 24-search-history-branching P02 | 3min | 2 tasks | 7 files | -| Phase 24-search-history-branching P03 | 4 | 3 tasks | 7 files | -| Phase 25-file-system P00 | 6 | 2 tasks | 11 files | -| Phase 25-file-system P02 | 15 | 2 tasks | 5 files | -| Phase 25-file-system P01 | 15 | 2 tasks | 17 files | -| Phase 25-file-system P03 | 3 | 2 tasks | 7 files | -| Phase 25-file-system P08 | 8 | 2 tasks | 5 files | -| Phase 25-file-system P04 | 5min | 2 tasks | 5 files | -| Phase 25-file-system P06 | 5 | 2 tasks | 5 files | -| Phase 25-file-system P07 | 10 | 2 tasks | 6 files | -| Phase 26-pwa-performance P00 | 5 | 2 tasks | 9 files | -| Phase 26-pwa-performance P01 | 4 | 2 tasks | 2 files | -| Phase 26-pwa-performance P02 | 20 | 2 tasks | 8 files | -| Phase 26-pwa-performance P04 | 15 | 2 tasks | 10 files | -| Phase 27-hermes-adapter P01 | 2 | 3 tasks | 3 files | -| Phase 28-ollama-integration P01 | 3 | 2 tasks | 6 files | -| Phase 28 P03 | 12 | 2 tasks | 3 files | -| Phase 28-ollama-integration P02 | 5min | 2 tasks | 3 files | -| Phase 29-default-provider P01 | 8 | 2 tasks | 4 files | -| Phase 29-default-provider P02 | 12 | 2 tasks | 8 files | ## Accumulated Context ### Decisions Decisions are logged in PROJECT.md Key Decisions table. -Recent decisions affecting current work: +Key constraints for v1.5 (established at roadmap): -- Roadmap: Company → Workspace (not Project) to avoid collision with existing Project entity -- Roadmap: Display-only renames — all code identifiers, DB schema, routes, env vars unchanged -- Roadmap: Branding package (`packages/branding/`) as single string mutation surface -- Roadmap: Vite alias redirects OnboardingWizard import to Nexus-owned replacement (Phase 4) -- Roadmap: `~/.nexus` pointer file with read-both-paths fallback for migration safety (Phase 2) -- [Phase 01-foundation]: Keep @paperclipai/branding package name for upstream sync compatibility -- [Phase 01-foundation]: Use as const for VOCAB to enable TypeScript literal type inference on all values -- [Phase 01-foundation]: Hook source tracked in scripts/nexus-commit-msg-hook.sh for post-clone reinstallation -- [Phase 01-foundation]: rerere.autoupdate=true so resolved conflicts are auto-staged during future rebases -- [Phase 21-chat-foundation]: Used object-syntax (table) => ({}) for Drizzle index callbacks to match existing codebase convention in documents.ts, agents.ts -- [Phase 21-chat-foundation]: Use it.todo() (not it.skip()) for Wave 0 test scaffolding — vitest marks todos semantically, no false positives -- [Phase 21-chat-foundation]: Minimal imports in test stubs — no service mocks until Plans 01-05 wire up implementations -- [Phase 21-02]: Use ExtraProps from react-markdown for ChatCodeBlock type signature to satisfy ComponentType constraint -- [Phase 21-02]: Add hljs CSS as plain rules (not @import) scoped to .dark, .theme-tokyo-night, :root selectors -- [Phase 21-chat-foundation]: ChatPanelProvider inside PanelProvider so ChatPanel can call setPanelVisible to close PropertiesPanel -- [Phase 21-chat-foundation]: Chat toggle button in desktop sidebar bottom controls with hidden md:inline-flex -- [Phase 21-03]: Pitfall 3 (updatedAt bump): addMessage always updates chatConversations.updatedAt after inserting — ensures conversation list sort order stays correct -- [Phase 21-03]: Pitfall 5 (auto-title idempotency): WHERE title IS NULL guard makes auto-title set safe to call multiple times -- [Phase 21-03]: Missing export fix: createConversationSchema/updateConversationSchema/createMessageSchema were in validators/chat.ts but not re-exported from shared/src/index.ts -- [Phase 21-chat-foundation]: Two-path handleSend in ChatPanel: direct chatApi for path 1 (no active conversation) avoids hook mutation needing a conversationId that does not exist yet -- [Phase 21-chat-foundation]: messages array in useChatMessages flattened from pages and reversed so display is chronological (API returns desc by createdAt) -- [Phase 21-chat-foundation]: Custom window event (nexus:focus-chat-search) used instead of forwardRef drilling to focus search input from Cmd+K -- [Phase 21-chat-foundation]: Cmd+K handler placed before input-guard early return in useKeyboardShortcuts so it fires globally even from input/textarea -- [Phase 22-agent-streaming]: Use fetch ReadableStream instead of EventSource for POST SSE streaming endpoint -- [Phase 22-agent-streaming]: streamEcho stub yields word-by-word with 50ms delay; Phase 23 replaces with real LLM adapter -- [Phase 22-agent-streaming]: Partial content on stop saved with [stopped] suffix via chatApi.savePartialMessage -- [Phase 22-agent-streaming]: isAnyStreaming prop (not isStreaming) distinguishes global streaming state for disabling edit/retry globally in ChatMessage -- [Phase 22-agent-streaming]: /search disabled with Coming soon; resolveAgentFromContent routes slash > @mention > active agent -- [Phase 22-agent-streaming]: [Phase 22-05]: virtualizer.measure() on streamingContent change handles dynamic height re-measurement for growing streaming message (Pitfall 3) -- [Phase 22-agent-streaming]: [Phase 22-05]: handleRetry truncates from user message (not assistant), removing assistant + all subsequent messages before re-streaming -- [Phase 23-brainstormer-flow]: Added journal entries for idx 47 (nebulous_klaw) and 48 (add_chat_messages_updated_at) retroactively to keep journal consistent with files on disk -- [Phase 23-brainstormer-flow]: Used @/lib/router Link abstraction (not react-router-dom) for ChatTaskCreatedBadge and ChatStatusUpdateBadge — consistent with all other link components in the codebase -- [Phase 23-brainstormer-flow]: ChatSpecCardInner extracted as inner component to avoid conditional hook calls after JSON.parse error path in ChatSpecCard -- [Phase 23-brainstormer-flow]: issueService imported directly in chatRoutes(db) — option (a) from plan, simplest approach matching heartbeat.ts pattern -- [Phase 23-brainstormer-flow]: Used activeConversationId === null as proxy for new conversation in brainstormer auto-select -- [Phase 23-brainstormer-flow]: Used useToast()/pushToast() for error toast in ChatPanel handleHandoff (custom ToastContext, not sonner) -- [Phase 24-search-history-branching]: Used AnyPgColumn type annotation for parentConversationId self-referential FK — matches existing pattern in issues.ts, goals.ts, execution_workspaces.ts -- [Phase 24-search-history-branching]: content_search tsvector column omitted from Drizzle schema — Postgres-generated stored column queried via raw sql`` only -- [Phase 24-search-history-branching]: searchMessages returns empty items early when query.trim() is empty — avoids PostgreSQL error on blank tsquery -- [Phase 24-search-history-branching]: exportConversation uses this.getConversation() to reuse notFound guard without duplicating query logic -- [Phase 24-search-history-branching]: Used HighlightedText React component instead of innerHTML for term highlighting — eliminates XSS surface -- [Phase 24-search-history-branching]: exportConversation returns URL string (not fetch call) since server sends file download via browser navigation -- [Phase 24-search-history-branching]: shouldFilter=false pattern on Command for server-side FTS — all future search dialogs should follow this -- [Phase 24-search-history-branching]: Custom event nexus:open-chat-search routes search trigger through CommandPalette without Cmd+K conflict -- [Phase 24-search-history-branching]: Branch-on-edit checks editedIdx < messages.length - 1 before calling branchConversation -- [Phase 24-search-history-branching]: bookmarkedMessageIds as Set for O(1) lookup rebuilt from useChatBookmarks per-conversation data -- [Phase 25-file-system]: Used object-syntax (table) => ({}) for Drizzle index callbacks in chat_files and chat_file_references — matches existing codebase pattern -- [Phase 25-file-system]: chatFiles uses nullable FKs (SET NULL) for conversationId/messageId — file may exist before a conversation or message is created -- [Phase 25-file-system]: chatFileReferences uses CASCADE deletes for fileId and conversationId — reference has no meaning without both anchors -- [Phase 25-file-system]: Used XHR instead of fetch for chatApi.uploadFile to enable upload progress events (fetch lacks upload.onprogress) -- [Phase 25-file-system]: ChatInput onFilesPicked/pendingFiles/onRemoveFile props are all optional for backward compatibility -- [Phase 25-file-system]: Content-Length uses object.contentLength from storage ?? chatFile.sizeBytes to prevent stream ECONNRESET -- [Phase 25-file-system]: createFileReferenceSchema.safeParse() receives fileId injected from URL param for UUID format validation -- [Phase 25-file-system]: ChatFilePreview shows inline image with max-h-[300px] + ChatFileCard below; non-image types use ChatFileCard only -- [Phase 25-file-system]: listMessages fetches chatFiles with inArray(messageId) as second query, merged in-memory -- [Phase 25-file-system]: completedFileIds captured before clearCompleted in handleSend to avoid race condition -- [Phase 25-file-system]: Use functional setState for transcription append in VoiceRecordButton — avoids stale closure vs native DOM event approach -- [Phase 25-file-system]: enableVoiceInput defaults to false for backward-compat; ChatPanel passes true unconditionally — server returns 503 gracefully if whisper absent -- [Phase 25-file-system]: execFileAsync over exec for whisper CLI invocation — no shell injection risk with system-generated tmpPath -- [Phase 25-file-system]: Used DOMParser + replaceChildren to safely render hljs output — avoids raw HTML injection pattern while preserving same visual output as rehype-highlight -- [Phase 25-file-system]: highlight.js added as explicit ui/package.json dependency (was transitive via rehype-highlight only) -- [Phase 25-file-system]: Used execFile (not exec) for git commands in gitFileService — array-based args prevent shell injection -- [Phase 25-file-system]: Git commit is fire-and-forget after upload — response not blocked by git operation -- [Phase 25-file-system]: History route placed before /content route to avoid Express path ambiguity on /files/:fileId/* -- [Phase 25-file-system]: placeholderService reads/writes PLACEHOLDERS.md using regex row parser to avoid markdown deps -- [Phase 25-file-system]: addEntry is fire-and-forget — response not blocked by placeholder manifest I/O -- [Phase 25-file-system]: resolveDefaultStorageDir() used for projectDir in placeholder service — consistent with git-file-service pattern -- [Phase 26-pwa-performance]: [Phase 26-00]: Cache name changed from paperclip-v2 to nexus-v1; activate event deletes all non-nexus-v1 caches to bust stale entries -- [Phase 26-pwa-performance]: [Phase 26-00]: API paths (/api/*) pass through without SW interception — network-only for all API traffic -- [Phase 26-pwa-performance]: All pages use named exports so React.lazy requires .then(m => ({ default: m.X })) module re-mapping -- [Phase 26-pwa-performance]: manualChunks: vendor-react, vendor-router, vendor-query, vendor-markdown (excludes @mdxeditor/editor to avoid circular deps) -- [Phase 26-pwa-performance]: useMediaQuery uses addEventListener('change') not addListener() — addListener is deprecated in modern browsers -- [Phase 26-pwa-performance]: PullToRefresh wraps ScrollArea in ChatConversationList, not the entire list component — keeps desktop layout unaffected -- [Phase 26-pwa-performance]: MobileChatView uses 100dvh not 100vh — avoids keyboard-shrink issue (RESEARCH Pitfall 3) -- [Phase 26-pwa-performance]: pushService uses named exports (not class) matching existing chat.ts service pattern -- [Phase 26-pwa-performance]: initVapid is graceful — checks env vars before calling setVapidDetails, logs warning if absent -- [Phase 26-pwa-performance]: sendPushToAll uses Promise.allSettled so one failed delivery doesn't block others; stale 410/404 subscriptions auto-deleted -- [Phase 26-pwa-performance]: DELETE /api/push/subscribe uses request body (not URL param) — endpoints are long URLs; api.delete() extended with direct fetch -- [Phase 26-pwa-performance]: NotificationPermissionPrompt engagement gate: agentResponseCount >= 3 derived via useMemo from messages with role === assistant -- [Phase 27-hermes-adapter]: Toolsets field moved inside !isCreate guard — new agents get default toolsets; edit form uses adapterConfig.toolsets correctly -- [Phase 27-hermes-adapter]: Hermes session codec has no cwd field (unlike claude/codex/cursor/gemini) — only sessionId tracked -- [Phase 28-ollama-integration]: Force-added server/src/data/ with git add -f — source catalog JSON is not generated data despite data/ gitignore pattern -- [Phase 28-ollama-integration]: getRecommendedModel uses QUALITY_RANK map (best>reasoning>balanced>fast) to pick highest quality variant within 75% RAM budget -- [Phase 28]: hermesNativeSkillCount derived from agentsApi.skills in UI (not stateJson) — avoids cross-DB query in heartbeat path -- [Phase 28]: COALESCE jsonb concat pattern used for stateJson merge in hermes_local heartbeat — prevents overwriting existing fields -- [Phase 28-ollama-integration]: Used useCompany() hook for companyId in hermes config-fields — consistent with AgentConfigForm pattern -- [Phase 28-ollama-integration]: Create mode only sets model (not provider/base_url) — CreateConfigValues lacks those fields; buildHermesConfig resolves provider at runtime -- [Phase 29-default-provider]: Probe route uses board auth (no companyId) — adapter availability is instance-level, not company-level -- [Phase 29-default-provider]: hermes_local makes directory optional — Hermes runtime does not require cwd to operate -- [Phase 29-default-provider]: AGENT_TEMPLATES in NewAgentDialog are adapter-neutral — /agents/new form handles adapter default logic independently -- [Phase 29-default-provider]: hermesPromptTemplate constructed inline in wizard — static content avoids server round-trip -- [Phase 29-default-provider]: persistSession: true for hermes_local agents — preserves execution context across heartbeat runs -- [Phase 29-default-provider]: Test file duplicates promptTemplate string intentionally — validates the contract, not the implementation +- No DB schema changes — all state in existing JSONB fields (`instance_settings.general`) and file-backed JSON (`data/memory/.json`) +- Puter.js is server-proxied adapter only — `@heyputer/puter.js` browser import is for auth popup only; all AI calls via `POST /api/puter-proxy/chat` +- OAuth tokens (Google, Puter) stored server-side via `secretService` — never in localStorage +- Memory sanitization blocklist applied at write time, not retrieval time +- Apple Silicon: use `os.freemem()` × 0.75 for VRAM estimate; label as "unified memory" not "VRAM"; use `systeminformation` v5 (not v6) +- Unauthenticated `GET /system/providers` endpoint required for pre-auth hardware probe +- Google OAuth cloud tier: include but flag policy risk (Gemini CLI abuse detection issue #21866) +- Skip-all minimum valid state: one working agent with a valid provider must be created when user skips all steps ### Pending Todos @@ -204,10 +75,13 @@ None yet. ### Blockers/Concerns -None identified yet. +- [Phase 31] Puter.js Node.js server-side streaming API surface unverified — confirm `stream: true` works server-side before designing `puterProxyService`; plan-phase should include a research spike +- [Phase 31] Puter.js ToS on server-side request relaying unverified — attribute costs to user's Puter account in all UI copy as mitigation +- [Phase 33] Chat route injection point needs codebase inspection — confirm correct hook location in `server/src/services/chat.ts` during plan-phase +- [Phase 34] smart-whisper Apple Silicon acceleration claim unverified on Mac Mini M4 — fall back to `tiny.en` if `base.en` acceleration not confirmed on device ## Session Continuity -Last session: 2026-04-02T17:43:07.540Z -Stopped at: Completed 29-default-provider-29-02-PLAN.md +Last session: 2026-04-02T18:00:00.000Z +Stopped at: Roadmap created — v1.5 phases 30-35 defined and written to ROADMAP.md Resume file: None