chihlasm/resolutionflow

Fork 0

Files

Michael Chihlas b7d7ff06d2

Mirror to GitHub / mirror (push) Successful in 5s

Details

CI / frontend (pull_request) Successful in 5m8s

Details

CI / backend (pull_request) Successful in 9m46s

Details

CI / e2e (pull_request) Successful in 10m16s

Details

docs(ai): refresh handoff for compute swap

- HANDOFF: rewritten resume point. First action on resume is `git push`
  (commits 0f00ee5 and 665530f are local-only). Visual QA + bug bash is
  the active work; 4 plan-locked items + the structural task-lane fix
  all need real-browser verification.
- CURRENT_TASK: add 0f00ee5 and 665530f to the commit table; reframe
  "Just shipped" as a per-commit summary; flag the task-lane fix as
  needing visual confirmation.
- SESSION_LOG: chronological entry for this session with full detail
  (audit, four polish items, race-condition wiring, structural
  task-lane fix, test status, files touched).
- DECISIONS: new entry "Tag the task-lane state with an owner chatId"
  documenting the structural pattern, what was rejected, and the
  forward implication that future task-lane state slices follow the
  same owner-tagging pattern.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

2026-04-28 08:21:23 -04:00

6.8 KiB

Raw Blame History

HANDOFF.md

Last updated: 2026-04-28 02:00 EDT

Active task: Escalation Mode wedge build. Full status in CURRENT_TASK.md; this file is the resume point.

Branch: feat/escalation-metric-endpoint. Local tip is 665530f. Remote (origin) is at 8914391 — the last two commits (0f00ee5, 665530f) are local-only because the user is swapping computers and asked for the docs/handoff first. Push needed on next session before continuing work. Draft PR #155 is open against main.

What this session did

Two commits, both untested in a real browser:

0f00ee5 feat(escalations): close out plan-locked wedge polish. Four items from the design-plan audit (docs/plans/2026-04-27-escalation-mode-wedge-design.md):
- Live AI assessment refresh — frontend listener for the handoff_assessment_ready SSE event, refetches the handoff and updates magicHandoff / overlayHandoff in place. Closes the async-assessment loop from e8ba74e.
- Suggested-step chips below the composer in AssistantChatPage — surfaces ai_assessment_data.suggested_steps[] post-claim, click prefills the input, hides on first send or explicit X.
- Unread 6px dot on EscalationQueue cards — localStorage-persisted seen set (rf-escalation-seen), clears on open OR claim (NOT hover; Codex correction).
- Race-condition toast on claim conflict — new HandoffAlreadyClaimedError exception, endpoint returns 409 with structured {claimed_by_id, claimed_by_name, claimed_at}, frontend shows "Already claimed by {name} {time_ago}." and bounces the loser back to the queue. Backed by 2 new tests; full handoff/escalation suite (34 tests) green.
665530f fix(assistant-chat): tag task-lane state with owner chatId. Structural fix for the recurring "new session shows previous session's task lane" bug. The earlier fix 8914391 only covered the mount-time entry path; this change makes stale data structurally unable to display by adding taskLaneOwnerChatId state and a render gate taskLaneOwnerChatId === activeChatId ANDed into all three render conditions. Persistence effect now writes ownership chatId, not active chatId — that was the original write-side bug. See DECISIONS.md for the architecture write-up.

Verified: tsc -b clean after both. Backend handoff/escalation suite (34 tests) green. Not verified: anything in a real browser. The user explicitly asked for a debugging session after implementation — that's the next thing.

Resume point

First action: git push the two local commits. 0f00ee5 and 665530f are local-only.
Visual QA + bug bash. End-to-end demo flow:
- Junior escalates → senior gets bell-icon notification → click → magic-moment screen with placeholder AI assessment (because it's now async/background) → assessment populates in place within ~5–15s without manual reopen → Start here → chat surface loads with suggested-step chips above the composer → click a chip prefills input.
- On /escalations: backgrounded tab gets (N) title prefix when an arrival fires; new card has 6px accent dot top-right; clicking the card body OR Pick Up clears the dot (verify it persists across refresh, doesn't clear on hover).
- Race condition: claim the same handoff from two browsers; loser sees toast "Already claimed by {name} {time_ago}." and bounces.
- Task-lane regression check: create a new session via dashboard prefill / pickup / "New Chat" — the lane must NOT flash the previous session's questions/actions. The user previously reported this happening repeatedly; the fix in 665530f should kill it. If it still happens, that's the next debug target.
Deferred follow-ups in CURRENT_TASK.md: snapshot expansion, owner-facing /analytics/escalations page, Playwright e2e for the GTM Loom demo path, eventual cleanup of flowpilot_engine.escalate_session and the dead FlowPilotSessionPage.tsx magic-moment branch.

Useful breadcrumbs

SSE endpoint: backend/app/api/endpoints/session_handoffs.py — stream_escalations.
Pub/sub bus: backend/app/core/escalation_bus.py.
Frontend SSE consumer: frontend/src/api/aiSessions.ts → streamEscalations (now dispatches handoff_created AND handoff_assessment_ready).
Live-arrival queue UI: frontend/src/components/flowpilot/EscalationQueue.tsx.
Magic-moment screen: frontend/src/components/flowpilot/HandoffContextScreen.tsx.
Pickup integration + magic state machine + suggested-step chips + assessment-ready subscription + claim 409 handling + task-lane owner tagging: frontend/src/pages/AssistantChatPage.tsx.
Claim conflict exception: backend/app/services/handoff_manager.py — HandoffAlreadyClaimedError, claim_session, enrich_escalation_async.
Metric endpoint: backend/app/api/endpoints/flowpilot_analytics.py.

Watch-outs

The two new commits are local-only until pushed. Run git push before any other work.
The assessment-ready subscription opens a fresh SSE connection scoped by assessmentMissing && trackedHandoffId. If you change the magic-moment lifecycle, double-check the cleanup deps don't churn the subscription.
The claim conflict path is currently only wired into AssistantChatPage.handleStartHere. useHandoff (used by SessionQueuePage) and FlowPilotSessionPage.tsx (dead) were not updated. If SessionQueuePage claims start mattering, mirror the same axios.isAxiosError(e) && e.response?.status === 409 extraction.
The handoff snapshot is still sparse (problem_summary, problem_domain, status, step_count, confidence_tier). Magic-moment "What's been tried" still only shows engineer notes + step count pre-claim.
HandoffResponse.ai_assessment_data.confidence is typed number on the frontend but the backend currently emits 'low' | 'medium' | 'high'. Runtime handles both; type definition is stale.
Toolbar "Context" button is hidden on revisited active sessions where the senior didn't arrive via magic-moment this session — known scope cut.
Do not reintroduce client.stream()/ASGITransport tests for infinite SSE responses; test the generator directly.
Bus is acceptable for v1 pilot scale only (Railway single-replica). Redis pub/sub is the obvious swap when horizontal scaling appears.

6.8 KiB Raw Blame History Unescape Escape

HANDOFF.md

What this session did

Resume point

Useful breadcrumbs

Watch-outs

6.8 KiB

Raw Blame History