verify and document /monitor end-to-end via Untether Loop mode (no new code expected) #529

## Context

While planning multi-host fleet monitoring (see [`docs/plans/2026-05-13-fleet-monitoring-and-upgrades.md`](https://github.com/littlebearapps/untether/blob/dev/docs/plans/2026-05-13-fleet-monitoring-and-upgrades.md)), the question came up: can the `/monitor` Claude-Code slash command — which uses `/loop` to fire recurring audit passes — actually run end-to-end via Untether (Telegram → stream-json subprocess), or is it CLI-only?

Initial research suggested CLI-only because the subprocess exits between passes, and `ScheduleWakeup` is documented as a no-op outside `/loop` dynamic mode. **That conclusion was wrong** — it ignored the fact that v0.35.3 already shipped a comprehensive solution for exactly this case via #289, #507, #481, and #470. The current best understanding is that `/monitor` **probably already works via Untether** provided Loop mode is enabled for the chat, but it has never been tested end-to-end and isn't called out as a worked example anywhere in the docs.

This issue is to **verify the behaviour, document it, and fix any gaps**.

## What v0.35.3 already provides (foundation)

- **#289 (rc4): `/loop` and ScheduleWakeup support.** New `loop_scheduler` module (sibling of `at_scheduler`); observer hooks in `runners/claude.py` parse `CronCreate` / `ScheduleWakeup` events from stream-json and bind their IDs; persistence to `active_loops.json`; drain integration in `_drain_and_exit`; runaway caps via the new `[loop]` config section; `/config → 🔁 Loop mode` per-chat toggle (default **OFF**); engine-aware (`LOOP_SUPPORTED_ENGINES = frozenset({"claude"})`); re-fire prompt wraps the original with `"Loop iteration N: <prompt>. Do the task now; do not summarize old results unless necessary."`. Empirically grounded: "Untether owns ALL firing across both CronCreate and ScheduleWakeup tool families."
- **#507 (rc11): ScheduleWakeup post-result idle shortcut.** Fixed the bug where `ScheduleWakeup` outside Loop mode held the session alive 58 minutes; the post-result idle watchdog now shortens its effective timeout to `min(timeout_s, max_armed_delay + 60s)` when Loop mode is OFF.
- **#481 (rc11): expected-wait stall suppression + countdowns.** `progress_edits.stall_schedule_wakeup_suppressed` / `stall_monitor_active_suppressed` / etc. prevent false stall warnings during legitimate waits. Long-running tools (Bash, BashOutput, ScheduleWakeup, Monitor) get a heartbeat-driven elapsed-time tail in the progress message.
- **#470 (rc8/9): post-result idle suppression + `✓ turn complete` closing message.** Clean end-state signal at session-close.
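
The #507 timeout rule and the #289 re-fire prompt wrapper can be illustrated with a minimal Python sketch. The function names, parameter names, and the handling of the "nothing armed" case are assumptions for illustration only; the actual code in Untether's watchdog and `loop_scheduler` may be structured quite differently.

```python
from typing import Sequence


def effective_idle_timeout(
    timeout_s: float,
    armed_delays_s: Sequence[float],
    loop_mode_on: bool,
    grace_s: float = 60.0,
) -> float:
    """Sketch of min(timeout_s, max_armed_delay + 60s) from #507 (names hypothetical)."""
    # The shortcut is described as applying only when Loop mode is OFF.
    # Returning the full timeout when nothing is armed is an assumption.
    if loop_mode_on or not armed_delays_s:
        return timeout_s
    return min(timeout_s, max(armed_delays_s) + grace_s)


def refire_prompt(iteration: int, prompt: str) -> str:
    """Wrap the original prompt using the template quoted in the #289 summary."""
    return (
        f"Loop iteration {iteration}: {prompt}. "
        "Do the task now; do not summarize old results unless necessary."
    )
```

Under these assumptions, a single armed 300-second wakeup with Loop mode OFF would cap a one-hour idle timeout at 360 seconds, which is the behaviour the #507 fix is meant to produce.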

## Scope of this issue

**Verify + document. No new code unless a bug surfaces.**

Hypothesis: with Loop mode enabled for the chat, typing `/monitor untether-staging 30m 5m` in Telegram should produce the following sequence:

1. Pass 1 runs inline in the first Claude Code subprocess.
2. The monitor command's `Skill(skill="loop", args="5m Read .../loop-prompt.md ...")` invocation **is detected by the loop-scheduler observer** (the observer parses upstream `CronCreate` / `ScheduleWakeup` tool events from stream-json; the `/loop` skill itself presumably fires one of those primitives under the hood).
3. Untether persists the loop to `active_loops.json`, making it restart-resilient.
4. Untether re-fires `/monitor untether-staging` with the existing run-id every 5 minutes for 30 minutes total.
5. Each re-fire reads the existing state dir, runs the next pass, and writes findings, audit-log entries, and GitHub issue updates.
6. Window-close behaviour (the `if [ "$NOW" -ge "$END_TS" ]` guard in loop-prompt.tmpl) triggers the synthesis pass exactly once, then short-circuits subsequent fires via the `.synthesis-done` marker.
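
The expected timeline for the `30m 5m` example can be sketched in Python. This is a simulation under stated assumptions, not the real loop-prompt logic: it assumes fires land exactly on interval multiples, that pass 1 runs at t=0, and that the loop keeps firing after window close; the function names and the `extra_fires` parameter are hypothetical.

```python
from typing import List, Tuple


def decide(now: int, end_ts: int, synthesis_done: bool) -> str:
    """Mirror the NOW >= END_TS guard plus the .synthesis-done marker check."""
    if now < end_ts:
        return "pass"
    return "skip" if synthesis_done else "synthesis"


def simulate(window_s: int, interval_s: int, extra_fires: int = 2) -> List[Tuple[int, str]]:
    """Return (seconds since start, action) for each fire in the window and just after."""
    timeline = [(0, "pass")]  # pass 1 runs inline in the first subprocess
    synthesis_done = False
    fires = window_s // interval_s + extra_fires
    for k in range(1, fires + 1):
        now = k * interval_s
        action = decide(now, window_s, synthesis_done)
        if action == "synthesis":
            synthesis_done = True  # stands in for writing the marker file
        timeline.append((now, action))
    return timeline


if __name__ == "__main__":
    for t, action in simulate(30 * 60, 5 * 60):
        print(f"t={t // 60:>2}m  {action}")
```

With a 30-minute window and a 5-minute interval, this model yields six audit passes (t=0 through t=25m), one synthesis pass at t=30m, and silent skips afterwards.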

The big unknown: whether the monitor command's *specific* invocation shape gets observed correctly. The observer in #289 looks for `CronCreate` / `ScheduleWakeup` tool calls; `/monitor` calls the loop *skill*, which presumably ends up firing one of those. If the wiring is right, this Just Works. If not, the loop-scheduler observer needs a small extension to recognise the skill-driven path.

## Tasks

- [ ] Enable Loop mode for a test chat: `/config → 🔁 Loop mode → on`. Verify the warning message about cost and quota appears.
- [ ] In the same chat, fire `/monitor untether-staging 30m 5m` (short window and short interval, so easier to observe). Watch logs.
- [ ] Confirm pass 1 runs and writes to `~/.local/state/monitor/untether-staging/<run-id>/audit-log.md`.
- [ ] Confirm `~/.untether/active_loops.json` gains an entry for this loop after pass 1 completes.
- [ ] Wait ~5 min, confirm pass 2 fires automatically via Untether (no Telegram input needed). Verify the structlog event chain: `claude.loop.observed` (or whatever #289 emits) → `claude.loop.fired` → `handle.incoming` with `Loop iteration 2: …` prefix → pass 2 audit-log entry.
- [ ] Confirm subsequent passes (3, 4, 5, 6) fire on schedule.
- [ ] At window-close (30 min in), confirm synthesis pass fires exactly once. Subsequent loop fires should short-circuit silently.
- [ ] **Drain-on-restart test:** mid-window, run `systemctl --user restart untether-dev`. Verify the loop survives restart (`active_loops.json` reloads, fires resume from where they left off).
- [ ] **Drain-on-cancel test:** `/cancel` from Telegram. Verify the loop is removed from `active_loops.json` and no further fires happen.
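
For the event-chain check in the pass 2 task, a small helper can confirm that the expected events appear in order among whatever else is logged. This is a sketch only: the event names are copied from the task list above, which itself notes that #289 may emit different ones, and how the event names are extracted from the structlog output is left to the reader.

```python
from typing import Iterable, Sequence

# Names taken from the task list; adjust to whatever #289 actually emits.
EXPECTED_CHAIN = ("claude.loop.observed", "claude.loop.fired", "handle.incoming")


def chain_in_order(events: Iterable[str], expected: Sequence[str]) -> bool:
    """Return True if `expected` occurs as an ordered subsequence of `events`."""
    it = iter(events)
    # Each any() consumes the iterator up to and including the match,
    # so the next expected name must appear later in the stream.
    return all(any(event == want for event in it) for want in expected)
```

Unrelated events between the expected ones are ignored, so the check stays valid even when progress or heartbeat events are interleaved.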

## Documentation acceptance

- [ ] `docs/how-to/schedule-tasks.md` gains a worked `/monitor` example under the "Loop mode" section (#289), explaining the per-chat opt-in requirement and showing the expected log output.
- [ ] `~/.claude/quickrefs/monitor.md` adds a Telegram-invocation note: "To run /monitor via Untether, enable Loop mode for the chat first via `/config → 🔁 Loop mode`. Audit results fire to GitHub as normal; no Telegram chatter between passes."
- [ ] User's auto-memory `feedback_no_schedule_wakeup_outside_loop.md` updated with the post-#289 / #507 nuance: "ScheduleWakeup outside /loop is a no-op for time-based delivery in TUI mode; in Untether mode with Loop mode ON for the chat, Untether takes over firing, and that's the supported path."

## Out of scope

- **Fleet meta-target (`/monitor untether-fleet`)**: tracked separately in the fleet rollout plan ([`docs/plans/2026-05-13-fleet-monitoring-and-upgrades.md`](https://github.com/littlebearapps/untether/blob/dev/docs/plans/2026-05-13-fleet-monitoring-and-upgrades.md)). Once this issue confirms single-host `/monitor` works via Untether, the fleet variant is a thin extension.
- **Loop mode default = ON**: keep it opt-in; the cost surface argues for a deliberate user gesture.
- **Auto-enable Loop mode when `/monitor` is invoked**: too magical. Better to surface a clear error message ("Loop mode is OFF for this chat; `/monitor` needs it on to continue past pass 1. Toggle via `/config → 🔁 Loop mode`.") and let the user opt in deliberately.

## Cross-links

- Foundation: #289, #507, #481, #470
- Fleet rollout plan: `docs/plans/2026-05-13-fleet-monitoring-and-upgrades.md`
- User feedback memory: `feedback_no_schedule_wakeup_outside_loop.md` (`~/.claude/projects/-home-nathan-untether/memory/`); needs update once verified
- `/monitor` quickref: `~/.claude/quickrefs/monitor.md`
- Loop mode how-to: `docs/how-to/schedule-tasks.md` ("Loop mode" section added in #289)

## Risk

Low. The infrastructure is already shipped and well-tested (58 tests for #289 alone). The most likely failure mode is a small wiring mismatch between how `/monitor` invokes `/loop` and what the observer expects, which is fixable in a follow-up PR if it surfaces. Worst case, the issue gets closed as "works as expected" with just doc updates.
