hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-25 03:00:23 +00:00

Author	SHA1	Message	Date
nesquena-hermes	ec403fa3cf	fix(routes): persist openai-codex provider unconditionally on stale-session repair (Opus stage-303 follow-up) Opus advisor on stage-303 (#1738 verification Q4) flagged that the catalog-coverage branch produces a redundant repair-write per chat-start when the active Codex default is itself slash-prefixed: the repair sets `provider_context = None`, the next chat-start hits the same branch because `requested_provider is None` again, and the repair fires repeatedly. In practice Codex `default_model` is always a bare `gpt-...` ID from the Codex catalog, so this is theoretical. But once we've decided this session belongs to Codex, we should persist that decision. Drop the conditional catalog-coverage check and unconditionally attach `raw_active_provider` ("openai-codex") on this repair path. The shape is now stable across resolutions. Absorb-in-release per Opus stage-303 verdict — small, defensive, ≤10 LOC.	2026-05-06 15:18:34 +00:00
Michael Lam	3e2a945501	fix: repair stale OpenAI session models for Codex	2026-05-06 14:53:40 +00:00
nesquena-hermes	97aa3247e1	fix(test-isolation): in-stage fixes for stage-302 pre-release gate PR #1728's path/mtime-aware get_config() reload broke the common test idiom monkeypatch.setattr(config, 'cfg', {...}). The cfg = _cfg_cache alias bound at import time means the rebinding only changes the module attribute; _cfg_cache stays unchanged, so _cfg_has_in_memory_overrides() returned False and the path-aware reload silently overwrote the test's override. test_issue1426_openrouter_* and test_issue1680_codex_* failed in the full suite while passing standalone — exact polluter signature. Fix: - _cfg_has_in_memory_overrides() now also detects cfg-rebind via cfg is not _cfg_cache. - get_config() returns cfg (the override) when it differs from _cfg_cache, so callers see the test's intended override. - 4 new regression tests pin both prongs in test_stage302_config_override_regression.py. Defense-in-depth (prong 2 of test-isolation-flake-recipe): - test_sprint3.py::test_skills_list and test_skills_list_has_required_fields now skip on empty skills list rather than asserting > 0 / IndexError, so future profile-switch / SKILLS_DIR repointing pollutions don't break the build. The contract under test is 'API returns a non-empty list when there are entries' — empty list signals a polluter elsewhere. Pre-existing wall-clock flake fix (absorb-in-release): - test_issue1144_session_time_sync.py::test_relative_time_uses_server_clock now pins Date.now() to a fixed instant. Without pinning, when CI runs near 08:00 UTC the projected server time crosses midnight and '5 minutes ago' silently becomes '1d'. Same time-of-day-pin pattern as the sibling test_session_bucket_uses_server_clock used. Test count: 4580 → 4584 (+4 regression tests). 0 failures, stably green across multiple runs.	2026-05-06 08:10:08 +00:00
starship-s	74eb55d986	fix(profile): preserve context when starting chats	2026-05-06 06:27:00 +00:00
Michael Lam	63239d5b3c	fix(models): delegate generic provider catalogs to Hermes CLI	2026-05-06 06:26:44 +00:00
Michael Lam	e509faec44	feat: link Claude Code OAuth in onboarding	2026-05-06 06:26:43 +00:00
nesquena-hermes	29878259ca	docs(troubleshooting): bake the #1695 diagnostic flow into the error message + a new troubleshooting doc Closes #1695. @Patrick-81 reported the bare "AIAgent not available -- check that hermes-agent is on sys.path" error on a symlinked install (~/Programmes/hermes-agent linked to ~/hermes-agent). The maintainer's response — three diagnostic commands plus `pip install -e .` in the agent dir — fixed it for them. This PR captures both halves of that learning so the next user with the same shape doesn't have to file a new issue: 1. Error message diagnostic block. New helper `_aiagent_import_error_detail()` in api/streaming.py builds a multi-line diagnostic when the import fails, including: - the running Python interpreter - HERMES_WEBUI_AGENT_DIR (set value, or "(not set)") - sys.path entries that mention hermes/agent (or "no entries mention..." — itself a strong diagnostic signal) - the most-common fix (`pip install -e .` in the agent dir) - a pointer to docs/troubleshooting.md The original error message string is preserved as the FIRST line so existing log scrapers and docs-search keep matching. Helper is kept as a separate function so it stays out of the hot path until we actually need to raise — building it on every successful import would be wasted work. 2. New docs/troubleshooting.md. Symptom → Why → Diagnostic commands → Fix → When-to-file-a-bug template, with one entry to start: the "AIAgent not available" flow Patrick-81 walked through. Future recurring failure modes follow the same template. Required a one-line addition to .gitignore — docs/* is gitignored with an allowlist, and the new file needed `!docs/troubleshooting.md` to be tracked. 3. README link. docs/troubleshooting.md added to the `## Docs` section so users know where to look first. 13 regression tests in tests/test_1695_aiagent_import_error_detail.py: 9 for the helper output shape (preserves original message line, includes running python, shows HERMES_WEBUI_AGENT_DIR set/unset both ways, includes pip-install-e hint, points at troubleshooting doc, lists relevant sys.path entries when present, says "no entries..." when absent, output is multi-line) plus 4 for the docs-presence regression (file exists, has the AIAgent section, includes pip install -e ., describes the diagnostic chain with readlink + agent/__init__.py verification). 190 streaming/aiagent tests pass after the change. ast.parse on api/streaming.py clean. CI failure on prior push was due to the docs/* gitignore swallowing the new troubleshooting.md file silently — this commit adds the allowlist entry so the file is tracked.	2026-05-05 22:14:07 +00:00
Nathan Esquenazi	b6567addb1	Stage 303: PR #1719	2026-05-05 21:58:21 +00:00
Nathan Esquenazi	afe0c26df9	Stage 303: PR #1720	2026-05-05 21:58:21 +00:00
Michael Lam	f97b040985	fix: raise persisted tool snippet cap	2026-05-05 13:46:54 -07:00
Michael Lam	2c5acb9725	feat: show active elapsed timer in compact activity	2026-05-05 13:42:47 -07:00
ai-ag2026	8b34a79f02	fix: preserve imported session lineage visibility	2026-05-05 22:32:19 +02:00
Michael Lam	fdeac578da	feat: add VPS resource health panel	2026-05-05 17:30:56 +00:00
Nathan Esquenazi	a66feb2661	Stage 301: PR #1703	2026-05-05 15:41:43 +00:00
Nathan Esquenazi	08ea4fbc05	Stage 301: PR #1685	2026-05-05 15:41:43 +00:00
Nathan Esquenazi	bf8b5edc23	Stage 301: PR #1701	2026-05-05 15:41:43 +00:00
Nathan Esquenazi	db972afd99	Stage 301: PR #1693	2026-05-05 15:41:43 +00:00
Michael Lam	c4ef5b6945	fix: invalidate model cache on auth-store drift	2026-05-05 08:33:44 -07:00
Michael Lam	dc7ba0c845	fix: normalize update banner repository URLs	2026-05-05 08:29:00 -07:00
Manfred	52e7916cb8	fix: avoid adaptive title refresh session lock deadlock	2026-05-05 12:51:13 +02:00
Michael Lam	f6a532d7f0	fix: normalize named profile base homes	2026-05-05 00:00:29 -07:00
Michael Lam	0fe3927655	fix: surface Codex spark models	2026-05-04 23:10:36 -07:00
test	449f37ebd8	Stage 300: PR #1673 — feat: show LLM Gateway routing metadata by @Michaelyklam	2026-05-05 02:27:24 +00:00
test	32f37d3d78	Stage 300: PR #1676 — Add Hermes agent heartbeat alert by @Michaelyklam	2026-05-05 02:27:24 +00:00
Michael Lam	c94ec31dec	feat: show LLM Gateway routing metadata	2026-05-05 02:26:55 +00:00
Michael Lam	22df075b8a	feat: add active provider quota status	2026-05-05 02:26:52 +00:00
Michael Lam	960e45f77f	feat: add agent heartbeat alert	2026-05-05 02:25:06 +00:00
Nathan Esquenazi	e2748fe961	Apply Opus pre-release SHOULD-FIX (absorbed in stage-299) Per Opus advisor on stage-299: 1. Bounded WIKI_PATH walk + forbidden-root guard (api/routes.py) - _LLM_WIKI_MAX_FILES = 10000 caps rglob iteration (prevents hangs on symlink loops or pathologically-large trees) - _LLM_WIKI_FORBIDDEN_ROOTS blocklist refuses '/' '/etc' '/usr' '/var' '/opt' '/sys' '/proc' even if WIKI_PATH is misconfigured to point at them - Self-DoS prevention: /api/wiki/status fires on every Insights tab open via Promise.all, and unbounded rglob would block the endpoint 2. URL-scheme guard for docs_url interpolation (static/panels.js) - rawDocsUrl is regex-validated against /^https?:\/\//i before being interpolated into the <a href=> attribute - esc() HTML-escapes but doesn't validate URL scheme; docs_url is server-controlled today but the contributor scaffolded it for potential config-driven use, so future-proof against javascript: scheme XSS 6 regression tests in tests/test_stage299_opus_fixes.py pin both fixes.	2026-05-05 02:15:25 +00:00
test	136d858963	Stage 299: PR #1587 — Filter low-value CLI agent sessions by @franksong2702	2026-05-05 01:54:08 +00:00
test	df8ee6a8ad	Stage 299: PR #1662 — feat(logs): add Logs tab MVP by @Michaelyklam	2026-05-05 01:53:56 +00:00
Frank Song	8981d33543	Fix CLI session CI compatibility	2026-05-05 01:52:42 +00:00
Frank Song	79d0762d8c	Filter low-value CLI agent sessions	2026-05-05 01:52:42 +00:00
Michael Lam	af1c628292	feat: add logs tab MVP	2026-05-05 01:51:05 +00:00
Michael Lam	2684d6fa98	feat: add LLM Wiki status panel	2026-05-05 01:48:32 +00:00
test	3699e83c43	Stage 298: PR #1677 — feat: link official Hermes dashboard by @Michaelyklam	2026-05-05 01:29:49 +00:00
Michael Lam	b0953b6a7f	feat: link official Hermes dashboard	2026-05-05 01:23:55 +00:00
Michael Lam	e0e991126f	feat: add searchable MCP tool inventory	2026-05-05 01:20:32 +00:00
test	2ec18b728a	Stage 298: PR #1670 — feat: add MCP server visibility panel by @Michaelyklam	2026-05-05 01:18:35 +00:00
test	8c93b995ef	Stage 298: PR #1678 — Add Claude Code session imports by @Michaelyklam	2026-05-05 01:18:35 +00:00
test	def1507828	Stage 298: PR #1674 — feat(tasks): add scheduled job profile selector by @Michaelyklam	2026-05-05 01:18:35 +00:00
test	dfb3798470	Stage 298: PR #1663 — feat: add plugins visibility panel by @Michaelyklam	2026-05-05 01:18:35 +00:00
Michael Lam	399326f923	feat: add MCP server visibility panel	2026-05-05 01:18:34 +00:00
Michael Lam	e54a0470f0	Add Claude Code session imports	2026-05-05 01:18:34 +00:00
Michael Lam	3f3092a84e	feat: add scheduled job profile selector	2026-05-05 01:18:34 +00:00
Michael Lam	60ed948f42	feat: add plugins visibility panel	2026-05-05 01:18:33 +00:00
Michael Lam	66755b7fb1	feat: add insights token trends	2026-05-05 01:12:08 +00:00
Nathan Esquenazi	698384ecbc	fix(kanban): apply Opus advisor SHOULD-FIX (PATCH/DELETE routing + SSE id:) Two SHOULD-FIX items from the Opus advisor pass on PR #1675: 1. PATCH/DELETE handler routing asymmetry. The /boards/<slug> path match was running AFTER ?board= resolution, so a stray ?board=ghost on a 'PATCH /api/kanban/boards/experiments?board=ghost' would 404 on the missing 'ghost' board instead of editing 'experiments'. POST already routed /boards first; PATCH/DELETE now mirror that structure. The ?board= query is still resolved for the task-scoped routes that actually need it. 2. SSE event frames now emit 'id: <event_id>' lines. EventSource stores Last-Event-ID and sends it on auto-reconnect; without an 'id:' field on each frame the browser couldn't resume cleanly across connection drops, forcing the server to re-stream up to _KANBAN_SSE_BATCH_LIMIT=200 events the client already had. The handler now (a) emits 'id: <cursor>' on every events frame, and (b) reads Last-Event-ID from the request headers as a fallback when ?since= is absent. +4 regression tests: - test_handle_kanban_patch_routes_boards_slug_before_board_query_param - test_handle_kanban_delete_routes_boards_slug_before_board_query_param - test_sse_emits_id_lines_so_browser_can_resume_via_last_event_id - test_sse_honours_last_event_id_header_when_since_absent Total kanban tests: 67 -> 68 (CSS-injection fix in `60874db`) -> 72 (this). Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-05 00:32:43 +00:00
Nathan Esquenazi	397d851bdb	feat(kanban): multi-board management + SSE live event stream Closes the remaining gaps to first-party Hermes Agent dashboard parity: multi-board CRUD on /api/kanban/boards and a real-time event stream over Server-Sent Events. Builds on top of #1660 (review-feedback hardening). == Multi-board == Five new endpoints mirror the agent dashboard plugin contract verbatim (plugins/kanban/dashboard/plugin_api.py) so a single CLI / gateway slash command / dashboard / WebUI all share the same active-board pointer: GET /api/kanban/boards POST /api/kanban/boards PATCH /api/kanban/boards/<slug> DELETE /api/kanban/boards/<slug> POST /api/kanban/boards/<slug>/switch All existing endpoints accept ?board=<slug> (and writes also accept 'board' in the JSON body) — query takes precedence over body. The slug travels through the kanban_db library which already had multi-board support; the bridge is mostly thin wrappers around create_board / remove_board / list_boards / set_current_board / get_current_board. The default board is protected from deletion. Slugs are normalised through kb._normalize_board_slug() with path-traversal rejection. Archive is the default for DELETE; ?delete=1 hard-deletes. Frontend gets a 'Default ▾' switcher pill in the panel header. The menu lists every board (current first), per-status total badges, plus three actions (New / Rename / Archive). Create + rename use the same modal with a slug auto-derived from the name. Archive routes through the existing showConfirmDialog with a clear 'tasks remain on disk and the board can be restored from kanban/boards/_archived/' message. Active-board state is persisted to localStorage so a refresh stays put. The on-disk pointer in kanban/current is the cross-process source of truth, kept in sync via POST /boards/<slug>/switch. == SSE event stream == GET /api/kanban/events/stream is a long-lived Server-Sent Events feed that mirrors the agent dashboard's WebSocket /events contract. The WebUI uses SSE rather than WebSocket because (1) the existing transport is BaseHTTPServer, not async — WS would require a significant refactor or a hijack-the-socket hack; (2) SSE is the right tool for unidirectional server-pushed event streams; (3) browsers auto-reconnect on drop; (4) the existing /api/approval/stream and /api/clarify/stream patterns are proven and easy to copy. The handler polls task_events at 300ms (matching the agent dashboard's WebSocket poll cadence) so write-to-receive latency is identical. Heartbeats every 15s prevent proxy/CDN reaping. Hard cap of 200 events per batch. Frontend uses EventSource by default and falls back to 30s HTTP polling after 3 SSE failures. A 250ms debounce coalesces bursts of N events into a single board re-fetch. Stream is torn down when the user leaves the Kanban panel. == Bugs fixed during build == (1) read_only=True legacy lie. _board_payload, _events_payload, _task_log_payload, and the no-change short-circuit all hardcoded read_only=True from the read-only-bridge era of #1645. Bridge has been writable since #1649 — flag now matches reality. (2) Modal + dropdown menu transparent backgrounds. The PR stack used var(--panel) which is undefined in the WebUI design system (uses --surface, --bg, gradient panels). Replaced with the same gradient + accent border pattern used by the .app-dialog overlay. (3) Archive race. kb.connect(board=<slug>) auto-materialises the directory + sqlite on first call, so any in-flight SSE poll on a board mid-archive would silently un-archive it by re-creating the directory. Two-layer fix: (a) frontend stops the SSE stream BEFORE the DELETE call, restarts on failure; (b) bridge's _kanban_sse_fetch_new checks kb.board_exists() before connect(), returning empty results when the board is gone. (4) Save vs. Cancel button visual hierarchy. Both rendered as identical secondary buttons in the modal. Save now uses the .primary class with accent-tinted gold styling. (5) Mobile viewport gaps. Added 9 rules under @media (max-width: 640px) covering the switcher button (smaller padding/font), name truncation (max-width:140px), menu sizing (min(280px, 100vw - 24px)), modal padding, and inline-row stacking. == Tests == +45 new tests across two files. Bridge tests: 18 covering board CRUD endpoints, slug validation, default-board protection, dispatcher routing, board isolation (verified via connect() spy), and 3 SSE tests including a worker-thread integration test with threading.Event watchdog. UI static tests: 11 covering switcher markup, modal markup, JS handler presence, REST verb usage, board-param plumbing, localStorage persistence, showConfirmDialog usage, EventSource subscription, polling fallback, panel-switch teardown, and 250ms debouncing. Bridge tests: 18 → 36 (+18 multi-board, +3 SSE) UI static tests: 15 → 26 (+11) Total kanban: 33 → 63 Full repo test suite: 4351 passed, 0 regressions. == Live verification == End-to-end browser walkthrough on port 8789: - Create Sprint 12 + Backlog via modal: switcher updates ✓ - Switch between boards: count isolation correct ✓ - Add task on Sprint 12 via API: SSE delivers in 400ms ✓ - 5-task burst: 250ms debounce coalesces to single render ✓ - Rename board via modal: switcher label updates ✓ - Archive board: confirm dialog → board moved to _archived/, no zombie directory (race fix verified) ✓ - Zero JS errors throughout 11-step flow Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-05 00:18:36 +00:00
Nathan Esquenazi	7e48a2fd85	fix(kanban): polish + ImportError fallback Four follow-up issues found in the combined-stack live verification: (1) handle_kanban_get had no exception handler; ImportError (webui-only deploy without hermes_cli), ValueError, LookupError, RuntimeError would bubble as 500. Wrapped in same exception cascade as POST/PATCH/DELETE. (2) ImportError on any verb now returns 503 "kanban unavailable: <reason>" instead of 500. Frontend's existing try/catch surfaces a clean toast. (3) The 'Read-only view' banner (legacy of read-only PR #1645) was always visible regardless of actual board state. Default-hidden in HTML; loadKanban() toggles based on _kanbanBoard.read_only. (4) .btn / .btn.secondary class names were referenced in 4 places (Bulk action / Nudge dispatcher / New task / Back to board) but no matching CSS shipped — buttons rendered as browser-default beveled controls that clashed with the dark theme. Added scoped CSS rules under the kanban-* parent containers. +4 behavioral + static UI tests covering the contracts. Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-04 23:32:05 +00:00
Hermes Agent	a39ec45b9f	fix(kanban): protect dispatcher contract — reject raw status='running' PATCH The PATCH /api/kanban/tasks/:id endpoint allowed any status-to-any-status transition for the non-claim/complete/block/archive set via raw `UPDATE tasks SET status = ?`. This let UI users (or any client) flip a task to 'running' without going through kb.claim_task(), bypassing claim_lock + claim_expires + started_at + worker_pid. The dispatcher treats such a phantom-claimed task as orphaned and may reclaim, hide, or double-dispatch it. Match the agent dashboard plugin's contract (plugins/kanban/dashboard/plugin_api.py update_task): - status='running' via PATCH → ValueError (HTTP 400) - status='ready' from currently-blocked → kb.unblock_task() (fires 'unblocked' event) - status='ready' from anything else, plus status in {'todo', 'triage'} → new _set_status_direct() helper that nulls claim fields when leaving 'running', closes any active run with outcome='reclaimed', and appends a 'status' event row to task_events - status='done', 'blocked', 'archived' → unchanged (already structured) Frontend changes: - Drop 'running' from the .kanban-status-actions button row in the task detail pane (clicking it would always 400 anyway). - allowKanbanDrop() refuses the 'running' column as a drop target with dropEffect='none' so users see immediate visual feedback that the dispatcher/claim path owns running. Tests added (3, all passing): - test_patch_status_running_is_rejected_to_protect_dispatcher_contract - test_patch_status_done_to_running_is_rejected - test_patch_status_blocked_to_ready_routes_through_unblock_task Existing 12 tests still pass. Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-04 23:06:42 +00:00

1 2 3 4 5 ...

548 Commits