hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-22 18:30:28 +00:00

Author	SHA1	Message	Date
ai-ag2026	dd07334d6c	fix(session): keep state db replays out of sidecar tail	2026-05-22 16:25:10 +00:00
s010mn	4153a47d0f	feat: new_session() reads display.personality from config as default When display.personality is set in config.yaml (e.g. personality: taleb), new sessions now inherit it automatically instead of starting with personality=None and requiring an explicit /personality command. This makes the selected personality sticky across new conversations rather than requiring per-session activation. Behavior: - display.personality values 'none', 'default', 'neutral', '' are treated as no personality (personality=None), matching TUI gateway semantics. - Config read is wrapped in try/except — if it fails, personality falls back to None (no crash, no regression). - Case-insensitive: 'Taleb' normalizes to 'taleb'. The /personality slash command still works for per-session overrides as before; this change only affects the initial default.	2026-05-22 16:13:33 +00:00
nesquena-hermes	d71b8977d6	Stage 401: PR #2742	2026-05-22 15:22:01 +00:00
Isla-Liu	5b41f03a92	fix(webui): close sqlite3 connections in handoff-summary path (#2233 ) Two functions on the /api/session/handoff-summary hot path were opening sqlite3.connect(...) inside a bare `with` statement, which commits the transaction at scope exit but does NOT close the connection. Per-turn invocations accumulated state.db / state.db-wal file descriptors and CPython heap pages on long-lived worker threads, surfacing as the multi-GB VmRSS and 6x duplicated state.db fds observed on the live instance (D0 pre-restart baseline: VmRSS 1,334,248 kB, 55 fds; cold baseline after restart: VmRSS 136,668 kB, 10 fds). Wrap both call sites with contextlib.closing(...) (already imported and used at seven other sites in the same files) so the connection is closed deterministically: - api/models.py :: count_conversation_rounds - api/routes.py :: _persist_handoff_summary_to_state_db Regression test: tests/test_issue2233_sqlite_connection_leak.py loops both functions 20 times against a tmp state.db and asserts /proc/<pid>/fd count does not grow more than 2. Linux-only via sys.platform skip. D1 live soak against a freshly-built worktree server (port 8799, isolated HERMES_HOME / HERMES_WEBUI_STATE_DIR) hitting /api/session/handoff-summary 20 times: fd_before = 5 fd_after = 5 (growth 0, threshold < 5) vmrss_before = 52636 kB vmrss_after = 52636 kB (growth 0 kB, threshold < 30 MB) The patched fix curve trends below the leak curve. Rollback: single git revert <this-sha> reverts both file edits. Refs #2233.	2026-05-22 18:34:06 +08:00
wdzhou	a4e6ffccd9	fix(session): deduplicate _write_session_index full rebuild entries by session_id The full rebuild path scans SESSION_DIR via glob('.json') and appends every loaded session to a plain list without deduplicating by session_id. When old-format session_.json files coexist alongside WebUI-format xxx.json files (both sharing session_id), the index gets duplicate entries, causing frontend Vue key crashes. Fix: use dict[session_id -> compact_entry] to naturally deduplicate.	2026-05-22 18:02:49 +08:00
wdzhou	16f9887846	fix(session): deduplicate _write_session_index full rebuild by session_id The full rebuild path of _write_session_index scans SESSION_DIR via glob('.json') and appends every loaded session to a plain list without deduplicating by session_id. When old-format session_.json files coexist alongside WebUI-format xxx.json files (both sharing the same session_id), the same session appears multiple times in the index, causing frontend Vue key collisions and a blank page. Fix: use dict[session_id -> compact_entry] to naturally deduplicate. Prefer the entry with the larger message_count when conflicts arise.	2026-05-22 16:13:42 +08:00
Hermes Agent	654f62e0bd	Stage 400: PR #2721 — fix(session): treat active runs as live during repair (skip restart-stale prune for sessions with live streams) Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-21 22:59:43 +00:00
Hermes Agent	4db8df5e29	Stage 399: PR #2686 — fix(session): dedupe restamped state.db replay rows in /api/session display merge Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-21 17:56:40 +00:00
nesquena-hermes	4d8b8d0ffe	Stage 393: PR #2633 # Conflicts: # CHANGELOG.md	2026-05-20 22:23:53 +00:00
nesquena-hermes	e35c94bf55	Stage 393: PR #2615	2026-05-20 22:23:53 +00:00
dobby-d-elf	87527ff4f6	Fix state db legacy dedup repeat preservation	2026-05-20 14:18:47 -06:00
dobby-d-elf	19ad20afff	Fix new chats using profile default model	2026-05-20 10:57:04 -06:00
nesquena-hermes	3d34eef02d	Stage 389: PR #2620	2026-05-20 16:41:45 +00:00
Isla Liu	2a303de2a3	fix(session): preserve retry budget while journal is still arriving	2026-05-20 20:55:07 +08:00
Isla Liu	d5a185d9c6	fix(session): serialize lazy journal retry per session	2026-05-20 20:48:38 +08:00
manji	ff0aa69d5f	fix(session): use second-level timestamp granularity in legacy dedup key The _normalized_message_timestamp_for_key helper was preserving microsecond precision (%.6f). When the same message is persisted by both the WebUI sidecar JSON writer and the Hermes agent state.db writer, their timestamps can differ by a few microseconds, causing _session_message_merge_key to produce different keys for the same logical message and letting both copies survive the dedup pass in merge_session_messages_append_only. Truncating to second-level granularity collapses sub-second drift to the same key, so the duplicate is suppressed correctly. Fixes #2616	2026-05-20 07:13:55 +00:00
Lumen Yang	b2c6af12f1	fix(webui): prefer sidecar counts over stale session index	2026-05-20 05:42:55 +00:00
Isla Liu	9870e8f111	fix(session): address Copilot review — scope tool-card dedupe by stream id + tighten docs Four code-review comments from the automated Copilot reviewer on this PR: 1. `_journal_tool_already_present` dedupe was session-wide, so a legitimately-repeated tool (e.g. a second `terminal: ls` in an earlier turn) could cause the retry path to falsely skip materializing the recovered tool card. The helper now takes a keyword `stream_id` argument; when supplied, a tool card whose `_recovered_stream_id` is set AND differs from the candidate is no longer treated as a duplicate. Untagged tool cards (live tools, or tool cards carried over from a pre-tagging core transcript) still match, preserving the existing 'core transcript already has this tool, don't duplicate' invariant. Two new tests in `TestJournalToolDedupeScoping` cover both legs of the rule. 2./3. The troubleshooting FAQ pointed at `~/.hermes/webui/sessions/session_<sid>.json` and `~/.hermes/_run_journal/...`. The actual sidecar filename has no `session_` prefix and the run-journal lives under the WebUI sessions dir (`~/.hermes/webui/sessions/_run_journal/<sid>/<stream>.jsonl`, default). Both paths fixed and an explicit note added about `HERMES_WEBUI_STATE_DIR` overriding the state root. 4. Drop unused `json` / `queue` / `Path` imports from `tests/test_session_lost_response_regression.py` so the file stops carrying noise that future linting would flag.	2026-05-20 12:18:03 +08:00
Isla Liu	75a26174aa	fix(session): lazily retry run-journal recovery so the interrupted-turn marker self-heals When the WebUI process restarts mid-stream and sidecar repair runs while the run-journal for the dead stream is not yet visible on disk (WSL2 9p / DrvFs page-cache loss, un-fsynced journal tail on network FS, …), `_append_journaled_partial_output()` returns False and the marker is permanently baked with the "no agent output was recovered" wording even though the journaled tokens appear on disk shortly afterwards. This commit reframes the recovery contract so the read side can self-heal: * `_interrupted_recovery_marker` gains a `pending_retry=True` mode that produces a third wording ("Recovering the partial output … reload this session to retry.") and stamps a `_pending_journal_recovery` flag. * `_apply_core_sync_or_error_marker` now writes that pending-retry marker (with `_journal_retry_stream_id`, `_journal_retry_attempts`, `_journal_retry_first_seen_ts` meta) whenever it cannot recover visible output AND the stream id is known. The legacy "no output" wording is reserved for the no-stream-id case. The core-sync branch leaves marker emission to the existing visible-output check (the core transcript itself is the canonical history in that branch). * A new `_retry_journal_recovery_in_place(session)` helper re-runs `_append_journaled_partial_output(…, dedupe_existing=True)` for the latest pending marker. On success the marker is promoted in place to the recovered-output wording, the journaled rows are reordered to sit above the marker (preserving chronological order), and all retry meta is stripped. On failure attempts is incremented; after _JOURNAL_RETRY_MAX_ATTEMPTS (12) or _JOURNAL_RETRY_GIVEUP_SECONDS (24h) the marker is demoted to a neutral "Partial output may have been lost." wording. * `get_session()` cheaply short-circuits via `_session_has_pending_journal_retry()` and invokes the helper on both cache-hit and cold-load paths when a pending marker is found. `metadata_only=True` skips the helper to keep sidebar refresh cheap. The retry call runs OUTSIDE the SESSIONS LOCK to avoid a deadlock with `session.save()` write paths. No streaming write path or run_journal fsync behaviour is changed — the fix is read-side only.	2026-05-20 11:58:26 +08:00
nesquena-hermes	cc8ef201be	Stage 387: PR #2600	2026-05-19 22:10:20 +00:00
nesquena-hermes	93727897b6	Stage 387: PR #2605 # Conflicts: # api/routes.py	2026-05-19 22:10:20 +00:00
nesquena-hermes	e63de7c15f	Stage 387: PR #2593 # Conflicts: # CHANGELOG.md	2026-05-19 22:08:56 +00:00
Lumen Yang	dc5c8168d1	fix(webui): refresh active session on external sidecar updates	2026-05-19 21:34:08 +00:00
Lumen Yang	8d2b9d4a16	feat(webui): render indexed context metadata	2026-05-19 18:52:50 +00:00
nesquena-hermes	86f52f67b8	Stage 386: PR #2581 # Conflicts: # api/streaming.py	2026-05-19 18:20:47 +00:00
Michael Lam	0736e45485	fix: dedupe tool-only partial recovery markers	2026-05-19 11:16:21 -07:00
starship-s	2e9ca283dc	fix: display canonical cache hit percentage	2026-05-19 02:27:12 -06:00
Lumen Yang	600bb48970	fix(webui): use active state db for metadata summary	2026-05-19 08:02:43 +00:00
Lumen Yang	6ca63e5815	perf(webui): keep external refresh metadata cheap	2026-05-19 08:02:43 +00:00
Lumen Yang	a63ab310b5	fix(webui): preserve reconciled session invariants	2026-05-19 08:02:43 +00:00
Lumen Yang	467ef33a24	feat(webui): reconcile external session updates When API server runs append messages directly to state.db, reconcile WebUI sidecar sessions with those canonical rows across API responses, model-facing streaming context, and active browser refresh. Add append-only state.db merge helpers, metadata-only counts for refresh polling, and regression coverage for API visibility, context incorporation, and frontend refresh behavior.	2026-05-19 08:02:43 +00:00
Frank Song	4661a5e94e	Recover journal output after core transcript sync	2026-05-17 12:28:05 +08:00
nesquena-hermes	8f98465024	Stage 374: PR #2427 — fix(streaming): recover journaled partial assistant output after WebUI restart by @franksong2702 (fixes #2423 ) Co-authored-by: Frank Song <franksong2702@gmail.com>	2026-05-17 02:49:35 +00:00
nesquena-hermes	47c210899e	Stage 374: PR #2421 — fix(cache-tokens): surface provider prompt-cache read/write tokens in WebUI usage by @Michaelyklam (fixes #2419 ) Co-authored-by: Michael Lam <michael@example.local>	2026-05-17 02:49:34 +00:00
Hermes Agent	026a9957f4	Stage 368: PR #2385 — Keep fuller compression snapshots reachable in sidebar by @franksong2702	2026-05-16 17:19:05 +00:00
Frank Song	4899ae17b9	Keep fuller compression snapshots reachable	2026-05-16 20:58:44 +08:00
Frank Song	c415c843df	Update interrupted recovery comment wording	2026-05-16 20:05:47 +08:00
Frank Song	49bea3ad01	Clarify interrupted turn recovery marker	2026-05-16 14:29:58 +08:00
Frank Song	40f69a2b75	Keep recovered pending turns in context	2026-05-16 04:07:02 +00:00
Hermes Agent	4826a31fbc	Merge pull request #2285 into stage-359 fix: hide pre-compression snapshots from sidebar (dso2ng, refs #2230) # Conflicts: # CHANGELOG.md	2026-05-15 14:55:19 +00:00
Dennis Soong	bfccdc5c94	fix: hide pre-compression snapshots from sidebar	2026-05-15 11:20:17 +08:00
ai-ag2026	5110005324	fix: load CLI continuation session transcripts	2026-05-14 23:48:49 +02:00
Dennis Soong	143d9d8ef7	fix: reconcile stale sidebar display titles	2026-05-14 16:18:53 +08:00
starship-s	4084c3cf56	perf(sessions): cache CLI session scans	2026-05-12 11:24:29 -06:00
Frank Song	186453ea0e	Add worktree-backed session creation	2026-05-11 12:12:40 +08:00
nesquena-hermes	c624770c63	Stage 331: PR #2015 — fix(sessions): stitch continued session transcripts by @Jellypowered	2026-05-10 17:09:21 +00:00
Jellypowered	8aed650b4c	Stitch continued session transcripts in WebUI	2026-05-10 11:10:54 -05:00
Frank Song	1bec8070f2	fix(1833): persist compression anchor summary for reload UI	2026-05-10 16:45:16 +08:00
Minimax	08c4ef8d88	feat: persistent composer draft — server-side, cross-client, survives refresh - Session.composer_draft field: {text, files} stored in session JSON - POST+GET /api/session/draft endpoint for save/load - loadSession: save draft before switch, restore from S.session.composer_draft - textarea input: debounced 400ms auto-save to server - send(): clear draft after message is sent - lockComposerForClarify(): save draft before card locks composer - _restoreComposerDraft: clears textarea when target has no draft, guards against stale responses racing new session loads, exact text comparison - Session.compact(): includes composer_draft in response - Fix: use handler.command instead of parsed.method (ParseResult has no .method) Co-authored-by: Minimax <noreply@minimax.io>	2026-05-09 13:47:57 +01:00
Frank Song	7e2709e281	fix: add request wedge diagnostics	2026-05-08 15:37:08 +00:00

1 2 3

131 Commits