hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-25 03:00:23 +00:00

Author	SHA1	Message	Date
Jordan SkyLF	a291ffdde6	fix: refine update summary category handling Keep distinct generated summary categories, route update-summary generation through the configured auxiliary model first, disclose capped large-range summary input, and constrain long summary panels.	2026-05-14 01:07:47 -07:00
Hermes Agent	b8e9951492	Merge pull request #2236 into stage-354 fix: silent failure detection scans only new messages (jasonjcwu)	2026-05-14 07:15:16 +00:00
Hermes Agent	efad585b86	Merge pull request #2228 into stage-354 Add model picker to profile creation (franksong2702, refs #749)	2026-05-14 07:15:14 +00:00
Jordan SkyLF	afbcc9a6d5	fix: wrap update banner on mobile	2026-05-13 23:51:48 -07:00
fxd-jason	1e80b51560	fix: align usage-overwrite test FakeAgent with real agent message format The FakeAgent in test_issue1857_usage_overwrite returned only 2 messages (user + assistant) without the conversation history. The real agent always returns the full history plus new messages. This mismatch caused the new _has_new_assistant_reply helper (which checks only messages beyond the pre-turn offset) to see len(result)==len(prev) and incorrectly flag the turn as a silent failure. Fix: prepend conversation_history to the FakeAgent's response so the message list mirrors production behavior.	2026-05-14 14:48:08 +08:00
fxd-jason	120ec5eba2	fix: silent failure detection scans only new messages, not full history When a provider error (401/429/rate-limit) causes the agent to return without producing a new assistant reply, the WebUI should emit an apperror event so the user sees an inline error. However, the detection logic scanned ALL messages in result['messages'] — which includes the full conversation history. If any prior turn had an assistant response, _assistant_added would be True and the apperror would be silently skipped, leaving the user staring at a blank response. Extract a helper _has_new_assistant_reply(all_messages, prev_count) that only inspects messages beyond the pre-turn history offset. Apply it to both the main detection path and the self-heal/retry path. Tests: 15 new cases covering history masking, empty content, whitespace, edge-case shrinks, and multi-assistant scenarios.	2026-05-14 14:34:19 +08:00
Jordan SkyLF	62eb703dcf	fix: avoid duplicate update summary bullets	2026-05-13 22:54:45 -07:00
Frank Song	8b30ade923	Add profile creation model picker	2026-05-14 12:13:49 +08:00
Hermes Agent	3d34a72ee8	stage-353: apply Opus SHOULD-FIX — unconditional parent_session_id stamp on compression rotation Opus identified that PR #2227's preservation block had two related bugs in the parent_session_id handling: 1. During preservation save: code did _old_parent = s.parent_session_id s.parent_session_id = None s.save(touch_updated_at=False, skip_index=True) s.parent_session_id = _old_parent The save persisted parent=None to disk. The in-memory restoration didn't reach the disk copy. Result: a /branch fork session that subsequently compressed lost its 'Forked from X' badge on the preserved old snapshot. 2. Stamping the continuation: code did if not s.parent_session_id: s.parent_session_id = old_sid The 'if not' guard skipped the stamp when the session already had a parent_session_id from a prior fork. Result: fork-of-fork compression broke lineage — the continuation jumped back to the original fork parent instead of the just-preserved immediate predecessor snapshot. Fix (matches Opus's recommendation): - Remove the parent clearing during preservation save (preserve as-is) - Drop the 'if not' guard; always stamp continuation to old_sid This makes the lineage chain consistent: new → old → old.parent → ... root. Traversal from the continuation always walks through the just-preserved snapshot to get to its parent's parent, never jumping over the snapshot. Two new regression tests pin both invariants: - test_parent_session_id_stamped_unconditionally (no 'if not' guard) - test_old_session_parent_preserved_during_archive_save (no parent=None) Both pass against the fix. All 8 tests in the file pass.	2026-05-14 03:59:02 +00:00
Hermes Agent	bfb62abe35	Merge pull request #2225 into stage-353 Add extra-large Appearance font size option (franksong2702)	2026-05-14 03:43:52 +00:00
Frank Song	e2f319d730	Add extra large font size option	2026-05-14 11:09:21 +08:00
RØG3R L!M4	5bbf18324c	fix: preserve session history during compression rotation (#2223 ) The previous implementation renamed old_sid.json → new_sid.json during context compression, destroying the only persistent copy of the full conversation history. If the summarisation LLM call also failed, the user was left with zero recoverable messages. Fix: - Remove the destructive old_path.rename(new_path) call - Preserve old_sid.json as an immutable pre-compression archive - Create new_sid.json as a fresh file via s.save() - Set parent_session_id on the continuation session for lineage - Save in-memory messages to old_sid.json if they're newer than disk Test: test_issue2223_compression_no_rename.py (6 tests, all passing)	2026-05-14 03:02:44 +00:00
Hermes Agent	549140df31	Merge pull request #2216 into stage-352 fix: cap _summary_cache with LRU (max 16 entries) (franksong2702, closes #2215 Fix A — closes #2215)	2026-05-14 02:22:08 +00:00
Frank Song	9681761cdf	fix: cap _summary_cache with OrderedDict LRU Refs #2215 Fix A: replace plain dict _summary_cache with OrderedDict-based LRU capped at 16 entries to prevent unbounded memory growth from long-running update summary generations. Add regression coverage for the bounded LRU behavior: cache hits refresh recency, a new entry at capacity evicts the least-recently used key, and cache size never exceeds the cap.	2026-05-14 09:14:28 +08:00
Frank Song	28ec3af697	fix: strip only leading user-asking wrapper line Refs #2215 Fix B: remove the mid-response stripping hazard without losing leading multi-line wrapper cleanup. The pattern now strips only a leading 'the user is asking' wrapper line and preserves the visible answer that follows. Add regression coverage for both the leading-wrapper and mid-response prose cases.	2026-05-14 09:14:28 +08:00
Hermes Agent	2accf6335c	Merge pull request #2149 into stage-351 perf(sessions): cache CLI session scans (starship-s) Conflict resolution on api/routes.py: (1) Master grew a new helper '_messages_include_tool_metadata()' that pr-2149 doesn't have. Kept it (unrelated function — detects whether returned messages contain tool metadata, used elsewhere). (2) pr-2149 renames the CLI-metadata gate from '_needs_cli_session_metadata' to '_session_requires_cli_metadata_lookup' AND broadens it to cover legacy-imported sidecars with 'read_only=False' but persisted 'is_cli_session' or session_source markers. The new gate is strictly more inclusive than the master version — covers (a) is_cli_session, (b) read_only=True, (c) session_source in {messaging, external_agent}, AND (d) source_tag, raw_source, source, source_label, platform markers. All sessions that previously took the slow path still do, plus a few more legacy shapes that needed CLI metadata for correct display. (3) Removed the obsolete '_needs_cli_session_metadata()' definition from master (only consumer migrated to the new name). 29/29 tests pass across test_session_cli_scan_fast_path (new), claude_code session import, session_index, and session_lineage_full_transcript.	2026-05-13 23:54:15 +00:00
Hermes Agent	70f09aaeb6	Merge pull request #2207 into stage-351 feat: add per-target update summaries with separate WebUI/Agent What's-new links (Jordan-SkyLF, fixes #1579)	2026-05-13 23:51:28 +00:00
Frank Song	dc213d47b8	fix: preserve literal thinking tags	2026-05-14 07:13:34 +08:00
Jordan SkyLF	90c2ee7e04	Split What's New summaries by target	2026-05-13 15:53:01 -07:00
Jordan SkyLF	cae007b069	Refine What's New summary sections	2026-05-13 15:53:01 -07:00
Jordan SkyLF	623dfef499	Stabilize What's New summaries	2026-05-13 15:53:01 -07:00
Jordan SkyLF	bec21eafa0	Add What's New summary toggle	2026-05-13 15:53:01 -07:00
Jordan SkyLF	cfc0f68d23	fix: show update whats-new links for webui and agent	2026-05-13 15:53:01 -07:00
Hermes Agent	7209e89ef4	stage-350: apply Opus SHOULD-FIX — tighten _partial_already_present dedup scope Opus flagged that PR #2151's cancel-handler partial-dedup loop used a substring check that was too broad: any short prior assistant reply ('OK', 'Here is the answer:') would dedup a longer new partial containing it, silently dropping the partial and resurrecting the #893 data-loss bug. Tightened to only dedup against actual prior _partial=True markers with exact (whitespace-stripped) content match. Three new regression tests added (short-non-partial-prefix-does-not-dedup, exact-partial-match-still- dedups, same-content-non-partial-does-not-dedup). 10/10 partial-cancel tests pass after the fix. Also updated CHANGELOG with the conflict-resolution notes for #2151 vs #2136 and the #2178 test-fix.	2026-05-13 21:11:01 +00:00
Hermes Agent	3f851051cf	Merge pull request #2151 into stage-350 fix: clarify cancelled chat turn status (Jordan-SkyLF) Conflict resolution on api/streaming.py:4549-4567 (the cancel-handler ownership guard). Both this PR and the already-shipped PR #2136 add a guard at the same site against stale stream writebacks, from different angles: - PR #2136 (HEAD): _stream_writeback_is_current(_cs, stream_id) — strictly dominates by checking the active_stream_id token equality. - PR #2151: 'worker won the race' check via (active_stream_id != stream_id and not pending_user_message), with _emit_cancel_event = False to suppress the terminal cancel event. Resolution merges both: keep #2136's strictly-stronger condition for skip detection, and adopt #2151's _emit_cancel_event = False semantic so the cancel event isn't emitted in addition to skipping the writeback (when client may have already received the successful done payload). 55/55 tests pass across cancelled-turn-status + stale-stream-writeback + the four cancel/data-loss sibling test files.	2026-05-13 20:44:44 +00:00
Hermes Agent	5f8b834833	Merge pull request #2193 into stage-350 fix(auth) 3/3: full HMAC digest with upgrade migration bridge + restore Secure cookie heuristic (lucasrc)	2026-05-13 20:41:38 +00:00
Hermes Agent	ca82f60144	Merge pull request #2191 into stage-350 fix(auth) 1/3: thread-safe login rate limiter + PBKDF2 key separation + transparent migration (lucasrc)	2026-05-13 20:41:36 +00:00
Hermes Agent	f94314e164	Merge pull request #2204 into stage-350 Fix opencode-go custom provider overlap routing (Michaelyklam, closes #1894)	2026-05-13 20:41:33 +00:00
Michael Lam	1e17760a04	Fix opencode-go provider overlap routing Closes #1894	2026-05-13 12:13:37 -07:00
Hermes Agent	7150e9fe70	Merge pull request #2202 into stage-349 feat: show early session titles on chat start (Jordan-SkyLF)	2026-05-13 19:03:03 +00:00
Jordan SkyLF	0381294f1c	feat: add early session provisional titles	2026-05-13 11:37:11 -07:00
MrFant	520795fdd2	fix: preserve reasoning_content in API message whitelist Providers like Xiaomi MiMo, DeepSeek, and Kimi require reasoning_content to be echoed back on every assistant message in multi-turn conversations with tool calls. Omitting it causes HTTP 400: 'The reasoning_content in the thinking mode must be passed back to the API.' The WebUI's _sanitize_messages_for_api() strips all fields not in _API_SAFE_MSG_KEYS before sending conversation history to the LLM API. reasoning_content was not in this whitelist, so it was silently dropped. The CLI path (run_agent.py) is unaffected because it has its own _copy_reasoning_content_for_api() logic that operates on raw message dicts without going through this filter. This is why the same session works from CLI but fails from WebUI with HTTP 400. The fix adds 'reasoning_content' to _API_SAFE_MSG_KEYS so the field passes through sanitization intact.	2026-05-14 02:29:17 +08:00
Lucas Coutinho	7e6f7372d5	fix(auth): add type hint to verify_session()	2026-05-13 14:18:47 -03:00
Lucas Coutinho	9921bbb412	docs(auth): add X-Forwarded-Proto trust warning to _is_secure_context()	2026-05-13 14:18:47 -03:00
Lucas Coutinho	07a5fe0838	fix(auth): HMAC length migration bridge and restore Secure cookie heuristic HMAC length: create_session() now emits a full 64-char HMAC-SHA256 hex digest instead of the truncated 32-char form. verify_session() accepts both lengths during a transition window so existing sessions survive the upgrade without a forced global logout. The legacy 32-char branch can be removed once the default 30-day session TTL has elapsed. Secure flag: introduce _is_secure_context(handler) to encapsulate the env-var override and heuristic. Restores the getpeercert / X-Forwarded-Proto heuristic that was present before this refactor, keeping the env-var override (HERMES_WEBUI_SECURE) on top for proxy deployments that need explicit control. The bare `return False` stub that the previous commit left in place silently broke Secure-cookie delivery for all reverse-proxy users who never set the env var. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 14:18:47 -03:00
Lucas Coutinho	2bcf411519	fix(auth): invalidate password hash cache in save_settings() on password change	2026-05-13 14:08:37 -03:00
Hermes Agent	7c2b2785e7	stage-348: apply Opus SHOULD-FIX-pre-merge — add '://' to _SENSITIVE_LOWER_MARKERS Opus advisor flagged that PR #2171's credential prefilter only listed specific DB scheme prefixes and form keys, letting OAuth callback URLs, URL userinfo, signed-URL query params bypass the hard agent redactor. Adding the generic '://' marker restores the WebUI-as-hard-safety-boundary contract. Plain URLs without sensitive substrings still pass through unchanged because the redactor itself only mutates sensitive substrings. Regression-pinned with 5 new parametric cases in test_security_redaction.py plus 1 negative-case companion. Verified test FAILS without the fix and PASSES with it.	2026-05-13 16:54:36 +00:00
Hermes Agent	39df1a1ef3	Merge pull request #2171 into stage-348 Trim session tail response overhead (franksong2702)	2026-05-13 16:34:43 +00:00
Hermes Agent	ef042ad8c2	Merge pull request #2188 into stage-348 fix: refresh context ring after compression (LumenYoung)	2026-05-13 16:34:42 +00:00
Lucas Coutinho	978dbc15d8	fix(auth): correct misleading cache invalidation comment in verify_password()	2026-05-13 12:48:35 -03:00
Lucas Coutinho	8ca29618fe	fix(auth): tighten except to OSError, add type hints, fix test imports	2026-05-13 12:27:27 -03:00
Lucas Coutinho	720e69cb83	fix(auth): cache signing and PBKDF2 keys in memory, remove migration side-effect call	2026-05-13 11:13:23 -03:00
Lucas Coutinho	e6e91e4973	fix(auth): thread-safe login rate limiter, PBKDF2 key separation, and migration path Concurrent failed logins raced on _login_attempts because no lock guarded the dict. Add _LOGIN_ATTEMPTS_LOCK and wrap both _check_login_rate() and _record_login_attempt() with it. Extract _load_key() to de-duplicate key file I/O. Add _pbkdf2_key() that loads .pbkdf2_key (separate from .signing_key) so PBKDF2 and HMAC signing no longer share a key — key reuse across cryptographic primitives is unsafe. Update _hash_password() to use _pbkdf2_key() as its default salt, with an optional salt kwarg so verify_password() can try the legacy .signing_key salt during transparent migration. When the old hash matches, save_settings() re-hashes with _pbkdf2_key() and _invalidate_password_hash_cache() ensures the next request sees the upgraded hash without a restart. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 10:40:23 -03:00
Lumen Yang	3289c44fb6	fix: refresh context ring after compression	2026-05-13 14:02:28 +02:00
Frank Song	da73c00f06	Harden session tail redaction prefilter	2026-05-13 18:58:49 +08:00
fxd-jason	9e45de463d	fix: prevent 404 on /api/session/compress/status during session switch Two-part fix: - Backend: handle_get returns True (not None from j()) for compress/status route, preventing edge-case 404 fallback in do_GET - Frontend: resumeManualCompressionForSession silently returns on 404 instead of showing "Compression failed: not found" toast Includes 6 regression tests covering backend return value, idle/empty session responses, and frontend 404 guard presence.	2026-05-13 18:56:55 +08:00
Frank Song	b7ac5a8b88	Trim session tail response overhead	2026-05-13 15:57:29 +08:00
Hermes Agent	8060b2ba3a	Merge pull request #2179 into stage-347 fix(config): preserve nvidia/ prefix on NVIDIA NIM (closes #2177) Self-built. nesquena APPROVED with extensive end-to-end trace including cross-tool agent CLI verification and 12-shape behavioural harness.	2026-05-13 07:33:45 +00:00
nesquena-hermes	9b1d786459	fix(config): preserve nvidia/ prefix on NVIDIA NIM (closes #2177 ) Move the `_PORTAL_PROVIDERS` guard in `resolve_model_provider()` to run BEFORE the `prefix == config_provider` strip branch. The guard was added for NVIDIA (along with the Nous portal cases in #854 / #894) but was placed after the strip, so it never fired when `config_provider == "nvidia"` and the model id started with `nvidia/`. For `model_id="nvidia/nemotron-3-super-120b-a12b"`, `config_provider="nvidia"`: - prefix = "nvidia", bare = "nemotron-3-super-120b-a12b" - prefix == config_provider → True → strip branch returned bare name - `_PORTAL_PROVIDERS` guard never reached - bare "nemotron-3-super-120b-a12b" sent to NVIDIA NIM → HTTP 404 NIM requires the full namespaced path. The fix moves the portal guard to run first, so all portal providers (Nous, OpenCode-Zen, OpenCode-Go, NVIDIA NIM) always preserve the full `provider/model` id regardless of whether the prefix happens to equal the provider name. This also closes a latent symmetric bug for the Nous case if a `nous/<model>` id ever existed in the catalog. Test plan: - New `tests/test_issue2177_nvidia_prefix_preservation.py` covers: - nvidia/nemotron-... under nvidia (the reported case) - cross-namespace qwen/ and meta/ under nvidia (regression pin) - every static nvidia model in `_PROVIDER_MODELS` resolves to itself - latent nous/<model> under nous (structural ordering pin) - non-portal providers (anthropic) still strip — fix doesn't over-correct - Existing portal-routing suites (test_nous_portal_routing.py, test_issue895_894_nous_prefix.py) continue to pass. - Full test suite: 5320 passed, 4 skipped, 3 xpassed. Reported on Discord by @vishnu (Nathan forwarded as #2177).	2026-05-13 07:05:57 +00:00
Hermes Agent	afe42b96c1	Merge pull request #2156 into stage-346 Issue #2057 Slice 2: Add guarded worktree remove action	2026-05-13 06:56:25 +00:00

1 2 3 4 5 ...

810 Commits