hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-25 11:10:18 +00:00

Author	SHA1	Message	Date
Frank Song	1e1a9481b4	fix(i18n): localize /goal runtime status strings	2026-05-10 15:21:24 +08:00
nesquena-hermes	a3af4a3c8f	fix(profile/mcp): discover MCP tools after per-session HERMES_HOME mutation Issue #1968: switching to a non-default profile in the WebUI dropdown had no effect on which MCP servers were available. Every chat session, regardless of profile, only saw the default profile's mcp_servers from ~/.hermes/config.yaml. Non-default profile MCP servers (postgres, custom stdio servers, anything in <profile>/config.yaml) never registered. Root cause: api/streaming.py:1922 called discover_mcp_tools() at the TOP of _run_agent_streaming(), about 100 lines BEFORE the per-session 'os.environ["HERMES_HOME"] = _profile_home' mutation at line 2053. discover_mcp_tools() reads ~/.hermes/config.yaml via get_hermes_home(), which uses os.environ['HERMES_HOME']. So at the call site, HERMES_HOME was still whatever the WebUI server process had at startup — the default profile, every time. Fix: relocate the discover_mcp_tools() call past the _ENV_LOCK block so get_hermes_home() resolves to the session's actual profile home. Same try/except wrapping is preserved; same idempotency semantics on already-connected servers; same lazy-import pattern. Caveat (out of scope, agent-side): _servers in tools/mcp_tool.py is a process-global Dict[str, MCPServerTask] keyed only by server name. So once profile A registers a server named e.g. 'postgres', profile B's discovery sees 'postgres' as already connected and skips it — even if B's config points at a different binary or DB. Concurrent multi-profile WebUI processes will still hit 'first profile wins per server name'. Fully fixing that requires keying _servers by (profile_home, name) upstream in hermes-agent. This PR ships layer 1 only — fixes the single-non-default-profile case (the headline symptom). Tests: tests/test_issue1968_mcp_profile_discovery.py — 4 static tests pinning the lexical ordering invariants. Verified mutation-safety: a proof-of-concept revert (re-adding a discover call before the HERMES_HOME mutation) makes the 'only called once' test fail. Test suite: 5047 passed, 4 skipped, 3 xpassed, 0 regressions. Closes #1968	2026-05-09 20:08:16 +00:00
nesquena-hermes	8782fd2675	fix(stage-326): apply Opus advisor critical + recommended fixes CRITICAL: #1951 PENDING_GOAL_CONTINUATION race Removes `PENDING_GOAL_CONTINUATION.discard(session_id)` from the streaming worker's `finally` cleanup block. The marker is set inside the SAME function call (line ~3328 on `goal_continue`) and the discard in the `finally` (line ~3553) almost always raced ahead of the frontend's SSE-receive → POST /api/chat/start round-trip, erasing the marker before the consumer in routes.py could read it. The consumer (`_start_chat_stream_for_session` in routes.py:6522) already discards atomically when consuming, so removing the streaming-side discard preserves single-use semantics and unblocks the goal-continuation chain. Adds tests/test_stage326_pending_goal_continuation_race.py with 5 regression guards: 1. streaming.py's finally must NOT discard PENDING_GOAL_CONTINUATION 2. routes.py consumer must check + set + discard atomically 3. PENDING_GOAL_CONTINUATION must be a set (GIL-safe single-op) 4. STREAM_GOAL_RELATED.pop must be keyed by stream_id, not session_id 5. PENDING_GOAL_CONTINUATION.add must precede the goal_continue SSE emission in source ordering HARDENING: #1956 composer-draft input validation Per Opus, the POST /api/session/draft handler accepted unbounded / arbitrary-typed text and files inputs. With the 400ms debounced auto-save firing on every keystroke, a misbehaving client could persist multi-MB strings into the session JSON. Adds: - text: coerced to str if not already; clamped to 50_000 chars - files: coerced to list if not already; clamped to 50 entries Validation runs BEFORE the session lock acquire / save. Adds tests/test_stage326_composer_draft_validation.py with 5 guards. Verdict from Opus advisor on stage-326: SHIP-WITH-FIXES. This commit applies the required + recommended fixes; #1957 hardening fixed in a prior stage commit.	2026-05-09 18:36:01 +00:00
nesquena-hermes	404e24ac9d	fix(stage-326): preserve SESSION_TTL constant + reconcile #1957 tests PR #1957 deleted the SESSION_TTL = 86400 * 30 module-level constant in favor of the new _resolve_session_ttl() helper. Two existing regression tests pin the constant: test_auth_sessions.TestSessionPruning.test_session_ttl_is_24_hours imports SESSION_TTL directly, and test_v050258_opus_followups.test_redirect_session_ttl_30_days asserts the literal "SESSION_TTL = 86400 * 30" line is present in source (guarding against the daily-kick-out regression from #1419). Restore SESSION_TTL as the named fallback for _resolve_session_ttl(); the new env-var/settings.json path is unchanged. Backwards-compatible. Also fix the new TestSessionTtlResolution suite: - Switch from pytest's `monkeypatch` fixture (incompatible with unittest.TestCase subclasses) to setUp/tearDown env snapshotting - Reconcile clamp tests with actual implementation: out-of-range env values fall through to settings/default, not snap to bounds - test_session_uses_dynamic_ttl now sets the env var so the dynamic resolved value (3600s) is exercised rather than expecting the default Verified: tests/test_auth_sessions.py + tests/test_v050258_opus_followups.py 21/21 pass.	2026-05-09 18:33:28 +00:00
nesquena-hermes	7cf8dcff4c	Stage 326: PR #1956 — feat: persistent composer draft — server-side, cross-client, survives refresh by @JKJameson	2026-05-09 18:17:51 +00:00
nesquena-hermes	4751b5ace5	Stage 326: PR #1951 — fix: only evaluate goal hook on goal-related turns (#1932) by @amlyczz	2026-05-09 18:17:20 +00:00
nesquena-hermes	22ea145d49	Stage 326: PR #1950 — Mute stale stopped gateway heartbeat by @franksong2702	2026-05-09 18:16:16 +00:00
nesquena-hermes	c2f0c6ccc0	Stage 326: PR #1961 — fix: WebUI respects image_input_mode — stop unconditionally embedding native images by @sbe27	2026-05-09 18:16:16 +00:00
nesquena-hermes	072ec41e0a	Stage 326: PR #1947 — fix: show same model from different custom providers instead of deduplicating by @happy5318	2026-05-09 18:16:16 +00:00
nesquena-hermes	1c84da07fc	Stage 326: PR #1953 — fix(config): skip #1776 provider peel for custom host:port slugs by @lucky-yonug	2026-05-09 18:16:16 +00:00
hermes-agent	b443e8ea5a	fix: WebUI respects image_input_mode — stop unconditionally embedding native images _build_native_multimodal_message() unconditionally embedded images as native image_url parts, bypassing the agent's image_input_mode config. Add _resolve_image_input_mode(cfg) helper mirroring the agent's decide_image_input_mode logic, and wire it into _build_native_multimodal_message with a new cfg parameter. When mode resolves to 'text' (explicit aux vision config, or image_input_mode: text), returns plain string so the agent's existing text-mode pipeline (vision_analyze) handles images. Closes #1959	2026-05-09 19:39:50 +02:00
hermes-gimmethebeans	9d7c213971	feat(auth): make session TTL configurable via env var and settings.json Add _resolve_session_ttl() with three-layer precedence: 1. HERMES_WEBUI_SESSION_TTL env var (highest priority) 2. session_ttl_seconds in settings.json 3. Default: 86400 * 30 (30 days) Clamped to [60s, 1 year] for safety. Settings changes take effect immediately since the function is called dynamically at each login/cookie-write. Closes #1954	2026-05-09 17:11:53 +00:00
Minimax	08c4ef8d88	feat: persistent composer draft — server-side, cross-client, survives refresh - Session.composer_draft field: {text, files} stored in session JSON - POST+GET /api/session/draft endpoint for save/load - loadSession: save draft before switch, restore from S.session.composer_draft - textarea input: debounced 400ms auto-save to server - send(): clear draft after message is sent - lockComposerForClarify(): save draft before card locks composer - _restoreComposerDraft: clears textarea when target has no draft, guards against stale responses racing new session loads, exact text comparison - Session.compact(): includes composer_draft in response - Fix: use handler.command instead of parsed.method (ParseResult has no .method) Co-authored-by: Minimax <noreply@minimax.io>	2026-05-09 13:47:57 +01:00
happy5318	a6599cd68e	fix: show same model from different custom providers instead of deduplicating When multiple custom providers expose the same model ID (e.g. baidu, huoshan, and liantong all offering glm-5.1), only the first provider's entry was shown in the model dropdown. Root cause (backend): used the bare model ID as the dedup key, so the second and subsequent providers with the same model were silently skipped. Root cause (frontend): stripped the @provider: prefix before comparing, so @custom:baidu:glm-5.1 and @custom:huoshan:glm-5.1 were treated as duplicates. Fix: - Backend: change _seen_custom_ids key to '{slug}:{model_id}' so each provider's models are tracked independently. - Frontend: add _providerOf() helper and deduplicate on the composite (normId, provider) key instead of normId alone. Bare model IDs (without @provider: prefix) still deduplicate on normId for backward compatibility.	2026-05-09 16:17:23 +08:00
liyang1116	7532482393	fix: fix(config): skip #1776 provider peel for custom host:port slugs model_with_provider_context can emit @custom:<host>:<port>:<model> when model_provider is derived from an OpenAI base_url authority (e.g. custom:10.8.0.1:8080). The colon-count heuristic meant for @custom:slug:model:free mistook those extra colons for an over-split model ID and prepended the port segment onto the bare model (8080:Qwen3-235B), breaking WebUI while CLI/curl stayed correct. Detect endpoint-style slugs (IPv4/localhost/hostname + numeric port) and skip the peel in that case. Add regression tests for IPv4, dotted hostname, localhost, and model_with_provider_context round-trip.	2026-05-09 16:16:32 +08:00
zqy	6fd07c2af4	fix: only evaluate goal hook on goal-related turns (#1932) The goal evaluation hook was firing on every completed assistant turn when a goal was active, even for unrelated messages like "what time is it". This burned the goal budget, triggered continuation prompts that interrupted unrelated conversations, and made /goal status numbers misleading. Add STREAM_GOAL_RELATED and PENDING_GOAL_CONTINUATION flags to gate the evaluate_goal_after_turn() call in the streaming loop. Only streams started from goal kickoff (/goal <text>) or goal continuation are marked as goal-related. Normal user messages skip the hook entirely.	2026-05-09 15:08:13 +08:00
Frank Song	b38cc2f1ea	Mute stale stopped gateway heartbeat	2026-05-09 14:53:42 +08:00
nesquena-hermes	bec4433c2a	Stage 325: PR #1929 — feat: add opt-in session endless scroll by @ai-ag2026 Conflict resolution: both #1928 (session jump buttons) and #1929 (endless scroll) add their own settings/UI/i18n keys. Resolved by keeping both — the features are independent opt-in toggles.	2026-05-08 21:23:34 +00:00
nesquena-hermes	fba860da48	Stage 325: PR #1928 — feat: add opt-in session jump buttons by @ai-ag2026	2026-05-08 21:16:33 +00:00
ai-ag2026	ea8aca2818	feat: add opt-in session endless scroll	2026-05-08 21:16:21 +00:00
ai-ag2026	df1ba9fde8	feat: add opt-in session jump buttons	2026-05-08 21:16:19 +00:00
ai-ag2026	8f58a8c94e	feat: add browser offline recovery and PWA cache hardening	2026-05-08 21:16:17 +00:00
Frank Song	e8fd8dac5d	Persist login rate limit attempts	2026-05-08 20:48:41 +00:00
nesquena-hermes	b71a2d4cba	Stage 323: PR #1866 — add WebUI /goal command support by @Michaelyklam	2026-05-08 17:40:31 +00:00
Michael Lam	8e513b596b	fix: surface goal evaluation status	2026-05-08 17:12:01 +00:00
Michael Lam	0db5bc6b76	feat: add WebUI goal command support	2026-05-08 17:12:01 +00:00
Samuel Gudi	c613cfa9a7	refactor(profiles): relocate _profiles_match to api/profiles.py (#1895 review) Maintainer review on PR #1895 flagged that mcp_server.py duplicated the visibility model from api/routes.py:75. Move the canonical helper into api/profiles.py (next to _is_root_profile, on which it depends) so both api/routes.py and mcp_server.py import the same function instead of carrying parallel definitions that could drift as the model evolves. - api/profiles.py: + _profiles_match (verbatim from former routes.py:75-97) - api/routes.py: replace local definition with re-export to keep all existing _profiles_match(...) call sites resolving without per-call-site refactors - mcp_server.py: drop local copy, import _profiles_match alongside the existing api.profiles imports (line 59) - tests: + test_profiles_match_single_source_of_truth asserts identity (mcp.module._profiles_match is api.profiles._profiles_match is api.routes._profiles_match) so any re-introduction of a local copy trips the test + test_profiles_match_input_matrix parametrize across the (None|''|'default'|'foo') x (None|''|'default'|'foo'|'bar') visibility matrix per maintainer suggestion Behaviour unchanged. Zero call-site changes anywhere in api/routes.py. Co-Authored-By: Claude (Opus 4.7) <noreply@anthropic.com>	2026-05-08 17:12:01 +00:00
nesquena-hermes	8c4c253654	Stage 322: PR #1814 — custom named provider API key resolution by @hualong1009	2026-05-08 16:55:20 +00:00
王浩生	cdbdc28f5c	fix(config): custom named provider API key resolution in WebUI - add robust custom provider credential/base_url resolver - apply fallback in streaming and routes agent init/self-heal paths - support slug normalization and config fallbacks for custom:* providers	2026-05-08 16:40:17 +00:00
Frank Song	ccdc055c36	Fix workspace prefix sentinel handling	2026-05-08 16:40:17 +00:00
nesquena-hermes	b8426d047c	Stage 321: PR #1900 — pass config overrides into context-length fallback (closes #1896)	2026-05-08 16:08:42 +00:00
Nathan Esquenazi	15b7b7ae12	fix(routes): pass config overrides into session-load context-length fallback PR #1900 patches the two get_model_context_length() fallback callsites in api/streaming.py to pass config_context_length, provider, and custom_providers — but a third callsite of the same shape lives at api/routes.py:2849, in the /api/session/get path that resolves context_length for older sessions (pre-#1318) that have context_length=0 persisted. Same bug shape: only `(model, base_url)` were forwarded, so the resolver fell through to the 256K DEFAULT_FALLBACK_CONTEXT even when the user had `model.context_length: 1048576` set in config.yaml. Visible symptom: the very first paint of a reloaded old session shows the wrong window in the chat-toolbar indicator until a turn fires (which would then trigger the streaming.py fallbacks fixed in this PR and overwrite with the correct value). Fix mirrors streaming.py: pass `config_context_length=`, `provider=effective_provider or ""`, and `custom_providers=` from the per-profile config (`get_config()`), with a TypeError fallback that retries the legacy 2-arg form for older hermes-agent builds whose get_model_context_length signature pre-dates the new kwargs. Adds `test_routes_session_load_fallback_passes_config_overrides` to lock the call shape — verified to fail pre-fix with the same "missing config_context_length=" error the streaming.py tests catch. Defense-in-depth completion of #1896 — closes the third leg of the same bug shape. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-08 16:08:42 +00:00
nesquena-hermes	0efa75827a	fix(streaming): pass config overrides into context-length fallback (#1896) The two get_model_context_length() fallback callsites in api/streaming.py (session save + SSE usage payload) were calling the resolver with only model + base_url. When the agent's compressor reports 0 (fresh/cached/transitioning agent), resolution fell through to the 256K DEFAULT_FALLBACK even when users had set model.context_length: 1048576 in config.yaml. For LCM users on 1M-context models, the wrong window cascaded into a session-killing failure: auto-compression triggered at ~25% of the wrong value, floods of compress requests, 429s, credential pool exhaustion, fallback 429s, then 'API call failed after 3 retries'. Reported by @AvidFuturist on Discord with deepseek-v4-flash. Reproduced 5x. Both callsites now pass config_context_length, provider, and custom_providers. The resolver consults these BEFORE probing, so the config override wins. Both are wrapped in except TypeError blocks that retry with the legacy 2-arg form for older hermes-agent builds whose get_model_context_length signature pre-dates these kwargs. Tests: 7 source-string regressions guarding both call shapes, the safe config parse, the legacy fallback, and the per-profile config source. Also bumped the line-distance assertion in test_pr1341 (the test explicitly invites bumping when a new pre-save mutation block is added). Closes #1896 Co-authored-by: Hermes Agent <agent@hermes.local>	2026-05-08 16:08:42 +00:00
nesquena-hermes	03bb364917	Stage 321: PR #1898+#1904 — profile-home in agent cache signature + functional regression test (closes #1897)	2026-05-08 16:08:18 +00:00
nesquena-hermes	f456daa574	fix(streaming): include profile home in agent cache signature (#1897) Same-session profile switches reused cached AIAgent from previous profile, silently leaking the old persona's SOUL.md / system prompt into the new profile's turns. session_id stays stable across profile switches, and the signature didn't include the active profile home, so every signature input matched and the stale agent was returned from SESSION_AGENT_CACHE. Append _profile_home to the signature blob so profile switches force a cache miss and a fresh agent build under the new HERMES_HOME (which triggers a fresh load_soul_md() call). Tests: 3 source-string regressions guarding the signature contract, ordering, and empty-home fallback. Closes #1897 Co-authored-by: Hermes Agent <agent@hermes.local>	2026-05-08 16:08:18 +00:00
nesquena-hermes	681456fc11	Stage 321: PR #1903 — scope skills endpoints to active profile by @Michaelyklam	2026-05-08 16:07:49 +00:00
Michael Lam	6c4b769324	fix: scope skills endpoints to active profile	2026-05-08 16:07:49 +00:00
Michael Lam	4366daba24	fix: use root home for gateway health status	2026-05-08 16:07:48 +00:00
nesquena-hermes	72b077ecce	Stage 320: PR #1889 — deduplicate workspace-prefixed user turns by @ai-ag2026	2026-05-08 15:48:28 +00:00
ai-ag2026	f6d09e06ca	fix: deduplicate workspace-prefixed user turns	2026-05-08 15:37:10 +00:00
nesquena-hermes	518453545c	Stage 320: PR #1865 — interim_assistant streaming in runtime + live UI by @franksong2702	2026-05-08 15:37:09 +00:00
nesquena-hermes	035c537281	Stage 320: PR #1861 — overwrite session usage per turn by @franksong2702	2026-05-08 15:37:09 +00:00
Frank Song	c1a9d7ce79	fix: overwrite session usage per turn	2026-05-08 15:37:09 +00:00
Frank Song	82c7367cef	Add interim_assistant streaming path to WebUI	2026-05-08 15:37:09 +00:00
nesquena-hermes	0039ae8c64	Stage 320: PR #1877 — honor configured max_turns in WebUI agents by @Michaelyklam	2026-05-08 15:37:08 +00:00
nesquena-hermes	f2194f13cd	Stage 320: PR #1860 — request wedge diagnostics by @franksong2702	2026-05-08 15:37:08 +00:00
Michael Lam	01b9c82dc9	fix: honor configured max_turns in WebUI agents Read agent.max_turns when constructing streaming WebUI AIAgent instances, pass it as max_iterations when supported, and include it in the per-session agent cache signature so budget changes take effect. Add regression coverage for the config read, constructor kwarg, and cache key.	2026-05-08 15:37:08 +00:00
Frank Song	7e2709e281	fix: add request wedge diagnostics	2026-05-08 15:37:08 +00:00
Frank Song	6808e06083	fix: isolate profile quota usage probes	2026-05-08 15:37:07 +00:00
nesquena-hermes	a11cbd3ee9	Stage 319: PR #1862 — preserve local custom provider model ids by @franksong2702	2026-05-08 15:16:18 +00:00
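
The three-layer session TTL precedence described in commits 9d7c213971 and 404e24ac9d can be sketched roughly as follows. This is an illustrative reconstruction from the commit messages, not the repository's code: the function name `resolve_session_ttl`, the helper `_valid_ttl`, the `settings` dict argument, and the reading of "1 year" as 365 days are all assumptions. Only `HERMES_WEBUI_SESSION_TTL`, `session_ttl_seconds`, `SESSION_TTL = 86400 * 30`, and the fall-through behaviour for out-of-range values come from the log above.

```python
import os

SESSION_TTL = 86400 * 30   # named default from the log: 30 days
MIN_TTL = 60               # lower bound from the log: 60 seconds
MAX_TTL = 86400 * 365      # "1 year"; 365 days is an assumption


def _valid_ttl(value):
    """Return value as an in-range int, or None if unusable (hypothetical helper)."""
    try:
        ttl = int(value)
    except (TypeError, ValueError, OverflowError):
        return None
    return ttl if MIN_TTL <= ttl <= MAX_TTL else None


def resolve_session_ttl(settings, environ=None):
    """Env var first, then settings, then the default.

    Out-of-range or malformed values fall through to the next layer
    rather than snapping to the bounds, as commit 404e24ac9d describes.
    """
    environ = os.environ if environ is None else environ
    candidates = (
        environ.get("HERMES_WEBUI_SESSION_TTL"),
        settings.get("session_ttl_seconds"),
    )
    for candidate in candidates:
        ttl = _valid_ttl(candidate)
        if ttl is not None:
            return ttl
    return SESSION_TTL
```

Passing `environ` explicitly keeps the sketch testable without mutating the process environment; calling it at each login or cookie write would give the "settings changes take effect immediately" behaviour the commit mentions.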


647 Commits
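
The composer-draft input validation described in commit 8782fd2675 (coerce, then clamp, before the session lock is taken) might look something like the sketch below. The function name `sanitize_composer_draft` and the constant names are hypothetical, and the handling of `None` and of non-list `files` values is an assumption; the log only states the coercion targets and the limits of 50_000 characters and 50 entries.

```python
MAX_DRAFT_TEXT_CHARS = 50_000  # limit stated in the commit message
MAX_DRAFT_FILES = 50           # limit stated in the commit message


def sanitize_composer_draft(text, files):
    """Coerce and clamp draft inputs before they are persisted."""
    # Coerce text to str; mapping None to "" is an assumption.
    if not isinstance(text, str):
        text = "" if text is None else str(text)
    # Coerce files to list; dropping non-sequence values is an assumption.
    if not isinstance(files, list):
        files = list(files) if isinstance(files, (tuple, set)) else []
    return {
        "text": text[:MAX_DRAFT_TEXT_CHARS],
        "files": files[:MAX_DRAFT_FILES],
    }
```

Running this before acquiring any lock means an oversized or malformed payload is trimmed without holding up other writers to the same session.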