hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-24 18:50:15 +00:00

Author	SHA1	Message	Date
bergeouss	237010f8bd	fix: remove phantom /sw.js from PUBLIC_PATHS whitelist (#1481 )	2026-05-03 18:18:14 +00:00
Dutch AI Agency	e4d2704ce8	fix: resolve local models from configured base url	2026-05-03 17:04:46 +00:00
Hermes Bot	0cbada7228	Stage 280: PR #1404 — cross-channel messaging handoff (Frank Song, rebased onto master)	2026-05-03 16:51:34 +00:00
Frank Song	c7e52084ba	Harden messaging channel handoff	2026-05-03 16:35:50 +00:00
Frank Song	20ef643bb8	Add messaging session handoff summary	2026-05-03 16:35:22 +00:00
nesquena	df0d904d87	fix(streaming): pass agent.reasoning_effort into WebUI agents (salvages #1531 ) Spliced from #1531 by @Asunfly: take Change-1 only (the actual bug fix + cache signature inclusion) and skip Change-2 (auxiliary title-route extra_body change) which is a separate scope concern. ## What Two surgical fixes in api/streaming.py: 1. Line 1820 — `_cfg.cfg.get(...)` → `_cfg.get(...)`. `get_config()` returns a plain dict (not a wrapper exposing `.cfg`). The buggy line raised AttributeError that the surrounding try/except swallowed, so `_reasoning_config` was always None regardless of what `/reasoning <level>` had been set to. Verified locally — `api/streaming.py:1959` already correctly used `_cfg.get(...)` in the same function, so the same `_cfg` was being read two different ways in one file. 2. Line 1888 — added `_reasoning_config or {}` to `_sig_blob`. Without this, switching effort mid-session would fail to take effect because the per-session agent cache key would still match the old entry. Mirrors how `resolved_provider` / `resolved_base_url` already participate in the signature. ## Why splice instead of merge #1531 directly @Asunfly force-pushed a Change-2 onto #1531 after the original review that removes `extra_body={"reasoning": {"enabled": False}}` from `generate_title_raw_via_aux` (the auxiliary title-generation route). That intent is reasonable (let operator-configured `extra_body.reasoning` flow through to the title route) but it touches a different surface and deserves its own PR. The narrow concern is operators who selected a reasoning-capable auxiliary title model without explicitly setting `reasoning.enabled=False` in the task config — pre-Change-2 the WebUI defended against accidental reasoning on the title hot path; post-Change-2 those configs would reason on every new conversation`s title, with cost and latency implications. ## What is NOT in this PR - The `generate_title_raw_via_aux` extra_body refactor (Change-2 from #1531). - The `test_does_not_override_configured_reasoning_extra_body` test (guards Change-2). Asunfly can re-open that as its own focused PR. ## Tests Two new R17b/R17c regression assertions in tests/test_regressions.py: - `test_streaming_reads_reasoning_effort_from_config_dict` — static-source guard: `_cfg.cfg` must not return to streaming.py - `test_streaming_agent_cache_signature_includes_reasoning_config` — catches removal of `_reasoning_config` from `_sig_blob` ## Closes - Closes #1531 (the Change-1 portion ships here; Asunfly can re-open Change-2 as a separate PR if desired) Co-authored-by: Asunfly <[email protected]>	2026-05-03 16:34:25 +00:00
Hermes Bot	a5e6b9dc8b	Merge PR #1526 by @ai-ag2026: pass WebUI max_tokens into agent + classify OpenRouter quota phrases (refs #1524 )	2026-05-03 16:06:55 +00:00
Hermes Bot	1148656370	Merge PR #1525 by @ai-ag2026: clear stale WebUI stream state proactively (refs #1471 ) Merge conflict resolution: kept HEAD's `CACHE_NAME = 'hermes-shell-__WEBUI_VERSION__'` (post-#1517 rename) over PR #1525's `'hermes-shell-__CACHE_VERSION__-stale-stream-cleanup1'` manual suffix. The renamed placeholder still auto-bumps with each release through the `quote(WEBUI_VERSION, safe="")` substitution, so the manual `-stale-stream-cleanup1` suffix is no longer needed to force-update existing service workers — the natural version bump (v0.50.278 → v0.50.279) already invalidates the old cache via `caches.delete(k)` for `k !== CACHE_NAME` in the SW activate handler. No behavioral regression: the SW cache still bumps on this release, just via the canonical version-token path. Co-authored-by: ai-ag2026 <ai-ag2026@users.noreply.github.com>	2026-05-03 16:06:42 +00:00
Hermes Bot	437eae00be	Merge PR #1532 by @ai-ag2026: recover WebUI-origin state.db sessions when JSON sidecar missing (refs #1471 )	2026-05-03 16:06:04 +00:00
Manfred	9c0a16fdd6	fix: recover WebUI-origin state.db sessions	2026-05-03 15:41:56 +02:00
Manfred	dbb0879956	fix: pass WebUI max_tokens to agents Read configured max_tokens from config.yaml, pass it into WebUI-created AIAgent instances when supported, and include it in the agent cache signature. Also classify OpenRouter quota phrasing such as more credits, can only afford, and fewer max_tokens. Adds regression coverage for max_tokens propagation, cache signature isolation, and quota error classification.	2026-05-03 11:46:42 +02:00
Manfred	6bce34c27e	fix: clear stale WebUI stream state Clear persisted active_stream_id and pending runtime fields when the server no longer has the referenced live stream. Also drop browser-side INFLIGHT state when the server reports a session idle and bump the service-worker cache so the frontend fix is delivered. Adds regression coverage for backend stale-stream cleanup, frontend inflight invalidation, and cache busting.	2026-05-03 11:46:42 +02:00
Frank Song	8f3dbe185d	fix: consolidate __CACHE_VERSION__ → __WEBUI_VERSION__ (#1509 ) __CACHE_VERSION__ (sw.js) and __WEBUI_VERSION__ (index.html) are functionally identical — both resolve to quote(WEBUI_VERSION, safe='') at request time. Two names exist for historical reasons (different files added at different times). Rename __CACHE_VERSION__ → __WEBUI_VERSION__ in: - static/sw.js (CACHE_NAME + VQ constant + comment) - api/routes.py (substitution string) - tests/test_pwa_manifest_sw.py (all assertions) Single canonical name. No behavior change — same ?v=vX.Y.Z query strings on the same URLs.	2026-05-03 14:59:37 +08:00
Hermes Bot	6381ab1b8a	fix(model-picker): deepcopy auto_detected_models per group to stop dedup bleed-across (#1511 root cause) Supersedes contributor PR #1511 (lost9999), which removed the label-suffix logic in _deduplicate_model_ids() but left the underlying shared-reference bug intact — IDs would still be silently corrupted across provider groups, just with cleaner-looking labels. ## Bug shape When multiple unconfigured providers (Ollama / HuggingFace / custom endpoints / Google Gemini CLI / Xiaomi / etc.) all fell through to the 'else' branch in api/config.py:get_models_grouped() that ends with: groups.append({..., "models": auto_detected_models}) every group ended up sharing the SAME list reference AND the SAME dicts inside. When _deduplicate_model_ids() then mutated those dicts to add @provider_id: prefixes and provider-name parentheticals, the changes were applied to every group that referenced the same dict. Visible symptom: user 'vishnu' reported the dropdown showing 'Deepseek V4 Flash (Xiaomi) (Ollama) (HuggingFace) (Google-Gemini-Cli)' on every group. Hidden symptom (worse): the 'id' field collapsed to '@xiaomi:deepseek-v4-flash' on every group too, so clicking the entry under any group routed the request to Xiaomi. ## Fix api/config.py:2078 — wrap auto_detected_models in copy.deepcopy() at the groups.append site so each group gets its own independent dicts. The existing _deduplicate_model_ids() logic is correct and unchanged; the bug was in the assignment site, not the dedup function. The single-parenthetical disambiguation in labels is retained because the composer chip (composer-model-label) shows the model label without the optgroup header context — 'Deepseek V4 Flash (Ollama)' is more useful than ambiguous 'Deepseek V4 Flash' there. ## Tests tests/test_issue1511_dedup_shared_reference.py — 3 new tests: - test_groups_have_independent_model_lists: structural invariant pin - test_unconfigured_providers_no_shared_dedup_bleed: end-to-end against the corrected code path; verifies each group gets its own @provider_id: prefix and exactly ONE provider parenthetical per disambiguated label - test_shared_reference_pre_fix_demonstrates_corruption: documents the broken state that motivated the fix Full suite: 3925 → 3928 passing (+3 new, 0 regressions). Co-authored-by: lost9999 <56498264+lost9999@users.noreply.github.com>	2026-05-03 06:41:11 +00:00
Hermes Bot	8f58688b66	test: lock /session/static MIME-type + auth fix; drop unused import - Add tests/test_session_static_assets.py (5 tests): * /session/static/style.css must return text/css (not text/html) * /session/static/ui.js must return application/javascript * /session/<id> still serves the HTML index (catch-all not weakened) * Path-traversal still sandboxed after prefix strip * /session/static/* matches /static/* auth-exemption policy - Drop unused 'from urllib.parse import urlparse as _up' import from PR #1505's added block (parsed._replace already gives a usable result). Co-authored-by: Rick Chew <rickchew@users.noreply.github.com>	2026-05-03 05:20:19 +00:00
Rick Chew	7cf2150b94	fix: serve static assets correctly under /session/* routes When the browser loads a session page at /session/<id>, it requests static assets relative to that path — e.g. /session/static/style.css. The /session/* catch-all in handle_get() intercepted those requests and returned the HTML index page (text/html), causing browsers to refuse the stylesheet with a MIME-type mismatch error. Two-part fix: - routes.py: add a guard before the /session/ catch-all that strips the /session prefix from /session/static/* paths and delegates to _serve_static(), so the correct Content-Type is returned. - auth.py: whitelist /session/static/* in check_auth() alongside /static/, so static assets on session pages are served without requiring an authenticated session (same policy as /static/). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-03 13:05:15 +08:00
Hermes Bot	ba6f34488e	fix(onboarding,probe): refuse HTTP redirects on probe path (reviewer-flagged on PR #1501 ) SSRF defense-in-depth: `urllib.request.urlopen` follows redirects by default, so a probe at `http://example.com/v1/models` could be redirected to `http://internal-service:8080/admin` — surfacing internal HTTP services to the authenticated user. The probe is already gated behind WebUI auth and the local-network check, so the practical attack surface is 'authenticated user enumerating internal services' (same as `curl` from their browser DevTools). Tightening the redirect default is cheap insurance. Implementation: - New module-level `_NoRedirectHandler` (subclasses `urllib.request.HTTPRedirectHandler`, overrides `redirect_request` to return None — urllib then raises `HTTPError(3xx)` rather than following). - New module-level `_PROBE_OPENER = urllib.request.build_opener(_NoRedirectHandler())`. - `probe_provider_endpoint` switches from `urlopen(req, …)` to `_PROBE_OPENER.open(req, …)`. - The existing `HTTPError` handler now categorizes 3xx as `unreachable` with a detail string mentioning 'redirect' so the user understands what happened. 3xx does NOT get its own error code in `PROBE_ERROR_CODES` — the error taxonomy contract stays the same shape (frontend i18n unchanged). Added regression test `test_probe_does_not_follow_redirects` in `tests/test_issue1499_onboarding_probe.py`. Spins up a tiny HTTP server that 302-redirects `/v1/models` to `/different-endpoint` (which would return `{'data': [{'id': 'should-not-see'}]}` if followed). Asserts the probe returns `{ok: False, error: 'unreachable', status: 302, detail: …'redirect'…}` and that the 'should-not-see' string never appears in the result. Mutation-verified: reverting `_PROBE_OPENER.open` back to `urlopen` causes the test to fail with "Probe followed a redirect — should have refused". Suite delta: 3917 → 3918 passing (+1). Reviewer-flagged in PR #1501. Per the 'reviewer-flagged-fix-in-release-not-followup' policy: <20 LOC defensive fix, regression test path obvious, ship in this release rather than punting.	2026-05-03 03:21:22 +00:00
Hermes Bot	8f4692b8cf	fix(onboarding): allow keyless setup for self-hosted providers (#1499 third sub-bug) Pre-fix, the wizard rejected an empty api_key for every provider in _SUPPORTED_PROVIDER_SETUPS — including lmstudio, ollama, and custom, which run keyless on the vast majority of local installs. The agent's LMSTUDIO_NOAUTH_PLACEHOLDER substitution at chat-time was the workaround for the no-auth case, but the wizard side rejected the empty input first. Users had to type random gibberish into the API key field to clear the form — the third sub-bug from #1420 that the prior commit's PR description explicitly punted to a follow-up. Surfaced by Nathan during PR review: "I think it's too weird for users to have to type a string into the API key field, right?" Yes — and the probe (#1499) makes the cleanest fix strictly better: we accept empty keys, and the probe gives instant feedback ("Connected. 2 model(s) available." for keyless servers, "401" for auth-required servers). Backend changes --------------- * `api/onboarding.py` — `_SUPPORTED_PROVIDER_SETUPS` gains `key_optional: True` for `lmstudio`, `ollama`, `custom`. Cloud providers (openrouter, anthropic, openai, gemini, deepseek, …) remain key_required. * `apply_onboarding_setup` skips the "{env_var} is required" check when `key_optional` is set AND no key is supplied. No write to .env for the empty-key case (no `LM_API_KEY=*** placeholder lying in the user's .env`). * `_status_from_runtime` reports `provider_ready=True` for key_optional providers based on `requires_base_url` alone, so the wizard doesn't refire on the next page load just because there's no api_key. Cloud providers still need a key for provider_ready=True. * `_build_setup_catalog` exposes the `key_optional` flag to the frontend. Frontend changes ---------------- * `static/onboarding.js` — new `_renderOnboardingApiKeyField()` helper. For key_optional providers: - Label: "API key (optional)" - Placeholder: "Leave blank for keyless servers" - Inline italic muted help: "Most LM Studio / Ollama / vLLM installs run keyless — leave this blank if your server doesn't require authentication. Use the Test connection button to verify." For cloud providers: unchanged (label "API key", standard placeholder, no help block). * The api-key input also now triggers `_scheduleOnboardingProbe()` on oninput, so changing the key re-runs the probe — handles "the server rejected my empty key with 401, let me add one and retry." * `static/i18n.js` — 3 new keys × 9 locales (canonical English in `en`, English fallback with `// TODO: translate` markers in the other 8). * `static/style.css` — `.onboarding-api-key-help` rule for the muted italic helper paragraph. Verified end-to-end on port 8789 -------------------------------- Spun up an isolated test server + a mock LM Studio at `127.0.0.1:11234/v1/models`. Stepped through the wizard: * Picked LM Studio → field label flipped to "API key (optional)", placeholder showed "Leave blank for keyless servers", help text rendered in italic muted gray below. * Switched to Anthropic → label reverted to "API key", help text disappeared. Visual hierarchy correct. * Left api_key blank, set base_url to the mock, clicked Test connection → green "Connected. 2 model(s) available." banner. Probe-discovered models populated the workspace-step dropdown. * Continued through to the finish step. config.yaml written with provider/model/base_url. `.env` does NOT exist — no placeholder string written. `chat_ready: true`, `state: ready`. * Vision tool confirmed the visual hierarchy: subtle italic help reads as documentation, prominent green banner pops as status. Tests ----- `tests/test_issue1499_keyless_onboarding.py` — 16 tests in 3 classes: TestKeyOptionalProviderSchema (5) - lmstudio / ollama / custom declare key_optional=True - openrouter / anthropic / openai do NOT (regression defense) - setup catalog exposes the flag TestKeylessOnboarding (6) - lmstudio / ollama / custom: empty api_key accepted, no .env write - openrouter / anthropic: empty api_key still rejected - lmstudio with explicit key still writes .env (regression defense) TestKeylessChatReady (5) - lmstudio / ollama: provider_ready=True with no key - custom: provider_ready=True with key+base_url, False without base_url - openrouter: provider_ready=False with no key (regression defense) - End-to-end get_onboarding_status reports chat_ready=True Full suite: 3901 → 3917 passing (+16 from this commit; +22 cumulative from the PR's earlier commit). 0 failures. Closes #1499 (all three sub-bugs from #1420 now addressed)	2026-05-03 03:07:07 +00:00
Hermes Bot	8616033605	fix(onboarding,providers): probe LM Studio /models + align env var with agent CLI (#1499 #1500 ) Addresses both #1499 (onboarding wizard never probes the configured base URL) and #1500 (cross-tool env-var name divergence between webui and agent CLI). Surfaced together because they're both LM-Studio onboarding bugs that pile on top of each other — fixing only one leaves the broken UX. #1499 — Onboarding wizard probes <base_url>/models before persisting Pre-fix, `apply_onboarding_setup` accepted whatever `base_url` the user typed without ever fetching `<base_url>/models`. @chwps's log timeline in #1420 showed the wizard finishing in 239ms with zero outbound HTTP — onboarding silently persisted unreachable URLs and left users with empty model dropdowns they had to populate by hand-editing config.yaml. Backend: * New `probe_provider_endpoint(provider, base_url, api_key, timeout=5.0)` in `api/onboarding.py`. Stdlib-only (urllib + socket — no httpx dep). Returns `{ok, models}` on success; `{ok: False, error: <code>, detail}` on failure with stable error codes the frontend can switch on: invalid_url, dns, connect_refused, timeout, http_4xx, http_5xx, parse, unreachable. 256 KB response cap and 5s timeout keep a hostile or mis- pointed endpoint from blocking the wizard. * New `POST /api/onboarding/probe` route — thin JSON wrapper around the function above. Same local-network gate as `/api/onboarding/setup` because the body carries an `api_key` the user typed. * The probe response is NEVER persisted. Only the user's typed selection ends up in config.yaml; the probed model list just populates the wizard's dropdown. * SSRF: deliberately does NOT block private-IP ranges. The wizard is gated behind WebUI auth and the legitimate target IS a local LM Studio / Ollama / vLLM server. A "block private IPs" SSRF defense would make the feature useless for its primary use case. Frontend: * `static/onboarding.js`: - New `ONBOARDING.probe` state ({status, error, detail, models, probedKey}). - `_runOnboardingProbe()` — POSTs to /api/onboarding/probe, idempotent & cached on (provider, baseUrl, apiKey). - Debounced (400ms) on `oninput` of the base URL field. - Explicit "Test connection" button. - `nextOnboardingStep` blocks Continue at the setup step for any provider with `requires_base_url=True` until the probe succeeds. Same localized error renders inline. * `static/i18n.js`: 13 new keys × 9 locales (canonical English in `en`, English fallback with `// TODO: translate` markers in the other 8 — same convention as v0.50.271 #1488 voice-buttons). * `static/style.css`: probe banner + Test button styling (red-tinted error variant, green-tinted success variant, neutral probing state). Verified via manual repro on port 8789: * connect_refused → red banner, helpful "from Docker, try the host IP" hint, blocks Continue. * DNS failure → red banner, "could not resolve host '...'", blocks Continue. * Success against a mock /v1/models server → green banner, model dropdown populates from the probed list, Continue advances normally. #1500 — webui env var aligned with agent CLI (LM_API_KEY) The webui has long used `LMSTUDIO_API_KEY` for LM Studio's API key in both onboarding and Settings detection. The agent CLI runtime (hermes_cli/auth.py:177-183) reads `LM_API_KEY`. So a user who configured auth on their LM Studio instance got Settings → Providers reporting has_key=True (because webui saw its own LMSTUDIO_API_KEY) but the agent runtime ignored the key and fell back to LMSTUDIO_NOAUTH_PLACEHOLDER → 401 against the auth-enabled LM Studio server. Masked in practice for the no-auth majority. Picked Option B from the issue (defer to the agent — single source of truth) but mitigated the migration cliff by reading the legacy name as a fallback: * `api/onboarding.py:_SUPPORTED_PROVIDER_SETUPS["lmstudio"]`: - `env_var: "LM_API_KEY"` (canonical, what onboarding writes going forward). - `env_var_aliases: ["LMSTUDIO_API_KEY"]` (read-only fallback for pre-#1500 users so detection keeps working without forcing an .env rewrite). * `api/onboarding.py:_provider_api_key_present` reads aliases too. * `api/providers.py:_PROVIDER_ENV_VAR["lmstudio"] = "LM_API_KEY"`. * `api/providers.py:_PROVIDER_ENV_VAR_ALIASES["lmstudio"] = ("LMSTUDIO_API_KEY",)` — new dict, used by `_provider_has_key` and `get_providers`'s key_source resolution. Drops in cleanly when other providers later rename their env vars too. Verified: ``` before fix: webui writes LMSTUDIO_API_KEY → agent ignores it → 401 on chat after fix: webui writes LM_API_KEY → agent picks it up → chat works pre-#1500 .env with LMSTUDIO_API_KEY → still has_key=True in Settings → key_source='env_file' ``` Tests * `tests/test_issue1499_onboarding_probe.py` — 17 tests: 3 invalid_url variants, dns, connect_refused, success (OpenAI shape), success (bare-list shape), http_4xx, http_5xx, parse non-JSON, parse wrong-shape, api_key authorization header passthrough, "probe must not write to config.yaml or .env", PROBE_ERROR_CODES contract pin, 3 end-to-end route-level smoke tests against the live server fixture. * `tests/test_issue1500_lmstudio_env_var_alignment.py` — 5 tests: onboarding declares LM_API_KEY canonical with LMSTUDIO_API_KEY alias, onboarding writes ONLY the canonical name, legacy env var still detected post-migration, canonical takes precedence when both are set, _provider_api_key_present reads aliases. * `tests/test_issue1420_lmstudio_provider_env_var.py` — updated: the original 5-test #1420 suite now pins LM_API_KEY as canonical and LMSTUDIO_API_KEY as alias. Full suite: 3879 → 3901 passing (+22), 0 failures. Out of scope (explicitly NOT addressed here) The third LM Studio onboarding sub-bug from #1420's thread — that `apply_onboarding_setup` requires a non-empty api_key for lmstudio even though most LM Studio installs run keyless — remains. The agent's `LMSTUDIO_NOAUTH_PLACEHOLDER` substitution kicks in at runtime, but the onboarding wizard rejects the empty-key case at submit. Fixing this requires a UX decision (auto-write a sentinel? loosen the required-key check for self-hosted providers?) and is left as a separate follow-up. Closes #1499 Closes #1500 Co-authored-by: chwps <106549456+chwps@users.noreply.github.com> Co-authored-by: AdoneyGalvan <25235323+AdoneyGalvan@users.noreply.github.com>	2026-05-03 02:46:24 +00:00
Hermes Bot	d3c7ac182b	fix(providers): map lmstudio to LMSTUDIO_API_KEY in _PROVIDER_ENV_VAR (#1420 ) After completing the onboarding wizard with the LM Studio provider, users saw LM Studio in the model picker and could chat normally, but Settings → Providers showed no LM Studio entry — or rendered it with has_key=False and configurable=False even when LMSTUDIO_API_KEY was already in ~/.hermes/.env. There was no UI surface to add or update the key. Root cause: api/providers.py:_PROVIDER_ENV_VAR — the dict that maps each provider id to its env-var name — is missing an "lmstudio: LMSTUDIO_API_KEY" entry. That dict drives two things: 1. _provider_has_key(pid) — env-var-based key detection. Returns False and sets key_source='none' if the pid isn't in the dict, regardless of what's in .env or os.environ. 2. get_providers() line 364: "configurable": not is_oauth and pid in _PROVIDER_ENV_VAR, Without the entry, configurable=False, hiding the "Add API key" form in the UI. So with no map entry, an LM Studio user with a working LMSTUDIO_API_KEY gets has_key=False (wrong) AND no UI to fix it (wrong-er). Same bug shape as #1410 (Ollama Cloud / local Ollama env-var collision). The #1410 fix dropped bare "ollama" from _PROVIDER_ENV_VAR because OLLAMA_API_KEY was shared with ollama-cloud and the runtime semantics made the local key detection ambiguous. LMSTUDIO_API_KEY has no such collision — it's only consumed by the lmstudio runtime. Verified via reproduction: before fix: lmstudio.has_key=False, configurable=False, key_source='none' after fix: lmstudio.has_key=True, configurable=True, key_source='env_file' 5 regression tests in tests/test_issue1420_lmstudio_provider_env_var.py: 1. _PROVIDER_ENV_VAR['lmstudio'] == 'LMSTUDIO_API_KEY' 2. LMSTUDIO_API_KEY in env → has_key=True + configurable=True 3. providers.lmstudio.api_key in config.yaml → has_key=True (fallback path) 4. No env, no config → has_key=False but configurable=True (UI fix surface) 5. LMSTUDIO_API_KEY doesn't cross-detect any other provider Mutation-verified: reverting the map entry causes 4 of 5 tests to fail with clear assertion messages naming the bug (the 5th — config.yaml fallback — is independent of the env-var path and intentionally remains green to pin that the existing path keeps working). Scope discipline: #1420's broader thread surfaces a sibling bug — the onboarding wizard never probes the configured <base_url>/v1/models endpoint before persisting (the wizard accepts unreachable URLs silently with no model-list dropdown population). That bug is being filed separately and is NOT addressed here. Adding a probe touches the wizard UX flow, has timeout / error-handling implications, and warrants its own design pass. Closes #1420 (the "LM Studio missing from Settings" half — feature- request half about provider catalog support is already shipped: LM Studio has been a first-class provider in api/onboarding.py since long before this issue). Co-authored-by: chwps <106549456+chwps@users.noreply.github.com> Co-authored-by: AdoneyGalvan <25235323+AdoneyGalvan@users.noreply.github.com>	2026-05-03 02:06:19 +00:00
Hermes Bot	c4ea9643f9	Stage 272: PR #1492 — P0 bugfixes (tool-card args + CLI rename + scroll pinning + sw.js relative-path regression test)	2026-05-03 01:34:10 +00:00
Hermes Bot	51a87ebdc7	fix(sqlite): close state.db connections explicitly to stop FD leak in sidebar polling (#1494 ) Production WebUI on macOS launchd reproduced an HTTP-unhealthy wedge after #1483 closed the bootstrap supervisor double-fork: process alive, port listening, every HTTP request reset by peer before a response. The reporter (@insecurejezza) traced it to FD exhaustion — 366 open FDs on the wedged process, 238 of them `~/.hermes/state.db`, `state.db-wal`, and `state.db-shm`. Root cause: four sqlite callsites use `with sqlite3.connect(...) as conn:`. Python's sqlite3 connection context manager only commits or rolls back on exit; it does NOT close the connection. `/api/sessions` polling calls these on every sidebar refresh, so each poll leaked one or more open state.db FDs until the process hit macOS's soft FD limit and new sqlite3.connect() calls inside fresh request handlers raised before any response bytes were written. Fix: wrap each `sqlite3.connect(...)` in `contextlib.closing(...)` so the connection is explicitly closed on scope exit, in addition to the auto- commit / rollback semantics that `Connection.__exit__` already provides. Callsites patched: - api/agent_sessions.py:read_importable_agent_session_rows - api/agent_sessions.py:read_session_lineage_metadata - api/models.py:get_cli_session_messages - api/models.py:delete_cli_session Reporter's verification (post-patch, 100-request stress loop against /api/sessions and /api/projects): batch=1 fd=92 state_handles=0 batch=2 fd=92 state_handles=0 ... batch=5 fd=92 state_handles=0 Pre-patch the same loop made FD count and state.db handle count climb monotonically. 4 regression tests in tests/test_issue1494_state_db_fd_leak.py monkeypatch sqlite3.connect with a tracking wrapper that records .close() calls and assert every connection opened by each of the four functions is explicitly closed. Verified to fail (catching the original bug) when the closing() wrap is reverted: "leaked 5 of 5 sqlite connection(s) — context-manager- only `with sqlite3.connect()` does not close. Wrap in contextlib.closing()." This addresses Bug #2 of the umbrella issue #1458. Bug #3 (HTTP-unhealthy wedge in the absence of FD exhaustion) remains open pending separate diagnostic data — explicit scope discipline. Closes #1494 Refs #1458 (Bug #2 of 3) Co-authored-by: insecurejezza <70424851+insecurejezza@users.noreply.github.com>	2026-05-03 01:15:26 +00:00
bergeouss	24a5457471	fix: P0 bugfixes — tool-card args, sw.js path, CLI rename, scroll pinning - #1481: Use absolute path for service worker registration to avoid <base> tag resolution on session pages causing JSON 404 - #1484: Fix tool-card expanded args readability — replace word-break:break-all with pre-wrap+break-word, add display:block so newlines and indentation are preserved - #1486: Prefer WebUI JSON title over state.db title for CLI sessions, fixing rename-not-persisting after compression chain extension - #1469/#1360: Add _programmaticScroll guard to distinguish programmatic scrolls from user scrolls, preventing the race condition where scrollIfPinned() re-pins after user scrolls up	2026-05-02 23:39:52 +00:00
Hermes Bot	26b332612d	fix(api): add pending_user_message to Session.compact() (#1479 )	2026-05-02 18:04:44 +00:00
Hermes Bot	bcfd8b2eac	chore(release): stamp v0.50.268 — 4-PR batch + Opus follow-ups (i18n + per-session fields + None title guard) - CHANGELOG.md: v0.50.268 entry detailing #1395 #1450 #1462 #1476 + Opus SHOULD-FIX followups - ROADMAP.md: bump to v0.50.268, 3800 tests collected - TESTING.md: bump header + total to 3800 SF-1 i18n fix: - static/i18n.js: session_meta_children key in all 10 locale blocks (en, ja, ru, es, de, zh, zh-Hant x2, pt, ko) - static/sessions.js: 2 callsites use t(session_meta_children, childCount) SF-2 #1462 per-session field carry-over: - api/routes.py: duplicate now carries personality, enabled_toolsets, context_length, threshold_tokens SF-3 #1462 None-title guard: - api/routes.py: (session.title or "Untitled") + " (copy)" Tests: - tests/test_stage268_opus_followups.py: 6 regression tests pinning SF-1 + SF-2 + SF-3 - tests/test_session_duplicate.py: 2 brittle assertions widened to accept new forms Follow-up issue filed: #1481 (PWA /sw.js whitelist vestige, Opus SF-4)	2026-05-02 17:54:58 +00:00
youzhi	b804b66238	Fix session list pending message payload	2026-05-03 01:44:38 +08:00
Hermes Bot	273888df48	fix(sidebar): nest child sessions under lineage roots (#1450 )	2026-05-02 17:41:05 +00:00
Hermes Bot	7c1b53258a	feat(api): /api/session/duplicate endpoint for session cloning (#1462 )	2026-05-02 17:41:05 +00:00
Hermes Bot	02726b9123	feat(pwa): Android PWA app installation with manifest and icons (#1476 )	2026-05-02 17:41:05 +00:00
Hermes Bot	6303a30a87	Address review feedback: deepcopy independence, persist on duplicate, reset pinned/archived, 404 status Five fixes from the May 2 2026 maintainer review: 1. messages and tool_calls now use copy.deepcopy() — prior plain assignment shared list refs between source and duplicate, so appending a turn to one mutated the other. 2. copied_session.save() called explicitly — pre-fix, the duplicate was in-memory only until the user sent a turn. Refreshing mid-flow lost it. 3. pinned and archived reset to False — duplicating an archived conversation should produce a visible (un-archived) copy. 4. Missing-session error is now status=404 (was default 400). 5. Removed redundant `import uuid` / `import time` inside the handler — both are already at the top of routes.py. Test updates: - Two existing static-grep tests widened to accept the new `copy.deepcopy(session.messages)` form alongside the original `messages=session.messages`. - Five new static-grep regression tests pin each of the five fixes so reverting any single one trips a test. All 3775 tests pass. Co-authored-by: Alexey Dsov <AlexeyDsov@users.noreply.github.com>	2026-05-02 17:39:55 +00:00
Jan	8e2fea6f5d	feature: add manifest and icons to enable app install on android	2026-05-02 19:06:39 +02:00
Hermes Bot	3abae9aca7	chore(release): stamp v0.50.267 — 7 contributor PR batch + Opus follow-up - CHANGELOG.md: v0.50.267 entry detailing #1454/#1474/#1461/#1465/#1467/#1460/#1473 + Opus advisor SHOULD-FIX trailing-empty guard for _norm_model_id - ROADMAP.md: bump to v0.50.267, 3776 tests collected - TESTING.md: bump header + total to 3776 - api/config.py: trailing-empty fallback in _norm_model_id (parts[-1] or s) - static/ui.js: mirror trailing-empty fallback in _normalizeConfiguredModelKey - tests/test_norm_model_id_trailing_empty_guard.py: 5 regression tests	2026-05-02 17:03:25 +00:00
AlexeyDsov	7c4c0142d5	feat(api): add /api/session/duplicate endpoint for session cloning\nNew endpoint creates independent session copies with all messages, model and workspace intact. Added 10 comprehensive regression tests for error handling and logic verification.	2026-05-02 11:59:45 +03:00
happy5318	d6164cdadb	Fix _norm_model_id to properly strip provider prefixes The _norm_model_id function was using split(':', 1)[1] which only removed the first colon-separated segment, leaving provider names in the normalized model ID. For example, '@custom:jingdong:GLM-5' became 'jingdong:glm.5' instead of 'glm.5'. This caused the default model injection check to fail, resulting in a duplicate 'Default' group being added to the model list even when the model already existed with a provider prefix. Changes: - Use split(':')[-1] to get the last segment after all colons - Use split('/')[-1] consistently for slash-separated paths - Replace local _norm lambda with _norm_model_id function call Fixes duplicate Default group appearing in model dropdown when using custom providers with @provider:model ID format.	2026-05-02 13:40:38 +08:00
nesquena-hermes	c73f2ff387	v0.50.264 polish followups: i18n parity + assistant-output readability Closes #1442 (server-side _LOGIN_LOCALE missing ja/pt/ko) Closes #1443 (promote _isImeEnter helper to 6 other Safari Enter guards) Closes #1446 (glued-bold-heading lift for LLM thinking-block output) Closes #1447 (markdown heading visual hierarchy in chat messages) All four issues were filed by the Opus pre-release advisor on the v0.50.264 batch or by Cygnus via Discord (relayed by @AvidFuturist, May 1 2026). They share a common shape — narrow, well-scoped, independent of each other, all adding regression tests. == #1442: _LOGIN_LOCALE parity (api/routes.py + static/i18n.js) == Added entries for ja/pt/ko to the server-side _LOGIN_LOCALE dict that renders the localized login page BEFORE the JS i18n bundle loads. With v0.50.264 shipping Japanese as the 8th built-in locale, ja/pt/ko users were seeing the English login page even with their language preference set. While auditing static/i18n.js for English leakage, also fixed: - ko: 10 user-facing login/sign-out/password keys still in English - es: 3 sign-out/auth-disabled keys still in English Tests: tests/test_login_locale_parity.py (20 tests) — pins both invariants: (a) every locale in i18n.js LOCALES has a matching _LOGIN_LOCALE entry (b) every locale's login-flow keys (13 of them) are translated, not English == #1443: window._isImeEnter promotion == PR #1441 fixed the Safari IME-composition Enter race in the chat composer (`#msg`) by widening the guard from `e.isComposing` to a `_isImeEnter(e)` helper that combines three signals (isComposing \|\| keyCode===229 \|\| _imeComposing flag). Six other Enter-input handlers were left on the original narrow guard and would still drop IME composition Enters on Safari for Japanese/Chinese/Korean users. Promoted the helper to `window._isImeEnter` (defined in static/boot.js) and replaced the `e.isComposing` guards at all six sites: - static/sessions.js: session rename, project create, project rename - static/ui.js: app dialog (confirm/prompt), message edit, workspace rename The state-free part of the helper (`isComposing \|\| keyCode===229`) handles Safari's race for any focused input without needing per-input composition listeners — only `#msg` keeps the local `_imeComposing` flag. Tests: - tests/test_issue1443_ime_helper_promotion.py (9 tests) — pins each site + verifies no raw `e.isComposing` Enter-guards remain in sessions.js/ui.js - tests/test_ime_composition.py — alternation regex extended to accept the windowed helper form (loosen-test-on-shape-change pattern from v0.50.264 reflection notes) == #1446: glued-bold-heading lift (static/ui.js renderMd + Python mirror) == LLMs in thinking/reasoning mode emit "section headers" glued to the end of the previous paragraph with no whitespace: Para 1 text.Heading to Para 2 Para 2 text.Heading to Para 3 The renderer correctly produces inline `<strong>` per CommonMark, but it looks like trailing emphasis on the body text rather than a section break. Cygnus reported this as "Markdown feedback 2 of 3." Added a single regex pre-pass in renderMd(): s.replace(/([.!?])\\([^\n]{1,80})\\\n\n/g, '$1\n\n$2\n\n') Constraints chosen to avoid false positives: - Trigger only on `[.!?]` IMMEDIATELY before `` (no space) — almost always an LLM-glued heading, not intentional emphasis - Inner text ≤80 chars, no `` or newline (single-line only) - Trailing `\n\n` required — preserves "this is important to know." mid-paragraph emphasis untouched - Position: after rawPreStash restore, before fence_stash restore — fenced code blocks stay protected (their content is `\x00P` / `\x00F` tokens when the lift runs) Mirrored in tests/test_sprint16.py render_md() so both stay in sync. Tests: tests/test_issue1446_glued_heading_lift.py (17 tests, 5 of which drive the actual ui.js renderMd via node) — covers all 3 trigger forms (.!?), all 4 preserve-emphasis cases the issue spec'd, fenced/inline code protection, chained glued headings, source-level position pin, regex shape pin. == #1447: markdown heading visual hierarchy (static/style.css) == Pre-fix sizes in `.msg-body`: h1 18px, h2 16px, h3 14px (= body), h4 13px, h5 12px, h6 11px So h3 was indistinguishable from body and h4/h5/h6 were SMALLER than body. Cygnus's report: "Markdown feedback 3 of 3 — Headings seem to be missing across the board in Hermes. They're there, but all plaintext." New sizes: h1 24px (border-bottom) h2 20px (border-bottom) h3 17px h4 15px h5 14px (uppercase, tracked) h6 13px (uppercase, tracked, muted) All headings now `font-weight:700` + `color:var(--strong)` for stronger ink. h5/h6 use uppercase + letter-spacing for "label-style" affordance instead of being smaller-than-body. Synced .preview-md (file preview pane) to match exactly so a markdown file preview and a chat message render identically. Added missing h4/h5/h6 rules to .preview-md (it only had h1-h3 before). Updated data-font-size="small"/"large" h1-h6 overrides to scale proportionally with the new defaults. Hierarchy preserved at all three font-size settings. Tests: tests/test_issue1447_heading_hierarchy.py (9 tests) — pins the size hierarchy, the bottom borders on h1/h2, the uppercase affordance on h5/h6, the .preview-md sync, and the small/large override scaling. == Verification == pytest tests/ -q → 3748 passed (+56 new) bash ~/WebUI/scripts/run-browser-tests.sh → 20 + 11 PASS bash ~/WebUI/scripts/webui_qa_agent.sh 8789 → 23/23 PASS Visual confirmation in browser at port 8789: - Heading hierarchy clearly visible at all 6 levels - Glued-bold lift produces separate paragraphs as designed - window._isImeEnter accessible from any module after boot.js - Login page renders ja/pt/ko strings correctly (curl -s /login)	2026-05-02 04:19:28 +00:00
Dennis Soong	082f3d45b7	fix: nest child sessions under lineage roots	2026-05-02 12:09:36 +08:00
nesquena-hermes	4ee9368464	Opus pre-release follow-ups for PR #1445 REQUIRED: - _fully_unquote_path range(3) -> range(10) — defense-in-depth so quadruple- encoded .. is rejected by validator instead of slipping through (not exploitable but contract violation) - docs/EXTENSIONS.md trust-model callout moved to top of file with explicit 'don't enable in untrusted env / don't point at user-writable dir' guidance NICE-TO-HAVE (taken since Nathan asked for all fixes big and small): - URL list cap at _MAX_URL_LIST=32 to avoid pathological rendering - One-shot WARNING log for rejected URLs (silent drop now visible to admin) - One-shot WARNING log for URL list truncation - MIME map: ttf (font/ttf), otf (font/otf), wasm (application/wasm) 5 regression tests in tests/test_pr1445_opus_followups.py pin all invariants.	2026-05-02 03:49:40 +00:00
Ryan Jones	9de61a0b9a	feat: add opt-in webui extension hooks	2026-05-02 03:36:54 +00:00
nesquena-hermes	081e600b33	fix: context-window indicator broken on older sessions (#1436 ) Fix two-layer bug where `/api/session` returned `context_length=0` for sessions that pre-date #1318, then the frontend silently fell back to cumulative `input_tokens` and the 128K JS default, producing nonsense indicators like "100" capped from "890% used (context exceeded), 1.2M / 131.1k tokens used". Empirical impact: 23 of 75 sessions on dev server rendered >100% before this fix. #1356 fixed the same symptom on the live SSE path but missed the GET /api/session load path that older sessions go through. Two-layer fix: 1. Backend (api/routes.py:1295-1313) — resolve context_length via agent.model_metadata.get_model_context_length() when the persisted value is 0. Mirrors api/streaming.py:2333-2342. 2. Frontend (static/ui.js:1269) — drop the cumulative `input_tokens` fallback. When last_prompt_tokens is missing, render "·" + "tokens used" (existing !hasPromptTok branch) instead of computing a percentage from the cumulative total. 10 regression tests in tests/test_issue1436_context_indicator_load_path.py covering both layers + the empty-model edge case (avoids the 256K default-for-unknown-model trap that get_model_context_length('') returns). Verified live: claude-opus-4-7 session with input_tokens=5,226,479 now renders "·" + "5.3M tokens used" instead of "100" + "3987% used". Reported by @AvidFuturist. Closes #1436.	2026-05-02 01:43:00 +00:00
nesquena-hermes	c75ce33280	v0.50.259: Opus pre-release follow-up — close _session_db on LRU eviction + CHANGELOG + 5 regression tests PR #1421 (SessionDB WAL handle leak fix on cached-agent reuse path) had a sibling leak at the LRU eviction site that I caught during pre-review: api/streaming.py SESSION_AGENT_CACHE.popitem(last=False) was discarding the evicted entry with `evicted_sid, _ = ...`. The agent's _session_db was dropped on the floor and only released when GC eventually finalized the agent — which on a long-running server may be never (cyclic refs, extension types holding C handles, etc.). Same fix shape as #1421: capture the evicted entry, call _evicted_agent._session_db.close() explicitly. SessionDB.close() is idempotent + thread-safe (with self._lock: if self._conn:), so the double-close-is-benign property still holds. 5 regression tests in test_v050259_sessiondb_fd_leak.py: - Source-level: cached-agent reuse path closes before replace - Source-level: LRU eviction path captures + closes evicted agent - Behavioral: SessionDB.close() is idempotent (3 calls safe) - Behavioral: cached-agent reuse with mock — close called exactly once - Behavioral: LRU eviction with mock — only evicted agent's DB closes Full suite: 3615 passed, 0 failed. Nathan explicitly authorized 'just go ahead and merge it as a small release' since the PR is 9 LOC, focused, has Opus pre-release follow-up + tests, and matches the empirically-confirmed leak shape (73-handle leak at EMFILE).	2026-05-01 22:42:53 +00:00
nesquena-hermes	f05893215e	Merge PR #1421 from wali-reheman: close previous SessionDB before replacing on cached agent	2026-05-01 22:38:53 +00:00
nesquena-hermes	399f12ac96	v0.50.258: Opus follow-up — fix multi-param redirect-encoding bug + CHANGELOG PR #1419 (login session TTL + redirect-back + connectivity probe) had a real bug in the server-side ?next= construction: quote(path, safe='/:@!$&'()*+,;=') keeps ? and & literal, so: (a) /api/sessions?limit=50&offset=0 round-trips as /api/sessions?limit=50 — the inner & terminates the outer next= value and offset=0 leaks as a top-level outer query the login page ignores. (b) An attacker-controlled path with embedded &next=https://evil.com injects a second top-level next parameter. Browsers parse first-match (benign), Python parse_qs parses last-match (the evil URL) — the parser-divergence is a footgun even though _safeNextPath() in login.js rejects the actual exploit. Fix: encode the entire path?query blob with safe='/' so ?, &, = all percent-encode. The outer next then holds exactly one path-with-query string the browser auto-decodes once. 6 regression tests in test_v050258_opus_followups.py pin round-trip behavior across simple paths, single-query, multi-param queries, attacker-injection neutralization, and the SESSION_TTL=30d constant. Full suite: 3610 passed, 0 failed.	2026-05-01 21:30:10 +00:00
Wali Reheman	9b987eefb0	fix: close previous SessionDB before replacing on cached agent SessionDB WAL handles leak when streaming.py creates a new SessionDB instance per request and replaces the cached agent's _session_db without closing the old one. Each orphaned connection holds 2 FDs (.db + .db-wal), causing FD exhaustion and EMFILE crashes after ~73 messages. Fix: close the previous _session_db before replacing it on cached agents, mirroring the close-before-replace pattern used elsewhere in the codebase.	2026-05-01 13:51:21 -07:00
bsgdigital	9c0667d187	fix(auth): extend session TTL to 30 days + redirect back after login	2026-05-01 19:54:47 +00:00
nesquena-hermes	c78bcddda6	v0.50.257: CRITICAL Opus finding — fix non-functional per-session toolset override Opus pre-release advisor caught a 5th issue not covered by my initial follow-up sweep, this one CRITICAL: PR #1402 #493 per-session toolset override silently no-op'd every time. Bug: api/streaming.py:1755 called _session_meta.get('enabled_toolsets') on the result of Session.load_metadata_only(). It returns a Session INSTANCE, not a dict. .get() raised AttributeError, which the surrounding bare except swallowed silently. The toolset chip in the UI saved correctly to disk, but the streaming agent always ran with global toolsets. Fix: use getattr(_session_meta, 'enabled_toolsets', None). Two new regression tests: - Source-level: forbid the .get() / [] dict-access shape. - Runtime: Session.load_metadata_only must return a Session instance. Full suite: 3604 passed, 0 failed.	2026-05-01 18:36:24 +00:00
nesquena-hermes	f8007d43f3	v0.50.257: 4 Opus pre-release follow-ups + CHANGELOG + test fixes for #1415 stage-257 batch (PRs #1402 + #1415): Opus pre-release advisor caught 4 issues in stage-257: 1. MUST-FIX (security): api/oauth.py::_write_auth_json — tmp.replace() preserves the temp file umask (0644 default), so OAuth access/refresh tokens landed world-readable on shared systems. Fix: tmp.chmod(0o600) BEFORE rename, with try/except OSError that warns but does not abort. 2. SHOULD-FIX: _handle_cron_history and _handle_cron_run_detail accepted job_id as a path component without validation. Mirrors the rollback path-traversal vector caught in v0.50.255 (#1405). Path() / .. does NOT normalize. New regex ^[A-Za-z0-9_-][A-Za-z0-9_.-]{0,63}$ with explicit . / .. rejection. 3. SHOULD-FIX: _handle_cron_history int(offset)/int(limit) raised ValueError on malformed input → confusing 500. Now try/except + clamp to (max(0, offset), max(1, min(500, limit))). 4. NIT: same regex applied to _handle_cron_run_detail (defense-in-depth even though path-resolve check would catch it downstream). PR #1415 follow-up: 8 pre-existing tests in test_issue1106 and test_custom_provider_display_name asserted bare model IDs but #1415 changes named-custom-provider IDs to @custom:NAME:model form when active provider differs. Tests updated to use _strip_at_prefix helper to keep checking the same invariant in the new shape. 4 regression tests in test_v050257_opus_followups.py + 8 fixed pre-existing tests. Full suite: 3602 passed, 0 failed.	2026-05-01 18:30:41 +00:00
nesquena-hermes	42d4070e2d	Merge PR #1415 from Thanatos-Z: fix named custom provider routing in model picker	2026-05-01 18:20:07 +00:00
nesquena-hermes	bc17229a7d	Merge PR #1402 from bergeouss: P2 improvements — cron history, toolsets per session, Codex OAuth # Conflicts: # static/i18n.js	2026-05-01 18:20:05 +00:00
youzhi	59e07f3fff	Fix WebUI custom provider routing	2026-05-02 02:11:41 +08:00
nesquena-hermes	fcba6fda1c	Merge PR #1411 from nesquena-hermes: TTS toggle CSS specificity collision (#1409 ) + Ollama env var bleed (#1410 ) # Conflicts: # CHANGELOG.md	2026-05-01 17:34:28 +00:00

1 2 3 4 5 ...

415 Commits