hermes-webui

mirror of https://github.com/nesquena/hermes-webui.git synced 2026-05-24 18:50:15 +00:00

Author	SHA1	Message	Date
nesquena-hermes	7d1aa2e261	v0.50.209: check-for-updates, workspace toggle, HTML preview, provider categories, queue flyout docs (#1042 ) * feat: add manual 'Check for Updates' button in System settings (#785) Add a 'Check now' button next to the version badge in the System settings section, allowing users to manually trigger an update check at any time without waiting for the automatic periodic check. Changes: - index.html: add button with spinner and status text inline with version badge - panels.js: add checkUpdatesNow() calling /api/updates/check?force=1 with immediate feedback (checking... / up to date / X updates available) - style.css: style the button block and spinner - i18n.js: add 5 new keys (settings_check_now, settings_checking, settings_up_to_date, settings_updates_available, settings_updates_disabled) in all 6 locales (en, ru, es, de, zh, zh-Hant) * fix: sanitize error message in checkUpdatesNow to avoid exposing paths Review feedback: strip filesystem paths from error messages and cap length to prevent internal details leaking into the UI. * fix: fully sanitize error in update check — never expose raw e.message in UI Previous partial fix (`80cdaee`) stripped filesystem paths from e.message but still displayed the JS exception message to users. Per reviewer feedback and project convention (NEVER expose raw e.message in UI), replace with: - A generic user-facing i18n key (settings_update_check_failed) as default - Fallback to API response body error if available (structured, not raw) - Full error logged via console.warn for debugging - Button disable-during-check already confirmed working (try/finally pattern) - settings_update_check_failed key added in all 6 locales * fix(#785): align HTML selectors with CSS and add regression tests - Wrap update button in div#checkUpdatesBlock so CSS selectors apply - Change button class from sm-btn to btn-tiny (matching stylesheet) - Remove inline styles now handled by CSS (#checkUpdatesBlock, .btn-tiny) - Move spinner sizing to CSS class .spinner-xs - Add 4 static tests in test_update_banner_fixes.py: checkUpdatesNow defined, btnCheckUpdatesNow in HTML, CSS selectors exist, i18n key in all locales * feat: 'Keep workspace panel open' toggle in Appearance settings (#999) * feat: categorize providers in setup wizard (#603) - Add 6 new providers: Google Gemini, DeepSeek, Mistral, xAI (Grok), Ollama, LM Studio to the onboarding quick-setup catalog - Group providers into 3 categories: Easy start, Open/self-hosted, Specialized — rendered as <optgroup> in the provider dropdown - Generic base_url save logic (requires_base_url + default_base_url) instead of hardcoded provider checks - i18n keys for category labels in en, ru, es, zh, zh-Hant * ci: re-run tests * fix(tests): prevent reload_config() from overwriting in-memory mock in test_issue644 The test helper _available_models_with_cfg patches cfg in-memory but get_available_models() calls reload_config() when the config file's mtime doesn't match _cfg_mtime. On CI, config.yaml exists so mtime > 0 and _cfg_mtime starts at 0.0, triggering a reload that overwrites the test's mock with on-disk content. Fix: freeze _cfg_mtime to the current config file mtime inside the helper, so reload_config() is not triggered during the test. * fix: correct default model IDs for gemini, xai, deepseek; add specialized provider tests - gemini: gemini-3.1-pro-preview → gemini-2.5-pro-preview - x-ai: grok-4.20 → grok-3 - deepseek: deepseek-chat-v3-0324 → deepseek-chat - Add TestApplyBaseURLSpecialized: 4 tests verifying base_url written for gemini, deepseek, mistral, and x-ai through apply_onboarding_setup * test: add TestApplyBaseURLSpecialized — verify base_url written for gemini, deepseek, mistralai, x-ai * fix(onboarding): correct stale model defaults for specialized providers Three issues in the new specialized provider catalog (#1027 hold reason): 1. gemini default_model was `gemini-2.5-pro-preview` — agent's catalog has the 3.1 family. Updated to `gemini-3.1-pro-preview`. 2. x-ai default_model was `grok-3` — agent's catalog has `grok-4.20`. Updated. 3. gemini `models` list was sourcing from `_PROVIDER_MODELS.get("gemini")` which returns []. The catalog in api/config.py is keyed under "google" (even though the agent's alias map normalizes google -> gemini). Switched to `_PROVIDER_MODELS.get("google")` so the wizard surfaces the actual 5-model list. Also forward-compatible lookup for x-ai (xai or x-ai key). Without these fixes, users picking gemini or x-ai in the wizard would see no model dropdown and the default_model written to config.yaml would 404 on first chat. deepseek default_model bumped from `deepseek-chat` to `deepseek-chat-v3-0324` to match the test fixture's expectation and the agent catalog's pinned version. Added two regression tests: - test_gemini_model_list_is_populated: pins the catalog-key correctness - test_specialized_default_models_match_catalog: pins the version prefixes (3.x for gemini, 4.x for grok) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat: inline HTML preview in workspace panel (#779) Render .html/.htm files as live previews in a sandboxed iframe instead of showing raw source code. Adds an 'Open in browser' button to open the file in a new tab. Changes: - workspace.js: add HTML_EXTS set, 'html' preview mode, iframe routing in openFile(), and openInBrowser() function - index.html: add sandboxed iframe element and 'Open in browser' button in preview toolbar (visible only for HTML files) - i18n.js: add 'open_in_browser' key in all 6 locales The iframe uses sandbox='allow-scripts' for security. Download button remains available alongside the new preview. * docs: document sandbox security tradeoff for HTML preview Review feedback: fileExt() already lowercases extensions so .HTML/.HTM work. Added code comment explaining the deliberate sandbox=allow-scripts choice: scripts are needed for most HTML documents but the iframe is still origin- isolated and cannot access parent cookies/data. * fix: pass ?inline=1 to file/raw so HTML preview iframe renders instead of downloading routes.py: add inline_preview param — bypasses Content-Disposition:attachment for text/html when ?inline=1 is set, serving the file inline for the sandboxed iframe. workspace.js: add &inline=1 to the iframe src URL. test: add 5 static regression tests for the inline HTML preview. * fix(security): CSP sandbox header for inline HTML preview The iframe sandbox="allow-scripts" attribute on previewHtmlIframe only applies when HTML is loaded INSIDE that iframe. A user tricked into opening /api/file/raw?path=evil.html&inline=1 directly in a top-level tab (e.g. via a chat link) would render the HTML in the WebUI's origin without any sandbox, giving the page full access to cookies and localStorage. Server-side Content-Security-Policy: sandbox allow-scripts mirrors the iframe sandbox exactly: scripts run, but the document is treated as a unique opaque origin (no allow-same-origin) and cannot read WebUI cookies, localStorage, or postMessage to the parent regardless of how the URL is accessed. Added test_inline_html_response_sets_csp_sandbox to pin the header. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * docs: v0.50.209 release notes — 4 PRs, 2212 tests (+43) * docs(changelog): document #1040 queue flyout and Cloudflare CSP in v0.50.209 The stage commit `ed2bd18` listed v0.50.209 as a 4-PR release but the stage actually bundles 5 PRs — #1040 (queue flyout) was cherry-picked in without a corresponding CHANGELOG entry. Without this fix, the queue feature ships silently and the bundled Cloudflare CSP relaxation in api/helpers.py is also undocumented. Adds two entries: - Added: queue flyout (#1040) under v0.50.209 - Changed: CSP allowlist for Cloudflare Access deployments Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: bergeouss <bergeouss@users.noreply.github.com> Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com> v0.50.209	2026-04-25 14:33:41 -07:00
nesquena-hermes	3ce7844a7a	feat(queue): Codex-style message queue flyout above composer (#1040 ) * chore: apply pending #965 queue flyout patches on local master Queue flyout implementation (PR #965 — pending merge) applied on top of upstream v0.50.205. Features: - Queue card slides up from behind composer (approval-card pattern) - Lucide icons via li(), CSS class system, no inline SVG dumps - Drag-to-reorder by _queued_at timestamp (survives re-renders) - Inline contenteditable edit with focus guard and blur-commit - Combine preserves first item files, merge immediate (no 200ms race) - Files/model compact badges per item - Hide/expand via header chevron + composer pill + titlebar chip - All 3 expand paths sync correctly - border-bottom CSS order fixed, fingerprint improved, _dragTs guards CF CSP domains also applied (deployment-specific, not in upstream PR). * fix(queue): harden merge closure, toggleQueue sid, and drain flash - mergeBtn _doMerge now reads live queue (_getSessionQueue) instead of stale closure q - toggleQueue reads activeSid from S.session at call time, not captured param - updateQueueBadge defers chips.innerHTML='' by 360ms so slide-out transition completes before content clears * style(queue): contain:paint on inner, pill fade-in animation * feat(queue): pill outside composer, compact collapsed state matching card width - Move #queuePill out of .composer-box to between .composer-flyout and .composer-box - Pill styled as compact queue-card-inner (same border, radius:14px 14px 0 0, no border-bottom) - Pill width matches card inner: max-width:calc(var(--msg-max)-40px), centered - Pill stays visible until user re-expands or queue drains (updateQueueBadge no longer hides pill when card is manually collapsed) - Remove all queue-active/queue-pill-active composer modifications — composer untouched - Fix: mergeBtn reads live queue not stale closure - Fix: toggleQueue uses S.session.session_id at call time not captured param - Fix: chips.innerHTML deferred 360ms on drain to avoid empty-card flash * fix(queue): collapsed state persists + cross-session DOM isolation - Add _queueCollapsed[sid] flag: set by hideBtn, cleared by pill expand / queue drain - _renderQueueChips respects flag — no longer reopens card when new message queued while collapsed - updateQueueBadge else-branch: DOM mutations now gated on sid===active session - _syncQueueTitlebar only fires for active session in else-branch - Fixes Opus/Codex-identified bugs: pill auto-reopen and cross-session DOM corruption * fix(queue): proper pill wrapper matching queue-card structure - Add .queue-pill-outer div wrapper (max-width:var(--msg-max); padding:0 20px) identical to .queue-card outer — positions pill button at exact card-inner width - .queue-pill button fills slot with width:100% - Removes hardcoded 740px — width is derived correctly from the same CSS variables the card uses, scales with --msg-max across all viewports - JS toggles .show on pillOuter (parentElement), not on pill button directly --------- Co-authored-by: Basit Mustafa <basit.mustafa@gmail.com> v0.50.208	2026-04-25 14:21:50 -07:00
nesquena-hermes	ad8e10304c	v0.50.207: batch of 10 PRs — TPS stat, SSE guard, session polish, cron UX, folder create, model errors, session speed, title gen (#1031 ) * fix: remove orphaned i18n keys from top-level LOCALES object Three Traditional Chinese translation keys (cmd_status, memory_saved, profile_delete_title) were placed outside any locale block between the en and ru blocks in static/i18n.js. They became top-level properties of the LOCALES object, causing them to appear as invalid language options in the Settings > Preferences dropdown. The correct translations already exist in the zh-Hant locale block. Fixes #1008 * fix: block stale SSE events from polluting new session's DOM - appendThinking(): guard with !S.session\|\|!S.activeStreamId to drop events from a previous session's SSE stream during a session switch - appendLiveToolCard(): same guard for consistency - finalizeThinkingCard(): scroll thinking-card-body to top when scroll is pinned, so completed response is immediately visible - appendThinking(): auto-scroll thinking card body to bottom while streaming if user is watching (scroll pinned) * Fix empty agent sessions in sidebar * fix: resolve cron UI UX issues — icon ambiguity, toast overlap, running status Fixes #995 — three sub-issues in the Cron Jobs UI: 1. Dual play icons ambiguous: Resume button now shows a distinct play+bar icon (play triangle + vertical line) instead of the identical triangle used by Run now. 2. Toast notification overlapping header buttons: Added position:relative; z-index:10 to .main-view-header so it stacks above the fixed toast (z-index:100 within its layer). 3. No running status after trigger: After triggering a job, the status badge immediately shows 'running…' with a CSS spinner animation, and polls the cron list every 3s (up to 30s) to refresh when the job completes. - Added cron_status_running i18n key in all 5 locales (en, es, de, ru, zh, zh-Hant) - Added .detail-badge.running CSS class with spinner animation - New functions: _setCronDetailStatus(), _startCronRunningPoll() * fix(#1011): address review feedback — poll cleanup, badge persistence, 30s fallback - _clearCronDetail() now clears _cronRunningPoll interval on navigation - Poll re-applies 'running' badge after loadCrons() re-render (prevents flicker) - When poll ends (30s max), detail re-renders with actual status as fallback * feat: create folder and add space directly from UI (#782) - After creating a folder via the file tree New folder button, offer to add it as a space via confirm dialog - Add Create folder if it doesnt exist checkbox in the New Space form - Backend: support create flag in /api/workspaces/add to mkdir before validation - i18n: 4 new keys (folder_add_as_space_title/msg/btn, workspace_auto_create_folder) in all 6 locales * fix: validate workspace path before mkdir to prevent orphan directories Review feedback (critical): the previous code called mkdir() before validate_workspace_to_add(), which meant a rejected path (e.g. system dir) would leave an orphan directory on disk. New flow: 1. Resolve path and check against blocked system roots BEFORE any mutation 2. mkdir() only if path passes the blocklist check 3. Full validation (exists, is_dir) after mkdir Also imports _workspace_blocked_roots for the pre-mutation blocklist check. * fix(#1014): classify model-not-found errors with helpful message - Add model_not_found error type to streaming.py exception classifier - Detect 404, 'not found', 'does not exist', 'invalid model' patterns - Strip HTML tags from provider error messages (nginx 404 pages, etc.) - Add model_not_found branch to apperror handler in messages.js - Add i18n key model_not_found_label in all 6 locales - 15 tests covering detection, sanitization, frontend, and i18n * feat(ui): add live TPS stat to header Adds a TPS (Tokens Per Second) chip to the right of the header title bar that updates live while AI output is streaming. Metering (api/metering.py) - Tracks per-session output + reasoning tokens via GlobalMeter singleton - Per-session TPS = total_tokens / elapsed_time - Global TPS = average of active sessions' TPS values - HIGH/LOW are max/min of global_tps snapshots over a 60-minute rolling window (only recorded when > 0, so idle periods are excluded) - Thread-safe with a single lock Metering events emitted from streaming.py - Throttled at 100ms from token/reasoning/tool callbacks so the display updates rapidly during fast token streams - 1Hz ticker as fallback for slow streams (exits when no active sessions) - Final stats emitted on stream end Routes (api/routes.py) - Removed POST /api/metering/interval endpoint (dynamic interval via focus/blur was replaced with simple always-1s-when-active approach) UI (static/messages.js, index.html, style.css) - TPS chip in titlebar: shows 'N.N t/s . N.N high . N.N low' - Default: '0.0 t/s . 0.0 high' when idle - Display updates on every metering SSE event (throttled to 100ms) * feat: session restore speed + title gen reasoning hardening (#1025, #1026) PR #1025 (@franksong2702): Speed up large session restore paths - GET /api/session?messages=0 now parses only metadata before the messages array - Metadata-only loads no longer populate the full-session LRU cache - Frontend lazy fetch uses resolve_model=0 to avoid cold model-catalog lookup - Hard reload no longer waits for populateModelDropdown() before restoring session PR #1026 (@franksong2702): Harden auto title generation for reasoning models - Raises title-gen completion budget to 512 tokens (reasoning-safe) - Retries once with 1024 tokens on empty content / finish_reason:length - Applies retry to both auxiliary and active-agent fallback routes - Preserves underlying failure reason in title_status on local fallback Co-authored-by: Frank Song <franksong2702@gmail.com> * feat: session attention indicators in right slot + last_message_at timestamps (#1024) PR #1024 (@franksong2702): Polish session attention indicators - Streaming spinners and unread dots now reuse the right-side actions slot - Running/unread rows hide timestamps; idle/read rows keep right-aligned timestamps - Date group carets point down when expanded, right when collapsed - Pinned group no longer repeats pinned-star icon per row - Running indicators appear immediately after send (local busy state while /api/sessions catches up) - Sidebar sorting/grouping/timestamps now prefer last_message_at (derived from last real message) so metadata-only saves don't make old sessions appear under Today Co-authored-by: Frank Song <franksong2702@gmail.com> * docs: v0.50.207 release notes — 10 PRs, 2169 tests (+36) --------- Co-authored-by: bergeouss <bergeouss@users.noreply.github.com> Co-authored-by: Josh <josh@fyul.link> Co-authored-by: Frank Song <franksong2702@gmail.com> Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.207	2026-04-25 13:07:35 -07:00
nesquena-hermes	12a8c051fb	fix: inject full workspace path into agent context for uploaded files (#997 ) fix: inject full workspace path into agent context for uploaded files (#997) Uploaded files (drag-and-drop or paperclip) were saved correctly to the workspace but the agent message only contained the bare filename — `photo.jpg` instead of the full path. The agent couldn't call `read_file` or `vision_analyze` without a full path. `uploadPendingFiles()` now returns `{name, path}` objects from `/api/upload` (`data.path` was always returned, just never threaded through). The agent message gets the full absolute path; all display surfaces (badges, session history, INFLIGHT state, POST body) continue showing only the bare filename. Three fixes absorbed during review: - Second `saveInflightState()` call was passing raw `{name,path}` objects instead of the `uploadedNames` string array (INFLIGHT localStorage corruption on page reload) - `attachLiveStream()` was being called with the raw object array; changed to pass `uploadedNames` so the `done` handler receives strings, not objects - `attachLiveStream` `done` handler referenced `uploadedNames` which is out of scope there (ReferenceError on every upload success); fixed to use the `uploaded` param Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Closes #996 v0.50.206	2026-04-24 23:09:44 -07:00
nesquena-hermes	44a6587e78	docs(architecture): document workspace path trust levels (#993 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 13:22:28 -07:00
nesquena-hermes	0a6f15d8d9	chore: v0.50.205 CHANGELOG Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.205	2026-04-24 13:04:44 -07:00
nesquena-hermes	2800ebdcff	fix(workspace): allow adding external paths not under home directory (#991 ) The workspace add endpoint used resolve_trusted_workspace() which blocks any path outside the user's home directory, the saved workspace list, or BOOT_DEFAULT_WORKSPACE. This created a circular dependency: to add /mnt/d/Projects you need it in the saved list, but to get it in the list you need to add it. Fix: introduce validate_workspace_to_add() used by /api/workspaces/add, which only blocks non-existent paths, non-directories, and known system roots. The stricter resolve_trusted_workspace() is still used for actual file operations within a workspace. Fixes #953. Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 13:04:36 -07:00
nesquena-hermes	3c457d178d	chore: v0.50.204 CHANGELOG Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.204	2026-04-24 12:54:13 -07:00
nesquena-hermes	c0019723d1	fix(docker): use /home/hermes/.hermes for HERMES_HOME in compose files (#989 ) Fixes container crash on startup (#967). The hermes-agent image drops privileges to a 'hermes' user via gosu; /root is mode 700 so mkdir fails under /root/.hermes. Changed to /home/hermes/.hermes throughout. Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 12:54:05 -07:00
nesquena-hermes	34329ad231	chore: v0.50.203 CHANGELOG (#964 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.203	2026-04-24 12:34:05 -07:00
Basit Mustafa	e62338d3a0	fix(queue): drain correct session queue after cross-session stream completion (#964 ) When a session finishes streaming while the user has switched to a different session, setBusy(false) was draining S.session.session_id (the currently viewed session) instead of the session that actually finished. Queued follow-up messages were silently dropped. Root cause: setBusy() has no context about which session triggered it. The activeSid closure variable inside attachLiveStream() knew the right session but was not propagated. Fix: add _queueDrainSid module global (null by default). Stream done and error handlers set it to activeSid immediately before calling setBusy(false). setBusy(false) reads and clears _queueDrainSid, falling back to S.session if it is unset (the common case where the user hasn't switched away). Handlers patched: done event, start-call error handler, stream_end/stream_stop reconnection fallback, and max-retry error exit. Co-authored with Claude Sonnet 4.6 / Anthropic.	2026-04-24 12:33:56 -07:00
nesquena-hermes	619646159c	chore: v0.50.202 CHANGELOG (#972 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.202	2026-04-24 12:33:25 -07:00
Basit Mustafa	a4b56642d9	perf(streaming): throttle inflight localStorage persist to prevent GC crash (#972 ) saveInflightState() is called from syncInflightAssistantMessage() on every token. It does localStorage.getItem + JSON.parse + mutate + JSON.stringify + localStorage.setItem on the full inflight state map. For a 5000-token response with a 10KB messages array this produces ~36MB of JSON churn per second. This O(response_length) work per token is the primary source of GC pressure that causes the renderer to crash (Chrome error codes 4/5). The 13.6-second RunTask we observed in perf traces is a direct consequence: accumulated rAF callbacks execute all at once after each multi-second GC pause. Fix: add _throttledPersist() which writes at most once every 2 seconds during token streaming. State transitions that matter for crash recovery (tool events, done, start) still call persistInflightState() directly, so at most 2s of in-flight progress is lost if the tab crashes mid-stream. The _persistTimer is cleared on 'done' so the final state is always flushed. Co-authored with Claude Sonnet 4.6 / Anthropic.	2026-04-24 12:33:16 -07:00
nesquena-hermes	32276c81d1	chore: v0.50.201 CHANGELOG Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 11:58:18 -07:00
nesquena-hermes	86b20d362f	fix(streaming): call clearTimeout at all _pendingRafHandle cleanup sites (#985 ) _scheduleRender() now uses setTimeout(→rAF) when within the 66ms throttle window, meaning _pendingRafHandle can hold a setTimeout ID (not a rAF ID). All 4 cleanup sites only called cancelAnimationFrame(), which is a no-op for timeout handles, leaving stale callbacks that could fire after stream end. Fix: call both clearTimeout() and cancelAnimationFrame() at each site. (clearTimeout is a no-op when called with a rAF handle, and vice versa.) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 11:57:48 -07:00
nesquena-hermes	8ce83b637c	chore: v0.50.200 CHANGELOG (#963 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.200	2026-04-24 11:49:23 -07:00
Basit Mustafa	6333a06524	perf(ui): cache renderMessages per session, skip O(n) rebuild on back-navigation (#963 ) renderMessages() tears down and rebuilds every message's DOM from scratch on every call — renderMd() (markdown parse), Prism highlight, and KaTeX per message, O(n) total. With large sessions the main thread blocks for 1-5 seconds on each call. A Chrome perf trace (78s, many open sessions) showed: - 9,373ms of GC across 34,049 GC events (sustained, not burst) - Peak 273 messages.js FunctionCalls/second - 4.7s, 3.5s, 3.2s main-thread blocks from repeated renderMessages invocations The render bottleneck is unaddressed by PR #959 (which improves the network/ parse leg of session switching, not the render leg). Fix: a session-keyed innerHTML cache. After a full rebuild, the rendered HTML is stored against the session_id + message count. When switching back to a session that was already rendered with the same count, the DOM is restored from cache (fast innerHTML set + re-highlight) instead of rebuilt from scratch. Guard: the cache is only used on cross-session navigation (sid !== current). In-session updates (new messages, edits, tool_complete, stream events) always get a full rebuild — no stale content is ever shown. Cache is capped at 30 sessions and evicts oldest-first to bound memory. Co-authored with Claude Sonnet 4.6 / Anthropic.	2026-04-24 11:49:14 -07:00
nesquena-hermes	5663fb147b	chore: v0.50.199 CHANGELOG (#966 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.199	2026-04-24 11:44:56 -07:00
Basit Mustafa	0217bf5cce	perf(streaming): throttle live render to ~15fps to prevent crash under GC pressure (#966 ) _scheduleRender() uses requestAnimationFrame to update the live assistant message during streaming. rAF fires at up to 60fps, but each DOM update takes 50-150ms on sessions with long histories — far exceeding the 16ms rAF budget. During GC pauses (which can run for hundreds of milliseconds), rAF callbacks accumulate. When the GC yields, the browser executes all queued callbacks sequentially in a single RunTask. A Chrome performance trace shows a 13.6-second RunTask containing 1,240 accumulated render callbacks — which causes the renderer to crash (Chrome error codes 4/5, ERR_EMPTY_RESPONSE / ERR_CONNECTION_RESET). Fix: track the last render timestamp and delay scheduling the next rAF until at least 66ms (15fps) have elapsed since the previous render. If within the 66ms window, use setTimeout to defer the rAF rather than skipping it — this batches token updates without dropping any content. The 66ms interval is conservative enough to prevent runaway accumulation while fast enough that streaming text still feels immediate. The _renderPending flag continues to prevent double-scheduling within each interval. Co-authored with Claude Sonnet 4.6 / Anthropic.	2026-04-24 11:44:47 -07:00
nesquena-hermes	da131b842d	chore: v0.50.198 CHANGELOG (hotfix) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.198	2026-04-24 11:41:41 -07:00
nesquena-hermes	ef72384217	fix: harden _accepts_gzip + update stale test assertions post-#959 (#981 ) Fixes introduced when absorbing PR #959 (fast conversation switching): - _accepts_gzip() now uses getattr() to tolerate _FakeHandler and any synthesised handler that lacks a .headers attribute (fixes 2 test failures in test_sprint46.py) - test_issue401: updated assertion to accept both minified and reformatted forms of the tool_calls fallback guard (PR reformatted the code) - test_regressions: updated activeStreamId assertion — PR refactored data.session references to S.session for direct state access Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 11:41:17 -07:00
nesquena-hermes	116a510ed3	i18n: add complete Traditional Chinese (zh-Hant) translations (#954 ) * i18n: add complete Traditional Chinese (zh-Hant) translations - Add 300+ zh-Hant translation entries covering all UI sections: onboarding, settings/Control Center, session actions, cron jobs, providers panel, workspace management, skills, profiles, todos, BTW - Fix existing zh-Hant translations: remove mixed Simplified Chinese characters, fix typos (e.g. 皮膚→佈景, 待踩→待辦, 新存對話→新對話) - Update zh locale: fix 需要审批→需要审核 (Simplified Chinese correction) - Add data-i18n attributes to Control Center HTML (index.html) for heading, subtitle, tab names, dropdown, and section titles - Migrate session action menu (sessions.js) from hardcoded English to t() function calls for full i18n support * fix: translate remaining English entries to Traditional Chinese in zh-Hant locale - settings_heading_title: 'Control Center' → '控制中心' - settings_dropdown_providers: 'Providers' → '供應商' - providers_section_title: 'Providers' → '供應商' - providers_tab_title: 'Providers' → '供應商' * fix: add missing locale keys to zh/ru/es/de + restore zh approval_heading - zh (Simplified): reverted approval_heading to 需要审批 (matches master) PR had changed it to 需要审核 which broke the representative-translation test - zh/ru/es/de: added 39 new session management + settings keys as English fallback strings (session_archive, session_pin, settings_dropdown_*, etc.) These keys were added to English in this PR but missing from other locales - es: added cmd_status (English fallback) to fix coverage gap - Fixes all locale coverage test failures --------- Co-authored-by: 陳俊宇 <chenjunyu@chenjunyudeMacBook-Air-7.local> Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.197	2026-04-24 11:36:41 -07:00
nesquena-hermes	ed24010e10	chore: v0.50.197 CHANGELOG (#954 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 11:36:13 -07:00
nesquena-hermes	23b7c63198	chore: v0.50.196 CHANGELOG (#959 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.196	2026-04-24 11:35:23 -07:00
Josh Jameson	7e17ec497c	fix: fast conversation switching with metadata-first load (#959 ) - Backend: save session JSON with metadata fields before messages array so load_metadata_only() reads only ~1KB without parsing the full session - Backend: add GET /api/session?messages=0 for metadata-only responses (~1KB vs ~400KB), enabling instant sidebar switching - Backend: add POST /api/admin/reload to hot-reload models without restart - Backend: gzip compress JSON API responses (>1KB) for 70-80% bandwidth reduction - Frontend: show Loading indicator immediately on session switch, replacing old DOM before API call to prevent stale content flash - Frontend: clear S.messages before API call so _ensureMessagesLoaded always fetches fresh data for the target session - Frontend: wrap both Phase 1 (messages=0) and Phase 2 (_ensureMessagesLoaded) in try/catch to prevent permanently stuck loading state on network/server errors	2026-04-24 11:35:14 -07:00
nesquena-hermes	2d5c4b71cc	chore: v0.50.195 CHANGELOG (#962 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.195	2026-04-24 11:21:50 -07:00
Basit Mustafa	4a882bec66	fix(auth): persist sessions across restarts via STATE_DIR/.sessions.json (#962 ) _sessions is an in-memory dict, so every process restart (launchd bounce, systemd restart, container recycle) invalidates all active browser sessions. Users get 401 on every authenticated endpoint until they clear cookies. The HMAC signing key already persists to STATE_DIR/.signing_key via atomic owner-only write. This PR applies the same pattern to the session table: - _load_sessions(): reads .sessions.json on module import, prunes expired entries, tolerates missing/malformed files (returns {} on any error) - _save_sessions(): atomic write via tempfile + os.replace(), chmod 0600, mirrors .signing_key write pattern exactly - create_session(): saves after inserting new token - invalidate_session(): saves after removing token (only if token existed) - _prune_expired_sessions(): saves only when entries are actually removed Cookie format and signing are unchanged; existing sessions survive upgrade. 6 regression tests cover: restart survival, invalidation persistence, expiry pruning on load, 0600 permissions, corrupt-file tolerance. Co-authored with Claude Sonnet 4.6 / Anthropic.	2026-04-24 11:21:41 -07:00
nesquena-hermes	f48b157a8f	chore: v0.50.194 CHANGELOG (#960 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.194	2026-04-24 11:04:42 -07:00
bsgdigital	a2d7f311be	fix(streaming): prevent dropped characters in incremental smd path (#960 ) Detect prefix desync between current display text and already-streamed text, then rebuild the streaming-markdown parser from full content to avoid character loss during live rendering. Add regression assertions for the new desync guard. Made-with: Cursor Co-authored-by: bsgdigital <bsg@bsgdigital.com>	2026-04-24 11:04:32 -07:00
nesquena-hermes	c06ec43f17	chore: v0.50.193 CHANGELOG (#958 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.193	2026-04-24 11:04:26 -07:00
bsgdigital	e5cf9c5910	fix(streaming): strip malformed DSML function_calls tags (#958 ) Handle DeepSeek DSML variants including truncated and spaced tag forms, and sanitize thinking-card text so leaked XML fragments never render. Add regression tests for DSML edge cases and thinking-card sanitization. Made-with: Cursor Co-authored-by: bsgdigital <bsg@bsgdigital.com>	2026-04-24 11:04:16 -07:00
nesquena-hermes	70de09290c	chore: v0.50.192 CHANGELOG (#951 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.192	2026-04-24 11:04:09 -07:00
ruxme	f109592cb0	perf: add defer to all local script tags (#951 ) All 10 local <script> tags now use the defer attribute, allowing the browser to download them in parallel during HTML parsing instead of blocking the DOM sequentially. Execution order is preserved. Before: scripts loaded one-at-a-time, each blocking DOM construction After: scripts downloaded in parallel, executed in order after DOM ready Fixes slow sidebar session list rendering on initial page load. Co-authored-by: 陳俊宇 <chenjunyu@chenjunyudeMacBook-Air-7.local>	2026-04-24 11:03:59 -07:00
nesquena-hermes	d339200b5b	chore: v0.50.191 CHANGELOG (#948 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.191	2026-04-24 11:03:52 -07:00
starship-s	0a91e3cb02	fix: identify WebUI sessions as webui platform (#948 ) * fix: use webui platform for webui sessions * test: harden WebUI platform hint regression coverage	2026-04-24 11:03:42 -07:00
nesquena-hermes	cb41075bd2	chore: v0.50.190 CHANGELOG (.venv #949 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.190	2026-04-24 10:45:33 -07:00
xingyue	91703e3e54	fix(config): add .venv discovery paths in _discover_python (#949 )	2026-04-24 10:45:23 -07:00
nesquena-hermes	396537c624	chore: v0.50.189 CHANGELOG (#961 csp) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 10:45:09 -07:00
Basit Mustafa	b072a6887c	fix(csp): add explicit manifest-src 'self' directive (#961 ) PR #920 added static/manifest.json and sw.js for PWA support. The CSP in _security_headers() had no explicit manifest-src directive, so browsers fell back to default-src 'self' and emitted a console warning on every page load. The fallback is functionally correct but non-compliant with CSP Level 3 best practice of declaring each directive explicitly. Adds manifest-src 'self' before base-uri. No origin set is changed. Regression test added alongside existing CSP coverage in test_pwa_manifest_csp.py. Co-authored with Claude Sonnet 4.6 / Anthropic.	2026-04-24 10:44:46 -07:00
nesquena-hermes	27e69c404a	chore: v0.50.189 CHANGELOG (csp #961 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.189	2026-04-24 10:44:34 -07:00
nesquena-hermes	dbc9c910a8	chore: v0.50.188 CHANGELOG (btw fix #950 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.188	2026-04-24 10:44:10 -07:00
bergeouss	23e9070fc5	fix(btw): use correct SSE endpoint /api/chat/stream (#950 ) The /btw command was completely non-functional because attachBtwStream() connected to /api/stream which doesn't exist — the server SSE handler lives at /api/chat/stream. This caused an immediate 404 on every /btw request. Closes #945 Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-24 10:43:44 -07:00
nesquena-hermes	e0257d81d5	chore: v0.50.187 CHANGELOG entry for breakpoint fix (#956 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.187	2026-04-24 09:13:35 -07:00
nesquena-hermes	533edbcae0	fix(ui): close 641-767px rail/hamburger breakpoint gap (#956 ) At 641-767px the sidebar was in a no-mans-land: hamburger hidden (<=640 only) and rail also hidden (>=768 only). Users could still navigate via the sidebar-nav tabs inside the sidebar, but the rail was absent unnecessarily. Changing the rail breakpoint from min-width:768px to min-width:641px closes the gap. The sidebar slide-in behavior (position:fixed, hamburger toggle) stays at <=640px only, so the mobile UX is unchanged. At 641-767px the rail now appears alongside the persistent sidebar. Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-24 09:13:00 -07:00
nesquena-hermes	885f1fa349	chore: v0.50.186 CHANGELOG entry for three-column layout (#899 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.186	2026-04-24 09:06:50 -07:00
Aron Prins	970bc1d3fd	refactor(ui): three-column layout with left rail + main-view migration (#899 ) refactor(ui): three-column layout with left rail + main-view migration (#899) Unifies the shell into a three-column layout (rail + sidebar + main) matching the hermes-desktop reference, and migrates every per-item detail/edit surface into a shared main-view canvas with consistent headers, empty states, and action buttons. Changes: - New desktop-only left rail (48px) with 8 nav tabs (chat/tasks/skills/memory/workspaces/profiles/todos/settings) - Persistent app titlebar (replaces per-chat topbar), active conversation title shown - All panel detail/create/edit views migrated to #mainSkills, #mainTasks, #mainSettings, #mainWorkspaces, #mainProfiles, #mainMemory - Settings moved out of modal into main-view page; ESC closes it - YAML frontmatter rendered in collapsible <details> block in skill detail - Toasts repositioned from bottom-center to top-right with theme-aware success/error/warning/info variants - Composer workspace chip split into two-button group: files-icon toggles file panel, label opens workspace picker - .settings-menu → .side-menu / .side-menu-item (generalised, shared by memory and settings panels) - i18n: ~25 new keys across en/ru/es/de/zh/zh-Hant for all new form labels, placeholders, and empty states - Mobile: hamburger in titlebar, slide-in sidebar; box-shadow removed from sidebar - New regression test: tests/test_settings_navigation_and_detail_refresh.py (9 tests) Co-authored-by: Aron Prins <pwf.aron@gmail.com>	2026-04-24 09:05:25 -07:00
nesquena-hermes	061af78cde	v0.50.185: /btw stream hardening + .venv bootstrap + /reasoning toast (#935 #939 #941 #942 ) * fix(bootstrap): discover .venv layout in agent_dir (closes #938) (#941) * fix(btw): harden _streamDone flag — defensive ordering + session guard + stream_end coverage (#935) * fix(btw): align /reasoning toast prefix with BRAIN const (#939) * docs: v0.50.185 release notes, update test counts to 2107 --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.185	2026-04-23 23:25:45 -07:00
nesquena-hermes	87d4136a43	fix(ui): move reasoning chip after model chip in composer footer (#937 ) Reasoning is a sub-setting of the model (applies only to models that support it), so the model should come first. This also keeps the model chip in a stable position regardless of whether reasoning is active. Order was: Profile → Workspace → Reasoning → Model Order now: Profile → Workspace → Model → Reasoning Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com>	2026-04-23 19:43:41 -07:00
nesquena-hermes	ce9aec1640	chore: v0.50.184 release notes (#936 ) Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> v0.50.184	2026-04-23 19:38:48 -07:00
nesquena-hermes	1a9dba7844	fix: reasoning chip dropdown visible + monochrome SVG icon + /btw answer preserved (closes #933 ) (#934 ) * fix: reasoning chip dropdown visible + SVG icon + /btw answer no longer wiped (closes #933) * fix(ui): resize handler symmetry + lock regressions for PR #934 fixes Two small additions on top of the core PR: 1. Resize handler now re-positions the reasoning dropdown when the window resizes while it's open, matching the existing model-dropdown branch. Without this, resizing while the dropdown is open leaves it aligned to the pre-resize chip position — fine in practice (most resizes close the dropdown via the global click handler) but inconsistent with the model-dropdown sibling. 2. Regression test file tests/test_reasoning_chip_btw_fixes.py with 10 tests locking all four fixes in place so they can't silently regress: - Dropdown sits OUTSIDE .composer-left (so overflow-y: hidden can't clip it) - Dropdown is grouped with the other composer-level dropdowns - Chip button contains stroke="currentColor" SVG (not a 🧠 emoji) - _applyReasoningChip() body doesn't include 🧠 - cmdReasoning calls _applyReasoningChip(eff) directly with the server-confirmed effort, not syncReasoningChip() (stale cache) - _streamDone flag declared, set in done handler, checked in onerror - _ensureBtwRow() called in done handler (creates bubble when no tokens arrive) - resize handler re-positions composerReasoningDropdown Full suite: 2056 passed, 0 failed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: nesquena-hermes <nesquena-hermes@users.noreply.github.com> Co-authored-by: Nathan Esquenazi <nesquena@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 19:18:51 -07:00

1 2 3 4 5 ...

717 Commits