mirror of
https://github.com/nesquena/hermes-webui.git
synced 2026-05-24 18:50:15 +00:00
040cb8af70
SHOULD-FIX: rate-limit _repair_stale_pending repair-firing telemetry. Switch from unconditional logger.warning to age-keyed: WARNING when pending_age < 5min (the diagnostically valuable race window — actual leak-path candidates that slipped past the grace guard) and DEBUG for the long-tail (orphaned sidecars from prior process lifetimes). Prevents reconnect loops on stuck sessions from flooding the log while preserving the diagnostic signal we want for tuning _REPAIR_STALE_PENDING_GRACE_SECONDS empirically. NIT: _LOCAL_SERVER_PROVIDERS expanded with lm-studio (hyphenated alias used in some custom_providers configs and already recognized at api/config.py:2189 for SSRF host trust) and localai (LocalAI project). Test parametrize expanded from 7 to 11 names, also covering pre-existing koboldcpp and textgen for symmetry. +4 regression tests. NIT (docs): CHANGELOG callout for the RFC1918 behavior change. Internal- network OpenAI-compatible proxies now preserve the model prefix on private-IP base_urls. Documented the migration path: configure as a custom_providers entry to bypass the local-server detection. NIT (deferred, optional): narrowing the heuristic to is_loopback only is left as future work; the broader scope was an explicit goal in the bug body and Opus flagged it as SHOULD-DISCUSS-but-not-block. 4184 -> 4188 passing. 0 regressions. ~10 LOC absorbed total.