hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-21 03:39:54 +00:00

Files

T

Teknium 775c0e22cf perf(models_dev): cache-first lookup, skip network when disk cache is fresh (#22808 )

`fetch_models_dev()` is on the hot path of every `AIAgent.__init__`
(via `context_compressor → get_model_context_length`). The previous
policy was "always try network first, only fall back to disk if
network fails," so every fresh `hermes chat` / `hermes gateway` /
batch / cron process paid 250-500 ms re-fetching a 2 MB JSON registry
that was already on disk from earlier runs.

Add a stage 2 between in-mem and network: if
`models_dev_cache.json` exists and its mtime is younger than the
existing `_MODELS_DEV_CACHE_TTL` (1 hour, same TTL the in-mem cache
already uses), load from disk and skip the network call.

The in-mem TTL is anchored to the disk file's age, so a 50-min-old
cache stays in-memory for only 10 more minutes — no surprise
extension of staleness window.

Invariants preserved:
- `force_refresh=True` still always hits the network and only falls
  back to disk on failure (`hermes config refresh` semantics).
- Missing disk cache → fall through to network (first-ever run).
- Stale disk cache (mtime > TTL) → fall through to network.
- Negative file age (clock skew) → fall through to network.
- Network failure → existing stage-4 stale-disk fallback unchanged.

Measured impact (3-run medians, 9950X3D, fresh process per run):
  fetch_models_dev cold:  256 → 17 ms  (-93%)
  hermes chat -q wall:   4.00 → 3.73 s (-7% median)
                         3.99 → 3.60 s (-10% min)

The chat-end-to-end win is bounded below by API latency variance, but
the fetch_models_dev microbenchmark is the cleanest signal: 239 ms
shaved off every fresh-process agent construction.

Win compounds with the previous perf PRs:
  #22681 google_chat lazy-load
  #22766 doctor parallel + IMDS off
  #22790 gateway.platforms PEP 562

Tests: all 30 `tests/agent/test_models_dev.py` pass (added 4 new ones
covering the new disk-cache-first path, force_refresh override, stale
disk fallback, and missing-disk-cache fall-through). Full `tests/agent/`
suite: 2560 passed, 0 failed.

2026-05-09 13:32:38 -07:00

transports

feat(transports/codex): pass reasoning.effort to xAI Responses API

2026-05-09 13:23:02 -07:00

__init__.py

test: add unit tests for 8 modules (batch 2)

2026-02-26 13:54:20 +03:00

test_anthropic_adapter.py

fix: avoid unsupported anthropic context beta by default

2026-05-07 05:43:20 -07:00

test_anthropic_keychain.py

fix: re-auth on stale OAuth token; read Claude Code credentials from macOS Keychain

2026-04-24 07:14:00 -07:00

test_arcee_trinity_overrides.py

test(arcee): cover Trinity Large Thinking temperature + compression overrides

2026-05-05 17:23:45 -07:00

test_auxiliary_client_anthropic_custom.py

fix(anthropic): complete third-party Anthropic-compatible provider support (#12846 )

2026-04-19 22:43:09 -07:00

test_auxiliary_client.py

fix(auxiliary): enforce Codex Responses stream timeout

2026-05-07 06:21:50 -07:00

test_auxiliary_config_bridge.py

fix(tests): pin UTF-8 encoding when reading source files on Windows

2026-05-09 02:47:28 -07:00

test_auxiliary_main_first.py

fix(copilot): send vision header for Copilot vision requests

2026-04-27 08:35:50 -07:00

test_auxiliary_named_custom_providers.py

fix(fallback): let custom_providers shadow built-in aliases

2026-04-30 20:18:44 -07:00

test_auxiliary_transport_autodetect.py

fix(auxiliary): auto-detect Anthropic Messages transport for all aux clients (#17027 )

2026-04-28 06:50:14 -07:00

test_bedrock_1m_context.py

test: remove 50 stale/broken tests to unblock CI (#22098 )

2026-05-08 14:55:40 -07:00

test_bedrock_adapter.py

fix(bedrock): preserve reasoningContent across converse normalization

2026-05-07 05:17:16 -07:00

test_bedrock_integration.py

fix(agent): handle aws_sdk auth type in resolve_provider_client

2026-04-24 07:26:07 -07:00

test_codex_cloudflare_headers.py

fix(aux): remove hardcoded Codex fallback model, drop Codex from auto chain (#17765 )

2026-04-29 23:23:50 -07:00

test_compress_focus.py

fix: resolve CI test failures — add missing functions, fix stale tests (#9483 )

2026-04-14 01:43:45 -07:00

test_compressor_image_tokens.py

feat(image-input): native multimodal routing based on model vision capability (#16506 )

2026-04-27 06:27:59 -07:00

test_context_compressor_summary_continuity.py

fix(compression): preserve iterative summary continuity

2026-05-05 04:42:44 -07:00

test_context_compressor.py

fix(context): handle JSON decode errors in compression — salvage of #22248 (#22416 )

2026-05-09 01:47:15 -07:00

test_context_engine.py

feat: wire context engine plugin slot into agent and plugin system

2026-04-10 19:15:50 -07:00

test_context_references.py

fix(agent): fall back when rg is blocked for @folder references

2026-04-20 01:56:41 -07:00

test_copilot_acp_client.py

fix(ci): recover 38 failing tests on main (#17642 )

2026-04-29 20:05:32 -07:00

test_credential_pool_routing.py

refactor: remove smart_model_routing feature (#12732 )

2026-04-19 18:12:55 -07:00

test_credential_pool.py

fix(auth): shorten credential 401 cooldown

2026-05-07 06:15:33 -07:00

test_crossloop_client_cache.py

refactor(tests): re-architect tests + fix CI failures (#5946 )

2026-04-07 17:19:07 -07:00

test_curator_activity.py

fix: use skill activity in curator status

2026-04-30 10:31:47 -07:00

test_curator_backup.py

fix(curator): authoritative absorbed_into on delete + restore cron skill links on rollback (#18671 ) (#18731 )

2026-05-02 01:29:57 -07:00

test_curator_classification.py

fix(curator): prevent false-positive consolidation from substring matching

2026-05-04 01:21:23 -07:00

test_curator_reports.py

fix(curator): rewrite cron job skill refs after consolidation (#18253 )

2026-04-30 23:04:50 -07:00

test_curator.py

fix(skills): keep manual skills out of curator

2026-05-04 02:19:28 -07:00

test_deepseek_anthropic_thinking.py

test(anthropic): regression guard for DeepSeek /anthropic thinking replay

2026-04-29 08:10:29 -07:00

test_direct_provider_url_detection.py

fix: restrict provider URL detection to exact hostname matches

2026-04-20 22:14:29 -07:00

test_display_emoji.py

feat(tools): centralize tool emoji metadata in registry + skin integration

2026-03-15 20:21:21 -07:00

test_display.py

fix(cli): honor positive tool preview length

2026-05-07 05:26:28 -07:00

test_error_classifier.py

fix(tool-schemas): reactive strip of pattern/format on llama.cpp grammar 400s

2026-05-05 04:25:18 -07:00

test_external_skills_dirs_cache.py

perf(cli): cut ~19s from 'hermes' cold start (skills cache + lazy Feishu + no Nous HTTP) (#22138 )

2026-05-08 16:39:32 -07:00

test_external_skills.py

feat(skills): support external skill directories via config (#3678 )

2026-03-29 00:33:30 -07:00

test_gemini_cloudcode.py

fix(gemini): assign unique stream indices to parallel tool calls

2026-04-20 02:10:53 -07:00

test_gemini_fast_fallback.py

Prefer fallback for Gemini CloudCode rate limits

2026-05-05 10:14:48 -07:00

test_gemini_free_tier_gate.py

feat(gemini): block free-tier keys at setup + surface guidance on 429 (#15100 )

2026-04-24 04:46:17 -07:00

test_gemini_native_adapter.py

fix(gemini): fail fast on missing API key + surface it in hermes dump (#15133 )

2026-04-24 05:35:17 -07:00

test_gemini_schema.py

fix(gemini): drop integer/number/boolean enums from tool schemas (#15082 )

2026-04-24 03:40:00 -07:00

test_i18n.py

fix: add Turkish locale references in config, tests, and docs

2026-05-05 17:29:12 -07:00

test_image_gen_registry.py

feat(plugins): pluggable image_gen backends + OpenAI provider (#13799 )

2026-04-21 21:30:10 -07:00

test_image_routing.py

fix(image-routing): sniff magic bytes for image MIME, ignore misleading suffix

2026-05-07 05:58:11 -07:00

test_insights.py

test: stop testing mutable data — convert change-detectors to invariants (#13363 )

2026-04-20 23:20:33 -07:00

test_kimi_coding_anthropic_thinking.py

fix(anthropic): broaden Kimi thinking-suppression to custom endpoints (#17455 )

2026-04-29 06:35:42 -07:00

test_local_stream_timeout.py

fix(agent): recognize Tailscale CGNAT (100.64.0.0/10) as local for Ollama timeouts

2026-04-22 14:46:10 -07:00

test_memory_provider.py

fix(memory): add write origin metadata

2026-04-24 14:37:55 -07:00

test_memory_session_switch.py

feat(hindsight): probe API for update_mode='append' support, dedupe across processes

2026-05-05 15:09:59 -07:00

test_memory_user_id.py

feat(hindsight): richer session-scoped retain metadata

2026-04-22 05:27:10 -07:00

test_minimax_auxiliary_url.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983 )

2026-04-07 22:23:28 -07:00

test_minimax_provider.py

feat: provider modules — ProviderProfile ABC, 33 providers, fetch_models, transport single-path

2026-05-05 13:40:01 -07:00

test_model_metadata_local_ctx.py

fix(tui): show correct context length

2026-04-28 12:27:36 -07:00

test_model_metadata_ssl.py

fix(auth): honor SSL CA env vars across httpx + requests callsites

2026-04-24 03:00:33 -07:00

test_model_metadata.py

feat(computer-use): cua-driver backend, universal any-model schema

2026-05-08 11:07:38 -07:00

test_models_dev.py

perf(models_dev): cache-first lookup, skip network when disk cache is fresh (#22808 )

2026-05-09 13:32:38 -07:00

test_moonshot_schema.py

fix(moonshot): also strip nullable/enum after anyOf collapse

2026-04-30 23:14:31 -07:00

test_nous_rate_guard.py

fix(nous): don't trip cross-session rate breaker on upstream-capacity 429s (#15898 )

2026-04-26 04:53:42 -07:00

test_onboarding.py

docs(onboarding): lead OpenClaw residue banner with migrate, warn that cleanup breaks OpenClaw (#17507 )

2026-04-29 08:08:36 -07:00

test_openrouter_response_cache.py

fix(openrouter): use canonical X-Title attribution header

2026-05-05 10:13:34 -07:00

test_prompt_builder.py

fix(webui): add platform hint for MEDIA rendering

2026-05-09 02:22:40 -07:00

test_prompt_caching.py

fix(prompt-caching): skip top-level cache_control on role:tool for OpenRouter

2026-03-21 16:54:43 -07:00

test_proxy_and_url_validation.py

fix(agent): normalize socks:// env proxies for httpx/anthropic

2026-04-21 05:52:46 -07:00

test_rate_limit_tracker.py

feat: capture provider rate limit headers and show in /usage (#6541 )

2026-04-09 03:43:14 -07:00

test_redact.py

feat: replace kimi-k2.5 with kimi-k2.6 on OpenRouter and Nous Portal (#13148 )

2026-04-20 11:49:54 -07:00

test_shell_hooks_consent.py

fix(shell_hooks): parse hooks_auto_accept as strict bool/string, not bool() (#16322 )

2026-04-26 20:48:35 -07:00

test_shell_hooks.py

feat: shell hooks — wire shell scripts as Hermes hook callbacks

2026-04-20 20:53:51 -07:00

test_skill_commands_reload.py

refactor(reload-skills): queue note for next turn, drop cache invalidation + agent tool

2026-04-29 21:07:47 -07:00

test_skill_commands.py

test(skills): cover additional rescan paths in skill_commands cache (#14536 )

2026-05-07 04:59:43 -07:00

test_skill_utils.py

test(skill_utils): add regression tests for non-dict metadata in extract_skill_conditions

2026-04-30 20:37:15 -07:00

test_streaming_context_scrubber.py

style: trim verbose comment blocks added by previous commit

2026-04-27 12:37:33 -07:00

test_subagent_progress.py

feat(delegate): orchestrator role and configurable spawn depth (default flat)

2026-04-21 14:23:45 -07:00

test_subagent_stop_hook.py

feat: shell hooks — wire shell scripts as Hermes hook callbacks

2026-04-20 20:53:51 -07:00

test_subdirectory_hints.py

fix(agent): catch PermissionError in subdirectory hint discovery

2026-04-09 03:10:30 -07:00

test_think_scrubber.py

fix(agent): stateful streaming scrubber for reasoning-block leaks (#17924 ) (#20184 )

2026-05-05 04:33:38 -07:00

test_title_generator.py

fix: improve telegram topic mode setup

2026-05-04 12:07:17 -07:00

test_tool_guardrails.py

fix(agent): make tool loop guardrails warning-first

2026-04-30 20:43:15 -07:00

test_unsupported_parameter_retry.py

test: remove 50 stale/broken tests to unblock CI (#22098 )

2026-05-08 14:55:40 -07:00

test_unsupported_temperature_retry.py

refactor(memory): remove flush_memories entirely (#15696 )

2026-04-25 08:21:14 -07:00

test_usage_pricing.py

fix(usage): read top-level Anthropic cache fields from OAI-compatible proxies

2026-04-22 17:40:49 -07:00

test_vision_resolved_args.py

fix(vision): preserve explicit provider auth with custom base_url

2026-05-04 05:05:43 -07:00