hermes-agent

Author	SHA1	Message	Date
Teknium	87ae4ae94b	fix(update): harden #57659 follow-ups — task restore on failure, --force-venv split, trampoline detection, managed-install health (#57680 ) Five follow-ups to #57659 from post-merge review: 1. install.ps1: gateway scheduled-task re-enable now runs in a finally (a thrown Remove-Item/uv venv failure previously stranded the user's gateway autostart disabled), and tasks that were already disabled before the install are no longer blindly re-enabled. 2. The venv-python holder guard is no longer bypassed by plain --force (which the desktop bootstrap passes on every update while its lock probe only checks hermes.exe/app.asar). New explicit --force-venv is the escape hatch; --force keeps bypassing only the hermes.exe shim guard. 3. _detect_venv_python_processes now also catches uv/base-interpreter trampolines whose exe is outside the venv, via cmdline (venv path or '-m hermes_cli.main' tied to this install root) and cwd. 4. Missing venv python is now UNHEALTHY on managed installs (.hermes-bootstrap-complete / .update-incomplete markers) so the repair lane runs instead of 'Already up to date!'; the repair branch recreates the venv first when it's gone entirely. Dev checkouts keep reporting healthy. 5. install.ps1 comment no longer claims a Startup-folder disarm the code doesn't perform (logon-only, not a mid-install respawner).	2026-07-03 04:08:37 -07:00
LeonSGP43	f6a3d2e900	fix(model): preserve named custom provider slug	2026-07-03 03:33:06 -07:00
teknium1	7485fe0605	fix(dashboard): make .env sensitive-file guard case-insensitive Follow-up to #57507: .ENV / .Env.local on case-insensitive filesystem mounts slipped past the guard. Lowercase the name before matching and add a regression test. Addresses egilewski's open review note.	2026-07-03 03:27:47 -07:00
liuhao1024	1bcc52c14e	fix(dashboard): use pattern match for .env sensitive file guard Replace the exact-filename frozenset with _is_sensitive_filename() that matches .env plus any .env.<suffix> variant. This covers shorthand suffixes like .env.prod that the previous enumeration missed. Add test_sensitive_env_suffix_variants_blocked regression test covering .env.prod, .env.dev, .env.staging.local, and .env.ci. Addresses review feedback from egilewski on PR #57507.	2026-07-03 03:27:47 -07:00
liuhao1024	bc55c201c7	fix(dashboard): block .env files from managed-files API The dashboard Files tab could list, read, and download .env files containing API keys when running with a bind-mounted Hermes home directory (e.g. docker run -v ~/.hermes:/opt/data). Add _SENSITIVE_FILENAMES frozenset and filter these from list_managed_files(), read_managed_file(), and download_managed_file(). Return 403 for direct read/download attempts on sensitive files. Fixes #57505	2026-07-03 03:27:47 -07:00
Teknium	b14d75f8af	fix(update): prevent and self-heal half-updated venvs on Windows (#57659 ) Root-causes the July 2026 Windows incident chain (locked _brotlicffi.pyd / _sodium.pyd during install, then 'No module named annotated_doc' with 'hermes update' insisting 'Already up to date!'): - hermes update: probe venv core imports even when the checkout is current; a half-updated venv (dep sync killed mid-flight by a locked .pyd) is now detected and repaired instead of being reported as up to date - hermes update (Windows): after pausing gateways, refuse to mutate the venv while other processes run from the venv interpreter (the Desktop backend runs as python.exe so the hermes.exe shim guard never saw it); --force keeps the old behavior - install.ps1 venv stage: disarm gateway autostart Scheduled Tasks before the kill sweep (they respawn the gateway inside the kill->delete window), make the sweep a bounded loop requiring 3 clean passes, and rename-then- delete the old venv (a rename succeeds even with mapped DLLs) with stale- dir cleanup on the next run - desktop updater: 'venv shim still locked after 15s' now ABORTS the update hand-off (restarting our backend, surfacing the holder to the user) instead of 'proceeding anyway (force)' into guaranteed venv corruption; the unlock wait also re-kills respawned backends each poll tick	2026-07-03 03:24:08 -07:00
Teknium	c7103c637c	feat(desktop): CLI/dashboard parity — skills hub, MCP test/toggle/catalog, maintenance ops, log filters (#57441 ) * feat(desktop): CLI/dashboard parity — skills hub browser, MCP test/toggle/catalog, maintenance ops, log filters Brings desktop GUI to parity with hermes skills/mcp/doctor/backup/debug-share/ curator/memory CLI commands and the dashboard's System + Skills-hub pages: - Skills page: new Browse Hub tab (search official/GitHub/community sources, preview SKILL.md, security scan verdicts, install/update with live action log) - MCP settings: connection test (tool listing), per-server enable/disable toggle, and a Catalog tab installing Nous-approved MCP servers with env prompts - Command Center: new Maintenance section (doctor, security audit, backup, debug share links, curator status/pause/run, memory file status + reset) - Command Center system logs: file (agent/errors/gateway/desktop), level, and substring filters instead of a fixed agent.log tail - hermes.ts API client + types for all the above; en/zh locale strings (ja and zh-hant inherit via defineLocale) * feat(desktop): backend model catalogs in toolset config — hermes tools parity Completes the `hermes tools` parity gap: after picking an image/video generation backend the CLI runs a model picker (e.g. FAL's multi-model catalog with speed/strengths/price); the desktop toolset drawer now has the same flow as a radio-card list. - web_server: GET /api/tools/toolsets/{name}/models (catalog + current + default for the active or named provider row) and PUT .../model (validated write to image_gen.model / video_gen.model), reusing the CLI's plugin catalog helpers so GUI and `hermes tools` stay in lockstep - desktop: ModelCatalogPicker in ToolsetConfigPanel — per-model cards with speed/strengths/price, in-use + default badges, disabled until the backend is the active one; provider selection now mirrors is_active locally so the catalog unlocks without a refetch - tests: 3 backend endpoint tests (catalog shape invariants, persist + validation), 2 component tests, 2 API-contract tests; en/zh strings	2026-07-03 01:02:47 -07:00
Teknium	6eb39c2bbe	fix(opencode-go): heal stripped /v1 base_url so non-minimax models stop 404ing (#57585 ) OpenCode Go serves minimax/qwen via Anthropic Messages (base URL without /v1 — the SDK appends /v1/messages) and glm/kimi/deepseek/mimo via OpenAI chat completions (base URL WITH /v1). The runtime stripped /v1 for anthropic-routed models, and the TUI/desktop + gateway persisted that stripped URL to model.base_url. Every later chat_completions model then POSTed to https://opencode.ai/zen/go/chat/completions — a 404 (the marketing site). Result: only minimax worked; glm/deepseek/kimi all 404ed. - New normalize_opencode_base_url(): symmetric /v1 normalization — strip for anthropic_messages, re-append for chat_completions / codex_responses on opencode.ai hosts (heals persisted stripped URLs; custom proxy overrides untouched) - Applied at all three former one-way strip sites (resolve_runtime_provider x2, switch_model) - opencode_model_api_mode: all Qwen models on Go AND Zen now route via /v1/messages per current published endpoint tables (previously only qwen3.7-max on Go — qwen3.6-plus etc. would 404 the same way) - Catalog refresh: Go gains deepseek-v4-pro/flash, glm-5.2, kimi-k2.7-code, minimax-m3, qwen3.7-plus; Zen gains glm-5.2, kimi-k2.7-code, minimax-m3, qwen3.7-plus Reported by IndieSuperhuman on X: opencode-go 404s for any model other than minimax.	2026-07-03 00:46:45 -07:00
Teknium	372f8195c7	fix(moa): default temperatures to unset — provider default, like single-model agents (#57440 ) A single-model Hermes agent never sends temperature; the provider default applies. MoA hardcoded reference_temperature=0.6 / aggregator_temperature=0.4, and the coercion float(preset.get(key, 0.6) or 0.6) made unset IMPOSSIBLE to express: absent, null, empty, and even an explicit 0 all collapsed to the baked-in default. Every MoA advisor and aggregator therefore ran at 0.6/0.4 while the same model running solo used the provider default — silently skewing solo-vs-MoA comparisons and overriding provider-tuned defaults. - moa_config normalization: temperatures coerce to None when absent/blank/ invalid (new _coerce_float_or_none); explicit values incl. 0 honored. - moa_loop: _preset_temperature() resolves preset values; None flows to call_llm, which already omits the parameter when None (same contract as max_tokens). Aggregator still inherits the acting agent's own configured temperature when the preset doesn't pin one. - conversation_loop (context-mode MoA): same resolution, no more hardcoded 0.6/0.4 at the call site. - DEFAULT_CONFIG preset + web_server payload models + docs updated: unset is the default, pinning stays available.	2026-07-03 00:22:49 -07:00
Brooklyn Nicholson	89acc19606	fix(dump): flag API keys visible only to the shell, not the managed backend hermes debug share reads os.getenv — the invoking terminal's environment — but launchd/systemd and the desktop-spawned `serve` backend load credentials from ~/.hermes/.env, not the login shell. A key exported in the shell but absent from .env is invisible to the backend, yet the dump printed a bare "set", sending support down a phantom "the key is configured" path. This was the actual trap behind a "Desktop has no web_search / no tools" report: FIRECRAWL_API_KEY was a shell export (so `debug share` in a terminal read "firecrawl set") but not in .env, so the launchd backend's check_web_api_key returned False and web_search was gated off — which a contributor then misdiagnosed as a missing `desktop` platform registration. The dump now annotates any key set in-process but missing from ~/.hermes/.env with "(shell only — not in .env; managed/desktop backend may not see it)" so the mismatch is obvious instead of hidden behind "set".	2026-07-02 19:52:18 -05:00
kshitijk4poor	ed4123792c	refactor(providers): dedupe extra_headers normalizer + key picker groups by headers Follow-up to @helix4u's #57336 salvage. Two review findings: - W1: model-picker grouped custom-provider rows by (api_url, credential, api_mode) but NOT extra_headers. Entries sharing a URL+credential+api_mode yet declaring different headers (e.g. per-tenant routing behind one proxy) collapsed into one row and probed /models with whichever header set was seen first (order-dependent). Fold a canonical header identity into group_key so distinct header-authed endpoints stay separate; drops the now-dead first-non-empty merge branch. - W2: the extra_headers stringify+None-filter comprehension existed in 5 copies (config.py x2, runtime_provider.py, model_switch.py, models.py). Extract one shared hermes_cli.config.normalize_extra_headers primitive; all sites now call it. Tests: +normalize_extra_headers unit tests, +regression test proving two same-endpoint entries with different headers stay distinct and each probes with its own headers. 223 targeted tests pass; ruff clean.	2026-07-03 04:23:15 +05:30
helix4u	ab40e952f3	fix(providers): pass extra headers to model discovery	2026-07-03 04:23:15 +05:30
Brooklyn Nicholson	1501a338c3	fix(cli): stop profile-bound backends before deleting so rmtree converges delete_profile stopped only the process named in gateway.pid, but a Desktop app spawns a headless `serve`/`dashboard` backend per profile that holds the profile's SQLite connection open and keeps writing sessions/WAL/sandbox files. That backend is never in gateway.pid, so a CLI `hermes profile delete` run while the Desktop app is up left it writing into the tree — rmtree's final rmdir then failed with ENOTEMPTY (#47368 "Bug 2"), and pre-guard it also resurrected the directory. - _profile_bound_backend_pids(): find running Hermes backends bound to this profile via a `--profile <name>` selector or a HERMES_HOME env resolving to the profile dir. Tightly scoped — current-user only, backend subcommands (serve/dashboard/gateway) only so an interactive chat is never killed, and never this process or its ancestors. - _stop_profile_backends(): terminate them (graceful, then force), best-effort so it can never make delete worse. - _rmtree_with_retry(): a few spaced retries absorb the ENOTEMPTY / Windows file-lock race from a just-terminated writer's in-flight -wal/-shm/sandbox writes instead of failing the whole delete on a race the next attempt wins. Complements the recreation guard (deleted profiles no longer reappear) and the Desktop teardown-before-delete flow; this is the CLI-side convergence fix for a delete run while a Desktop-managed backend is live. Part of #47368.	2026-07-02 15:31:35 -05:00
teknium1	254328bf56	fix(auth): remove stale loopback_pkce reference in xAI quarantine removal list The terminal-refresh quarantine filtered in-memory entries on source == "device_code" but built removed_ids from the deleted "loopback_pkce" source name, so the revoked device-code entry was never pruned from the persisted pool in auth.json. Also restores the _print_loopback_ssh_hint test suite scoped to Spotify (the helper's remaining caller) instead of deleting it wholesale.	2026-07-02 13:17:41 -07:00
Jaaneek	5ef0b8acb0	feat(auth): make xAI Grok OAuth device-code-only, drop loopback login Replace the loopback/PKCE-callback server and manual-paste fallback with the RFC 8628 device-code flow as the only xAI Grok OAuth login path. The flow works in headless/SSH/container sessions with no 127.0.0.1 listener, shrinking the local attack surface. - Poll the token endpoint with server-provided interval, honoring slow_down and expires_in; store tokens with auth_mode oauth_device_code. - Adaptive proactive refresh skew for short-lived device-code JWTs; rotated tokens sync back to auth.json, the global root store, and the credential pool (no refresh-token replay). - Clear source suppression on successful re-login (CLI + dashboard) and drop the duplicate dashboard pool entry so exactly one seeded device_code entry exists. - Use the shared device_code source name for consistency with the nous/codex device-code providers. - Desktop: remove the loopback OAuth flow states and dead type variants; pkce providers' sign-in URL selection is unchanged. - Docs (EN + zh-Hans) rewritten for device-code login; drop the deleted --manual-paste flag from documented commands.	2026-07-02 13:17:41 -07:00
LeonSGP43	472d75193f	Prevent deleted profile skeleton revival	2026-07-02 15:11:56 -05:00
Jneeee	b98baa3039	feat(config): extra HTTP headers for LLM API calls (#3526 salvage) Named providers / custom_providers entries in config.yaml now accept an extra_headers dict scoped to that endpoint — for reverse proxies, API gateways, and custom auth schemes (e.g. Cloudflare Access service tokens). - hermes_cli/config.py: normalize extra_headers on provider entries (_normalize_custom_provider_entry + providers-dict translation), add get_custom_provider_extra_headers / apply_custom_provider_extra_headers_to_client_kwargs helpers keyed on base_url (case/trailing-slash insensitive, no substring bypass — mirrors the TLS helpers) - hermes_cli/runtime_provider.py: surface extra_headers in the resolved runtime for named custom providers (providers dict, legacy custom_providers list, and the credential-pool path) - run_agent.py / agent/agent_init.py: merge per-provider extra_headers onto the OpenAI client default_headers at construction and on every _apply_client_headers_for_base_url re-application (credential swaps, rebuilds), most-specific level wins; OpenAI-wire only (native Anthropic/Bedrock scoped out) - agent/auxiliary_client.py: accept model.extra_headers as an alias of model.default_headers for the global variant - cli-config.yaml.example: documented commented example - Header values are treated as secrets and never logged Salvaged from PR #3526 by @jneeee, reimplemented against current main. Co-authored-by: Teknium <127238744+teknium1@users.noreply.github.com>	2026-07-02 05:33:25 -07:00
Mibayy	070ac2a719	fix(status): label provider as custom when config.yaml model.base_url is set Salvage of the surviving hunk of #3296 by @Mibayy. The PR's gateway _handle_provider_command hunk targets code removed on main (/provider was absorbed into /model + /status, which already read model.base_url); the hermes status mislabel was the remaining live symptom: _effective_provider_label() only checked the legacy OPENAI_BASE_URL env var, so a custom endpoint configured canonically in config.yaml still displayed as OpenRouter.	2026-07-02 04:59:02 -07:00
Teknium	6e369a3762	feat(delegation): unify concurrency caps — deprecate max_async_children (#56955 ) delegation.max_concurrent_children is now the single cap for both a batch's parallelism and concurrent background delegation units. - _get_max_async_children() delegates to _get_max_concurrent_children(); a leftover max_async_children key logs a one-time deprecation warning - config v32→33 migration removes the stale key, folding a raised max_async_children into max_concurrent_children (max wins, no lost headroom) - capacity error messages now point at max_concurrent_children - pool-at-capacity sync fallback now attaches an explanatory note so the model/user know why the call blocked instead of dispatching async Previously users who raised max_concurrent_children (e.g. to 15) still hit the invisible default-3 async cap: the 4th background delegate_task silently ran inline, blocking the turn with no signal.	2026-07-02 02:53:39 -07:00
Teknium	543d305bbb	feat(moa): add reference_max_tokens to cap advisor output and cut turn latency (#56756 ) MoA per-turn latency is dominated by advisor GENERATION: turn wall time correlates ~0.88 with output tokens and ~-0.03 with input tokens (measured over 52 turns). Each turn waits for the slowest advisor to finish writing, and advisors were uncapped — writing multi-thousand-token essays the aggregator only needs the gist of. Add an opt-in per-preset reference_max_tokens knob (mirrors reference_temperature) that caps ADVISOR output only; the acting aggregator is never capped. Default None = uncapped, so existing presets are byte-for-byte unchanged (no regression). Wired through both MoA execution paths (MoAChatCompletions.create and aggregate_moa_context). E2E: same task, closed preset uncapped vs reference_max_tokens=600 -> 59s to 33s (~44% faster), final answer identical/correct. - hermes_cli/moa_config.py: _coerce_int_or_none helper + reference_max_tokens in _normalize_preset/_default_preset/flattened view - agent/moa_loop.py: read preset.reference_max_tokens, pass to reference fan-out - agent/conversation_loop.py: pass reference_max_tokens on the per-turn path - tests + docs	2026-07-02 00:16:35 -07:00
Ben Barclay	9be39de0f2	fix(auth): make HERMES_PORTAL_BASE_URL/NOUS_PORTAL_BASE_URL bypass the Portal host allowlist (#56864 ) Ben caught that the initial approach (widening _NOUS_PORTAL_ALLOWED_HOSTS to include the staging host) was the wrong fix -- env vars are supposed to override the allowlist, mirroring how NOUS_INFERENCE_BASE_URL already bypasses _ALLOWED_NOUS_INFERENCE_HOSTS via _nous_inference_env_override(). The actual bug: both resolve_nous_access_token and resolve_nous_runtime_credentials read `_optional_base_url(state.get("portal_base_url")) or os.getenv(...) or ...` -- a plain `or` chain where the STORED state value wins first (short-circuits before the env vars are even read), and then whichever value won gets run through the same _NOUS_PORTAL_ALLOWED_HOSTS gate regardless of its source. So a hosted agent stamped with HERMES_PORTAL_BASE_URL=<staging> in its env AND a staging portal_base_url already persisted to auth.json would still get silently rewritten to prod on every refresh, because the env var never even got a chance to be consulted. Revert the previous _NOUS_PORTAL_ALLOWED_HOSTS widening entirely -- staying prod-only preserves the allowlist's actual job (rejecting an untrusted network-provided portal_base_url persisted to auth.json by a compromised Portal response). Add _nous_portal_env_override() (mirrors _nous_inference_env_override()) and restructure both call sites so the env override is checked FIRST and, when set, wins outright and skips the allowlist gate entirely -- the allowlist only ever runs against the fallback (stored-state-or-default) path now. Rewrote tests/hermes_cli/test_nous_portal_staging_allowlist.py to test the actual fix: the helper function, and an end-to-end resolve_nous_access_token proof that the env override wins even when state ALSO has the staging host stored (the exact incident shape), that it wins over a stored PROD host too, and that the allowlist's heal-to-prod behaviour for an untrusted stored value is preserved when no override is set.	2026-07-02 06:52:46 +00:00
kshitijk4poor	676236bb1d	fix(agent): honor custom CA certs on aux client + harden TLS resolution The salvaged fix wired per-provider ssl_ca_cert / ssl_verify (and HERMES_CA_BUNDLE) into the MAIN OpenAI client. This follow-up: - Auxiliary client parity: process_bootstrap.build_keepalive_http_client accepts and forwards verify; auxiliary_client._resolve_aux_verify mirrors the main-client TLS resolution (via load_config_readonly, the read-only fast path) so compression/vision/web_extract/title-gen/session_search honor the same per-provider CA. Without this, chat worked against a private-CA endpoint but every auxiliary call still failed APIConnectionError. - switch_model now reads custom_providers from live config (load_config_readonly) instead of the init-time agent._custom_providers snapshot, so ssl_ca_cert / ssl_verify edits are honored on mid-session model switch — matching the context-length reload (#15779). - Drop the dead client-level verify= where a custom httpx transport is used (httpx ignores it there); verify lives on the transport. Fix docstrings. Applies to both run_agent._build_keepalive_http_client and process_bootstrap. - resolve_httpx_verify: add CURL_CA_BUNDLE to the env chain (consistency with agent/ssl_guard._CA_BUNDLE_ENV_VARS) and emit a loud logger.warning naming the endpoint whenever ssl_verify:false disables verification. - get_custom_provider_tls_settings: case-insensitive base_url match (config dedup already lowercases; scheme/host are case-insensitive) so a mixed-case entry doesn't silently drop its CA. Exact match preserved — no prefix bypass. - Demote best-effort except Exception: pass in agent_init/switch_model to logger.debug(exc_info=True). - Tests for aux verify forwarding, _resolve_aux_verify, case-insensitive match, and prefix-bypass rejection.	2026-07-02 04:51:56 +05:30
HexLab98	3a2ba959ce	fix(agent): honor custom CA certs for custom_providers HTTPS endpoints Wire ssl_ca_cert and ssl_verify through custom_providers config and env vars into the keepalive httpx client, fixing APIConnectionError against mkcert/self-signed Ollama proxies behind HTTPS.	2026-07-02 04:51:56 +05:30
Brooklyn Nicholson	428b9a0c42	fix(cli): render /journey color instead of leaking raw ANSI In the interactive CLI, /journey dispatched straight to `args.func(args)`, letting Rich write ANSI to stdout — which patch_stdout's StdoutProxy passes through as literal `?[38;2;…m` garbage. Route the read-only views (default + `list`) through a captured, force-color Console and re-emit via `_cprint` (prompt_toolkit's ANSI parser), matching the `ChatConsole` idiom. `delete`/`edit` stay on real stdio since they prompt / open `$EDITOR`.	2026-07-01 16:25:48 -05:00
snav	35eb93c8df	fix(codex-runtime): re-running /codex-runtime codex_app_server when already enabled now triggers migration The /codex-runtime slash command short-circuits with "openai_runtime already set" when invoked with the same value as the current config, and crucially skips the entire migration block below. The check conflates two things: (a) "the config value is correct" and (b) "the world state (managed block in ~/.codex/config.toml, hermes-tools MCP callback, plugin discovery) is converged". Common footgun this exposes: a user who pre-sets `model.openai_runtime: codex_app_server` directly in config.yaml (reasonable thing to do) and then runs /codex-runtime codex_app_server to trigger migration sees "already set" and silently gets no migration. ~/.codex/config.toml never receives the managed block, the hermes-tools MCP callback never registers, and codex falls through to its default runtime instead of the app-server one — visibly successful but functionally partial setup. The migration is idempotent by design (it replaces its own managed block in place between MIGRATION_MARKER and MIGRATION_END_MARKER), so re-running it is safe and cheap. Fix the short-circuit to fall through to migration when re-applying codex_app_server while skipping the config persist (no value-level change needed). The disable case (re-applying "auto") still short-circuits because disabling doesn't touch ~/.codex/config.toml at all. The user-visible message changes to "openai_runtime already set to codex_app_server — re-applying migration" so re-runs surface what happened. Regression test (test_reapply_codex_app_server_runs_migration) asserts: - migrate() was called when re-applying - persist_callback was NOT called (no config write on no-op transitions) - migration output (MCP servers, sandbox default) surfaces in the user-visible message - requires_new_session is True so callers know to /reset Verified RED→GREEN: the test fails on origin/main with "migration must run on reapply, not just first enable" and passes with this fix. Full test_codex_runtime_switch.py suite: 31 passed.	2026-07-01 23:51:54 +05:30
Steve Lawton	c73e74386b	feat(vertex): add Google Vertex AI provider for Gemini (OAuth2) Adds Vertex AI as a first-class provider for Gemini models via Vertex's OpenAI-compatible endpoint. Vertex authenticates with short-lived OAuth2 access tokens (service-account JSON or ADC), not a static API key — the missing piece behind the recurring requests (#13484, #12639, #56259). - agent/vertex_adapter.py: OAuth2 token minting + refresh-on-expiry (5-min margin), ADC->service-account fallback, global vs regional endpoint URLs. Config precedence: env var > config.yaml > default. - plugins/model-providers/vertex/: provider profile (auth_type=vertex), reuses Gemini's extra_body.google.thinking_config translation. - runtime_provider: vertex short-circuit BEFORE the credential pool so a credentials-file path is never mistaken for a static API key; mints a fresh token + computes base_url per resolve. - run_agent + conversation_loop: _try_refresh_vertex_client_credentials() re-mints the token and rebuilds the client on a mid-session 401, so a long-lived gateway agent survives token expiry (~1h). - auxiliary_client: vertex auth_type branch for side-LLM tasks. - config.yaml: vertex.project_id / vertex.region (non-secret, bridged to env); credential path stays in .env (VERTEX_CREDENTIALS_PATH). - setup wizard + model picker: dedicated _model_flow_vertex; curated google/gemini-* model list; --provider choices. - pricing/metadata: Vertex prices off the gemini docs snapshot; endpoint host auto-maps to the vertex provider (no probe spam). - lazy_deps + pyproject [vertex] extra: google-auth, opt-in only. - docs: guides/google-vertex.md + providers page; tests for adapter + runtime resolution. Salvages and modernizes #8427 by @slawt onto current main: rewired from the legacy PROVIDER_REGISTRY path to the provider-profile architecture, moved non-secret config out of .env into config.yaml, and added the per-turn 401 token-refresh the original lacked.	2026-07-01 05:25:33 -07:00
teknium1	d739194926	test(auth): mock new source-aware Nous state read boundary resolve_nous_runtime_credentials / resolve_nous_access_token now read via _load_provider_state_with_source (and write via _save_provider_state_to_source). TestEnvOverrideWins mocked only the old _load_provider_state, so the real (empty) state was read → AuthError. Mock the new boundary too, returning (state, None) so the write-through helper treats it as the active store.	2026-07-01 05:06:00 -07:00
teknium1	34de127200	fix(auth): widen portal_base_url allowlist guard to runtime credential path The salvaged PR guarded only resolve_nous_access_token; the primary resolve_nous_runtime_credentials path also POSTs the refresh token to portal_base_url on refresh with no allowlist check. Mirror the guard there so a poisoned host can't receive the bearer, and drop the stray duplicated allowlist comment. Adds a sibling-site regression test.	2026-07-01 04:57:40 -07:00
szzhoujiarui	f3c5327e67	fix(auth): validate portal_base_url and migrate stale api.nousresearch.com (#44710 )	2026-07-01 04:57:40 -07:00
teknium1	3b41df6d46	test(gateway): regression for multi-profile node symlink leak; AUTHOR_MAP Add tmp_path symlink regression tests for both generate_systemd_unit and generate_launchd_plist (~/.local/bin/node -> profile node install must not leak the profile target into the generated unit PATH). Register jearnest11's AUTHOR_MAP entry for the salvage cherry-pick.	2026-07-01 04:57:21 -07:00
kshitijk4poor	fb7a38ad21	fix(macos): compose launchd reload retry with _launchctl_bootstrap + drain-aware window Reworks @valenteff's #53277 fix per review (Teknium's 3 findings): - Route refresh_launchd_plist_if_needed's bootstrap through the existing _launchctl_bootstrap() EIO-recovery helper (canonical since #56256), wrapped in a wall-clock retry loop, instead of an ad-hoc 5x2s loop. - Window sized to agent.restart_drain_timeout (default 180s), not a fixed ~10s: the failure happens while the old gateway is still draining (finding 1). - Retry on subprocess.TimeoutExpired too, not just CalledProcessError — a bootstrap timeout after bootout otherwise escapes and leaves the service unloaded (finding 2). - Confirm success with launchctl list, not a bare bootstrap exit 0 (finding 3); mirror verify+drain-window in the detached-helper bash path. - Shared helpers _launchd_reload_log_path / _append_launchd_reload_log / _launchctl_label_registered / _retry_launchctl_bootstrap_until_registered. 3 new tests cover retry-until-listed, TimeoutExpired-retried, deadline-exhaust. E2E: real reload log + mocked launchctl — retries CalledProcessError+TimeoutExpired, verifies via launchctl list, logs failures.	2026-07-01 16:56:14 +05:30
Teknium	81595cd588	fix(dashboard): run plugin gate after auth + enable example fixture Follow-up on the salvaged #47491 commits: - Register _plugin_api_runtime_gate BEFORE the auth middlewares so it executes AFTER them, and add an explicit auth check: unauthenticated requests to /api/plugins/<name>/ fall through to auth's 401 instead of this gate's 404. Prevents the gate from becoming a plugin-name oracle (an unauthenticated caller could otherwise fingerprint installed/enabled plugins by status code). Keeps test_non_kanban_plugin_route_requires_auth green. - Enable the 'example' user plugin in the _install_example_plugin test fixture so the auth / static-asset-allowlist tests still reach the real serving paths now that user plugins are gated on plugins.enabled. - Mark the runtime-gate unit-test scopes as authenticated so they exercise the enabled/disabled policy under the new auth-first ordering.	2026-07-01 04:05:15 -07:00
manusjs	b2e0086f1b	fix(dashboard): enforce plugin disabled gate at request time and for bundled assets Address two residual bypasses identified in review: 1. Add _plugin_api_runtime_gate middleware that checks plugins.enabled/ plugins.disabled on every request to /api/plugins/{name}/... routes. Previously, disabling a plugin at runtime had no effect on its already- mounted API routes until a restart. 2. Extend serve_plugin_asset to check plugins.disabled for bundled plugins. Previously, only user plugins were gated — a bundled plugin in plugins.disabled would still serve assets from the unauthenticated /dashboard-plugins/{name}/... endpoint. Both fixes ensure the enabled/disabled policy is evaluated live at request time, not just at startup. Adds regression tests covering: - Middleware blocks disabled user plugin API routes (404) - Middleware blocks user plugin removed from enabled set (404) - Middleware passes enabled user plugin API routes - Middleware blocks disabled bundled plugin API routes (404) - Bundled plugin assets return 404 when disabled - Bundled plugin assets served normally when not disabled - User plugin asset gating still works correctly	2026-07-01 04:05:15 -07:00
HiaHia	8feeb0ccb8	fix(gateway): retry launchd bootstrap after bootout on EIO for install/start On macOS, `launchctl bootstrap` of a label still registered in the domain fails with 5: Input/output error (EIO). That is the already loaded case — a stale registration from an interrupted restart or a bootout that didn't settle — recoverable by booting the leftover out and bootstrapping again, and distinct from the domain being genuinely unmanageable. launchd_install and launchd_start (both bootstrap paths) treated exit 5 as 'launchd cannot manage this macOS version' and silently degraded to a detached process, losing auto-start at login and crash-restart. Centralize bootstrap in _launchctl_bootstrap(), which on EIO boots the stale label out and retries once; only if the retry also fails does the error propagate so callers apply their existing _launchctl_domain_unsupported fallback for a genuinely broken domain. launchd_restart already boots out before bootstrapping (its drained job is almost always still registered, so a plain bootstrap would hit EIO on the common path), so it keeps its explicit pre-bootout rather than routing through the bootstrap-first helper. Corrected the stale exit-5 comment that claimed it always meant an unmanageable domain. Adds TestLaunchctlBootstrapEioRetry covering clean bootstrap (no bootout), EIO -> bootout -> retry success, persistent EIO re-raise, and non-EIO re-raise without a spurious bootout.	2026-07-01 03:21:20 -07:00
teknium1	b48cacb97b	fix(gateway,cron): guard cron model-tool path + add auto-resume loop breaker (#30719 ) Completes the #30719 restart-loop defenses. Defenses 1-2 (the _HERMES_GATEWAY guard on `hermes gateway stop\|restart` + terminal_tool, and the cron-creation lifecycle filter) already landed on main, but two gaps remained: - The agent's `cronjob` model tool calls cron.jobs.create_job directly, bypassing the hermes_cli.cron.cron_create CLI filter, so lifecycle commands scheduled via the model tool were only blocked at execution time (terminal_tool), not at creation. Moved the filter to a shared cron/lifecycle_guard.py enforced at create_job — the single chokepoint every job-creation path hits (CLI + model tool). Re-exported _contains_gateway_lifecycle_command from hermes_cli.cron so terminal_tool's import keeps working. - No breaker for the auto-resume loop itself. Defenses 1-2 cover the cron/CLI/terminal paths, but any other SIGTERM source (e.g. a raw terminal("launchctl kickstart ai.hermes.gateway")) still triggers the boot->auto-resume->re-run cycle. Added gateway/restart_loop_guard.py: counts restart-interrupted boots in a rolling window (config gateway.restart_loop_guard, default 3 boots / 60s) and skips auto-resume for that boot once tripped. The gateway still comes up and serves real inbound messages; it just stops replaying the session that keeps killing it, putting a human back in the loop. Also tightened the lifecycle regex over main's version: dropped `hermes gateway start` (benign), required the gateway identifier on the launchctl/systemctl branches (so `launchctl unload ai.hermes.update-checker.plist` and `systemctl restart hermes-meta.service` no longer false-positive), added the inverse pkill token order, and fixed the binary-script bypass (decode with errors='replace' instead of swallowing UnicodeDecodeError). The create_job guard resolves relative script paths under HERMES_HOME/scripts the same way the scheduler does, so a bare script name is scanned as the file that actually runs. Design and much of defense-2 originate from PR #33395 (@kshitijk4poor), which itself salvaged #30728 (@SimoKiihamaki). Rebuilt against current main since defenses 1-2 had already landed under different names. Closes #30719. Co-authored-by: SimoKiihamaki <simo.kiihamaki@gmail.com> Co-authored-by: kshitijk4poor <82637225+kshitijk4poor@users.noreply.github.com>	2026-07-01 02:48:36 -07:00
xxxigm	9efe01c3a0	test(runtime): pin Anthropic OAuth → /v1/messages routing across runtime branches End-to-end regression coverage for #32243 that asserts every runtime branch resolving an Anthropic endpoint returns `api_mode == "anthropic_messages"`: * `_resolve_explicit_runtime` — the path used when a Hermes subcommand passes an explicit `--api-key` / `--base-url`. Pins that a stale persisted `model.api_mode: chat_completions` from a prior provider migration cannot override the anthropic pin. * `_resolve_runtime_from_pool_entry` — the path triggered by `hermes auth add anthropic --type oauth` (the exact flow from the issue). Same stale-api_mode regression pinned here. * `_try_resolve_from_custom_pool` — the user-defined `providers:` / `custom_providers:` path that depends on the URL detector fix landed in the prior commit. Asserts both the detector fallback fires for `api.anthropic.com` and that an explicit `api_mode_override` still wins (so users who DELIBERATELY pointed a chat_completions transport at api.anthropic.com for OpenAI-compat experiments aren't hijacked). Co-locates the three contracts so a future refactor of one branch cannot silently diverge from the others and re-introduce the "out of extra usage" 400 on fresh OAuth Pro/Max credentials.	2026-07-01 02:18:56 -07:00
xxxigm	a2251b40eb	test(provider): pin api.anthropic.com → anthropic_messages URL detection Add a dedicated `TestDirectAnthropicHost` class to `test_detect_api_mode_for_url.py` covering the native Anthropic host shape (bare, trailing slash, /v1 suffix, uppercase host) plus the two negative-space regressions that matter for security: lookalike subdomains (`api.anthropic.com.attacker.test`) and path-segment spoofing (`https://proxy.example.test/api.anthropic.com/v1`) must NOT be classified as native — leaking an Anthropic OAuth token to either would be the worst case. Refs #32243.	2026-07-01 02:18:56 -07:00
Harish Kukreja	01bf61c865	fix(runtime): honor NOUS_INFERENCE_BASE_URL across pool/explicit/aux paths Upstream #52270 added `_nous_inference_env_override()` but wired it into only `resolve_nous_runtime_credentials`. Three sibling resolution paths still ignored the override, so a self-hosted Nous inference endpoint set via `NOUS_INFERENCE_BASE_URL` was silently dropped whenever credentials arrived through any of them: - the credential-pool path (`_resolve_runtime_from_pool_entry`) - the explicit-provider path (`_resolve_explicit_runtime`) - the auxiliary side-LLM client (`_pool_runtime_base_url`) Route all three through the same auth-layer reader so every `NOUS_INFERENCE_BASE_URL` read shares one normalization path (trailing-slash stripping, blank -> empty) and the documented trusted-bypass intent stays in one place. The override is live-only: it wins for the base URL returned this run but is never persisted to auth.json or the credential pool, so an ephemeral dev/staging value cannot poison durable auth state. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-07-01 01:52:06 -07:00
kernel-t1	b944c6e821	fix(cli): stop .env sanitizer from splitting secrets that embed a known KEY= ## What does this PR do? A single, perfectly valid `.env` line was being silently corrupted on read and write. When a secret's value happened to contain a known Hermes env var name followed by `=` — for example a webhook or proxy base URL carrying a query parameter like `OPENAI_BASE_URL=https://proxy.example.com/v1?TAVILY_API_KEY=sk-...` — `_sanitize_env_lines()` treated the embedded `KEY=` as a second entry. It truncated the real secret at the inner match and fabricated a bogus second variable. A related path silently dropped any text before the first matched key. Because this runs on every `load_env()`, `save_env_value()`, `remove_env_value()` and `sanitize_env_file()`, the damage was written back to `~/.hermes/.env` and re-applied on every read — persistent loss/corruption of the canonical secrets store. The concatenation splitter now only acts when the line actually begins with a known `KEY=` (so leading text is never dropped) and when every value that precedes a boundary is a plain token. If a preceding value looks structured — a URL/query string (`://`, `?`, `&`) or contains whitespace — the embedded `KEY=` is understood to be part of that value, and the line is kept verbatim. Genuine concatenations of plain-token secrets still split as before. ## Related Issue N/A ## Type of Change - [x] 🐛 Bug fix (non-breaking change that fixes an issue) ## Changes Made - `hermes_cli/config.py`: added `_looks_like_structured_value()` helper and reworked the split logic in `_sanitize_env_lines()` to anchor splits to the line start and skip splitting when a preceding value looks like a URL/query string or holds whitespace. - `tests/hermes_cli/test_config.py`: added two regression tests — a value that embeds a known `KEY=` is preserved verbatim, and leading text before the first key is not dropped. ## How to Test 1. Run the sanitizer tests: `pytest tests/hermes_cli/test_config.py -k anitize -q`. 2. Confirm the new cases reproduce the bug on the old code and pass on the new: `OPENAI_BASE_URL=https://proxy.example.com/v1?TAVILY_API_KEY=sk-embedded` is returned unchanged instead of being split into a truncated value plus a fabricated `TAVILY_API_KEY` entry. 3. Run the full file: `pytest tests/hermes_cli/test_config.py -q` (97 passed). ## Checklist ### Code - [x] I've read the Contributing Guide - [x] My commit messages follow Conventional Commits (`fix(scope):`, `feat(scope):`, etc.) - [x] I searched for existing PRs to make sure this isn't a duplicate - [x] My PR contains only changes related to this fix/feature (no unrelated commits) - [x] I've run `pytest tests/ -q` and all tests pass - [x] I've added tests for my changes (required for bug fixes, strongly encouraged for features) - [x] I've tested on my platform: macOS 15 (Darwin 25.5) ### Documentation & Housekeeping - [x] I've updated relevant documentation (README, `docs/`, docstrings) — or N/A - [x] I've updated `cli-config.yaml.example` if I added/changed config keys — or N/A - [x] I've updated `CONTRIBUTING.md` or `AGENTS.md` if I changed architecture or workflows — or N/A - [x] I've considered cross-platform impact (Windows, macOS) per the compatibility guide — or N/A - [x] I've updated tool descriptions/schemas if I changed tool behavior — or N/A	2026-07-01 01:50:32 -07:00
redactdeveloper	6b21a935af	fix(doctor): ignore disabled toolsets in missing-API-key summary hermes doctor's final 'configure missing API keys' summary counted every toolset with unmet key requirements, including default-off and explicitly disabled ones. Filter the summary to toolsets actually enabled for the CLI platform, with a graceful fallback to prior behavior when config resolution fails. Fixes #11336	2026-07-01 01:25:43 -07:00
teknium1	836732f54f	fix(cron): null-safe deliver in cron list + re-resolve BSM secrets per run Two live cron bugs, both surfaced by @banditburai in #35616 (whose larger watchdog/supervisor work is already superseded by the CronScheduler provider refactor on main): - #32896: `cron list` crashed on a present-but-null `deliver` field — `job.get("deliver", ["local"])` returns None for an explicit null, which then hit `", ".join(None)`. Coalesce with `or ["local"]` (same pitfall the sibling `repeat` line already guards against). - #33465: cron jobs 401'd on Bitwarden/BSM-backed secrets. The per-run env reload used a bare `load_dotenv(override=True)`, which re-applied only the .env placeholder — startup had already recorded this HERMES_HOME in env_loader._APPLIED_HOMES, so the external-secret re-pull no-oped. Route the reload through load_hermes_dotenv() and call reset_secret_source_cache() first to force the re-pull (Bitwarden's 300s value-cache keeps it off the network; override honours secrets.bitwarden.override_existing, mirroring startup). Tests: null-deliver regression guard in test_cron.py; reset-before-reload ordering guard in test_scheduler.py. Migrated 31 scheduler-reload test seams from patching dotenv.load_dotenv to the new load_hermes_dotenv / reset_secret_source_cache seam.	2026-07-01 01:05:33 -07:00
JezzaHehn	54f32af4a7	fix(security): require explicit consent before uploading debug logs `hermes debug share` printed a privacy notice and then uploaded the report to a public paste service in the same breath — the user never got to say yes or no. Add a consent gate: an interactive [y/N] prompt, a --yes/-y flag to skip it, and a hard refusal (exit 1) in non-interactive contexts (no TTY on stdin) so debug data can't be exposed silently in scripts/CI. - New _confirm_upload() helper gates the actual upload after the notice. - Applied to BOTH upload paths: the public paste.rs path and the --nous Nous-S3 path (the latter is a sibling site the original PR missed). - The /debug slash command passes yes=True (typing /debug is itself the consent action, and input() would hang inside prompt_toolkit). - Rewrote the privacy notice for accuracy: secrets (API keys/tokens/ passwords) ARE force-redacted before upload; PII (display name, platform user ID, verbatim message content, filesystem paths) is NOT, and that URL is public. Fixes #22016. Co-authored-by: liuhao1024 <liuhao1024@users.noreply.github.com>	2026-07-01 00:38:17 -07:00
Ben	98d550e035	feat(debug): support /debug [nous\|local] in the CLI/TUI slash command The --nous flag was only wired into the argparse `hermes debug share` subcommand. The /debug slash command (classic CLI + TUI, both via process_command -> _handle_debug_command) built a hardcoded args namespace with no `nous` attribute, so it always took the default paste.rs path. Pass cmd_original through to _handle_debug_command and parse an optional destination word: /debug -> public paste (default, unchanged) /debug nous -> Nous-internal S3 /debug local -> stdout, no upload local wins over nous (never touches the network); unknown words fall back to the default. Add args_hint="[nous\|local]" so help/autocomplete surface it. New TestDebugSlashCommand covers the parsing + dispatch.	2026-06-30 17:29:23 -07:00
Ben	89653db403	feat(debug): drop dead confirm step from --nous upload (stateless NAS) NAS PR #349 (merged) ships a stateless presigned-PUT endpoint: the only route is POST /api/diagnostics/upload-url, and the object's existence in S3 is the only state. There is no /api/diagnostics/confirm route — confirming live against the merged preview returns 404. The client's confirm_upload() therefore fired a guaranteed-404 request on every --nous upload (harmless, since errors were swallowed, but dead). Remove it and simplify share_to_nous() to the 2-step mint + PUT flow that matches the shipped contract. Drop the corresponding TestConfirmUpload class and confirm assertions; add a test that the share succeeds even when the response carries no id (we no longer depend on it). The separately-flagged cross-repo requirement from #349's review -- sizeBytes is now REQUIRED and signed into the presigned URL's ContentLength -- was already satisfied: share_to_nous() sends len(bundle) as sizeBytes and urllib sets a matching Content-Length on the PUT. Verified against the live merged preview (missing sizeBytes -> 400 invalid_body; present -> 503 dark). Tested: pytest tests/hermes_cli/test_diagnostics_upload.py tests/hermes_cli/test_debug.py -> 95 passed.	2026-06-30 17:29:23 -07:00
Ben Barclay	51eeb70cb8	feat(debug): add --nous flag to upload diagnostics to Nous S3 `hermes debug share --nous` uploads the (force-redacted) debug bundle to Nous-internal S3 storage via a presigned URL minted by the Nous account service, instead of a public paste. The bundle is private — viewable only by Nous staff / allowlisted mods through a Google-OAuth-gated viewer — and auto-deletes after 14 days. The paste.rs path is unchanged and remains the default. - hermes_cli/diagnostics_upload.py (new): stdlib-urllib NAS client — request_upload_url(), put_bundle(), confirm_upload() (best-effort), share_to_nous() orchestrator. Base URL via HERMES_DIAGNOSTICS_BASE_URL (default https://portal.nousresearch.com). - hermes_cli/debug.py: extract collect_share_bundle() from build_debug_share() so the Nous path reuses the exact same redaction/collection (paste.rs behaviour unchanged); add build_nous_bundle() producing the gzipped {"format":"hermes-debug-share/1","redacted":...,"files":...} envelope the discord-support viewer parses; add the --nous run path with a privacy notice and a clean fallback (suggest --local) on failure. - hermes_cli/main.py: add the --nous flag + help/epilog entry on `debug share`. - tests: test_diagnostics_upload.py (new) mocks urllib; test_debug.py adds bundle/Nous coverage. 97 passing.	2026-06-30 17:29:23 -07:00
HiddenPuppy	0e4c879a3b	fix: keep plain custom GPT-5 relays on chat completions Generic provider:custom relays were force-routed to the OpenAI Responses API whenever the model matched gpt-5*, and a stale persisted model.api_mode=codex_responses survived /reset and upgrades. Some OpenAI-compatible relays do not implement Responses semantics, which surfaced as malformed function_call.name replay errors in gateway sessions. - runtime_provider: route custom-provider api_mode through _resolve_plain_custom_api_mode(), which drops a stale codex_responses unless the URL is direct OpenAI/xAI - run_agent: _provider_model_requires_responses_api returns False for custom; direct api.openai.com / api.x.ai URLs still upgrade via _is_direct_openai_url() / URL detection - regression coverage for plain relays vs direct OpenAI/xAI URLs Co-authored-by: HiddenPuppy <HiddenPuppy@users.noreply.github.com>	2026-06-30 15:57:52 -07:00
kshitijk4poor	c717be8ded	fix(config): route every migration write through one default-stripping chokepoint A single 'hermes update' / 'hermes -p' could rewrite a hand-curated config.yaml into a near-full DEFAULT_CONFIG dump (the 'you blow up my profile config on one tweak' reports). Root cause: migrate_config() had ~16 independent save_config() call sites, each author deciding ad hoc whether to materialise a value, and many persisted pure schema defaults with strip_defaults=False. Defaults already merge transparently at read time via load_config(), so writing them is pure bloat that also shadows future default changes (see save_config's docstring). Architectural fix (not a per-site patch): introduce a single _persist_migration() chokepoint that enforces one invariant — a migration may persist only values that DIFFER from the current schema default, plus explicit removals/renames of user data; pure defaults are never written. Every migration write (all 17 sites incl. the version-bump finalizer) now routes through it. The invariant is mechanically correct for all cases and verified empirically: - pure-default seeds (timezone='', curator/auxiliary.curator blocks, interim flag, curator.consolidate=False, empty plugins.enabled) are stripped → merged in at read time; - non-default values (write_approval=True, model_catalog.ttl_hours=1) preserved via explicit-raw-path preservation; - behaviour flips (agent.verify_on_stop=False, schema default still 'auto') preserved because False != 'auto'; - data transforms (custom_providers->providers, stt.model relocation, write_mode->write_approval, compression.summary_* removal, MCP-disable) persist their removals/renames. An explicitly user-set non-default value (e.g. matrix.require_mention: false) is preserved across the bump. Guard tests lock the architecture: an AST check asserts migrate_config() makes no direct save_config() call (all writes go through _persist_migration), and a full-range v1->latest test asserts a lean config is never dumped. Two existing change-detector tests that froze the on-disk representation of default-valued keys are rewritten to assert the effective value via load_config() (behaviour contract, not snapshot). Validation: lean v1->latest migration drops from ~567 bytes to ~196 bytes; 148 config+setup and 196 profile/curator/migrate tests pass on scripts/run_tests.sh.	2026-06-30 20:30:22 +05:30
Vladimir Smirnov	c080a530ae	fix(cli): redact status API keys with --all	2026-06-30 04:38:43 -07:00
Teknium	e7ca53e6b8	fix(moa): disabled presets no longer hijack a plain model switch (#55598 ) exact_moa_preset_name matched any bare model name equal to a preset key, regardless of the preset's enabled flag. On the no-explicit-provider switch path (PATH B in model_switch.py), a plain /model switch whose name collided with a preset key (e.g. "default") silently pivoted the session onto the MoA virtual provider — even when the user had set enabled: false to opt out (issue #55187). The LLM driving a routine model switch could land on a broken moa provider with empty default_preset / unconfigured aggregator credentials. Gate the implicit bare-name match on the per-preset enabled flag. Explicit selection via --provider moa / the model picker uses PATH A and does not go through exact_moa_preset_name, so a disabled preset stays reachable when the user explicitly asks for it.	2026-06-30 04:22:32 -07:00
teknium1	bff61f558f	feat(plugins): enable-time consent prompt for tool_override grant Builds on memosr's sink-level opt-in gate (#29249). Enabling a non-bundled plugin now surfaces the privileged allow_tool_override decision at `hermes plugins enable` time instead of leaving the operator to discover the config key after a runtime rejection. - `hermes plugins enable <name>` prompts for non-bundled plugins: 'Allow this plugin to replace built-in tools?' Default is deny (blank Enter / non-interactive stdin / EOF all fail closed). - --allow-tool-override / --no-allow-tool-override flags for non-interactive and scripted use (and a future desktop checkbox). - Bundled plugins are trusted: never prompted, no entry written. - Writes plugins.entries.<key>.allow_tool_override, the same key the sink gate reads (manifest.key == discovery key), so consent and enforcement compose end to end.	2026-06-30 04:00:42 -07:00

1 2 3 4 5 ...

1782 commits