hermes-agent

Author	SHA1	Message	Date
mrparker0980	10a54ccc2c	fix(security): anchor @file context refs to canonical read deny-list `@file` / `@folder` context-reference expansion enforced its own narrow deny-list (`_ensure_reference_path_allowed` in `agent/context_references.py`) that only covered `~/.ssh` keys, a handful of shell dotfiles, `~/.hermes/.env`, and `skills/.hub`. It never blocked the credential stores that the canonical read guard (`agent/file_safety.get_read_block_error`) protects: provider API keys (`~/.hermes/auth.json`), Anthropic OAuth tokens (`~/.hermes/.anthropic_oauth.json`), MCP OAuth material (`~/.hermes/mcp-tokens/`), webhook HMAC secrets, and project-local `.env` files. This matters because the messaging gateway feeds untrusted remote text straight into reference expansion: `gateway/run.py` calls `preprocess_context_references_async(..., allowed_root=_msg_cwd)` where `_msg_cwd` defaults to the operator's HOME when `TERMINAL_CWD` is unset. A chat peer (Telegram/Discord/Slack/...) could send `@file:~/.hermes/auth.json`, pass the `allowed_root` check (it resolves under HOME), slip past the narrow list, and have the operator's live keys read into the agent's context — where the model would typically echo or act on them. Rather than duplicate and re-sync a second secret list, this routes the guard through the existing single source of truth. A reviewer might ask "why not just add `auth.json` to the local list?" — because the local list has already drifted once (a prior commit had to add `.config/gh`); anchoring to `get_read_block_error` means every future addition there protects this path too. The narrow checks are kept as a fallback since they also cover dirs that guard does not (`.aws`, `.gnupg`, `.kube`, etc.), and the canonical lookup is wrapped so it can never crash reference expansion. N/A - [x] 🔒 Security fix - `agent/context_references.py`: `_ensure_reference_path_allowed` now also consults `agent.file_safety.get_read_block_error` after its existing checks and refuses the reference when that canonical guard flags the resolved path. The lookup is wrapped so guard-resolution failures fall back to the explicit checks instead of breaking expansion. - `tests/agent/test_context_references.py`: added `test_blocks_canonical_read_denylist_credential_stores`, asserting that `@file` attaches for `auth.json`, `.anthropic_oauth.json`, `mcp-tokens/`, and a project-local `.env` are all refused and their secret bodies never reach the expanded message. - `scripts/release.py`: added the contributor email to `AUTHOR_MAP` (release gate). 1. `scripts/run_tests.sh tests/agent/test_context_references.py` — all 15 tests pass, including the new credential-store case. 2. Regression proof: stash `agent/context_references.py`, run the suite with `-- -k canonical`, and confirm the new test fails (secrets leak into the message) without the fix; restore and confirm it passes. 3. `ruff check agent/context_references.py tests/agent/test_context_references.py` and `python scripts/check-windows-footguns.py agent/context_references.py tests/agent/test_context_references.py` both pass. - [x] I've read the Contributing Guide - [x] My commit messages follow Conventional Commits (`fix(scope):`, etc.) - [x] I searched for existing PRs to make sure this isn't a duplicate - [x] My PR contains only* changes related to this fix (plus the AUTHOR_MAP release gate) - [x] I've run the test suite for the touched area and all tests pass - [x] I've added tests for my changes (required for bug fixes) - [x] I've tested on my platform: macOS 15 (Darwin 25.5) - [x] I've updated relevant documentation (README, `docs/`, docstrings) — or N/A - [x] I've updated `cli-config.yaml.example` if I added/changed config keys — or N/A - [x] I've updated `CONTRIBUTING.md` or `AGENTS.md` if I changed architecture or workflows — or N/A - [x] I've considered cross-platform impact (Windows, macOS) — or N/A - [x] I've updated tool descriptions/schemas if I changed tool behavior — or N/A	2026-07-01 02:43:49 -07:00
kshitijk4poor	8f1d22d7ed	chore(release): map r266-tech contributor noreply email for #55780 salvage	2026-07-01 15:01:33 +05:30
teknium1	116a63d3a0	chore(release): map jcjc81 + Tranquil-Flow in AUTHOR_MAP for #54947 cluster salvage	2026-07-01 02:29:24 -07:00
Teknium	522a5e93b2	chore(release): map x9x9x9x9x9x91 for #49247 salvage	2026-07-01 02:18:56 -07:00
itenev	f981d47cb0	fix(gateway): prevent Discord disconnects from blocking event loop models_dev.py's fetch uses a synchronous requests.get(timeout=15). Called from the async gateway message handlers, it blocked the event loop for up to 15s, starving Discord heartbeats and causing ClientConnectionResetError disconnects. Adds get_model_context_length_async() which offloads the entire sync resolution chain to a worker thread via asyncio.to_thread(), and switches the two async gateway call sites (_prepare_inbound_message_text, _handle_message_with_agent) to await it. The loop stays responsive; the sync path remains the single source of truth for the cache. Salvaged from PR #22753 by @itenev. Follow-up: dropped the unused fetch_models_dev_async/lookup_models_dev_context_async aiohttp variants from the original PR (dead code with zero callers that had drifted from the sync cache logic) — the to_thread wrapper already runs the sync path off-loop, so they were redundant.	2026-07-01 02:17:35 -07:00
Teknium	ea533e7f41	chore(release): map justin-cyhuang contributor email for #31960 salvage	2026-07-01 02:12:25 -07:00
Teknium	259e6b87a7	fix(teams-pipeline): reject dot-only recording display_name Path(raw).name reduces '..'/'.'/'' to themselves, so basename extraction alone still let a Graph-provided display_name of '..' or '../' escape the temp recording directory (tmp_dir / '..' resolves to the parent). Reject the dot-only basenames explicitly and fall back to the artifact id. Extends @outsourc-e's regression coverage with the dot-only cases.	2026-07-01 02:03:48 -07:00
teknium1	6d30f8c0ab	chore: add AUTHOR_MAP entry for PR #52534 salvage (@qWaitCrypto)	2026-07-01 02:03:40 -07:00
teknium1	49cb06c07a	chore(release): map sasquatch9818 for PR #41198 salvage	2026-07-01 01:54:45 -07:00
kshitijk4poor	58ea7f9071	chore(release): map claudlos contributor email for #52351 salvage	2026-07-01 14:23:01 +05:30
Teknium	f70abae606	chore(release): map kernel-t1 for .env sanitizer salvage (#41349 )	2026-07-01 01:50:32 -07:00
teknium1	db2ac840c1	chore(release): map kyzcreig@gmail.com in AUTHOR_MAP	2026-07-01 01:44:40 -07:00
kshitijk4poor	843a3be7d6	chore(attribution): map baris@writeme.com -> isair for salvaged #50124	2026-07-01 14:09:15 +05:30
Teknium	a56bfeb2cb	chore(release): map approval-bypass PR contributors AUTHOR_MAP entries for the salvaged shell-bypass fixes: xy200303 (#40663), YLChen-007 (#26965), egilewski (co-author #40663). necoweb3 (#55653) already mapped.	2026-07-01 01:39:10 -07:00
teknium1	907cbba885	chore(release): add Vesna-9 to AUTHOR_MAP for #41274 salvage	2026-07-01 01:38:59 -07:00
teknium1	5e64dd9a98	chore: map charleneleong84 email to AUTHOR_MAP for #11736 salvage	2026-07-01 01:36:34 -07:00
teknium1	80d0ff8da5	chore: add AUTHOR_MAP entry for PR #40978 salvage (@friendshipisover)	2026-07-01 01:27:26 -07:00
ryo-solo	d578b6165d	fix(api_server): pop fallback model kwarg to prevent AIAgent collision When the primary provider's auth fails (expired token / 429 quota cap), _resolve_runtime_agent_kwargs() falls through to the fallback provider chain, whose runtime dict carries its own 'model' key. api_server's _create_agent then did AIAgent(model=model, **runtime_kwargs), colliding on 'model' and 500ing every /v1/chat/completions request while a fallback was active. Pop the runtime model and let it override the config model, mirroring the native gateway path (_resolve_session_agent_runtime). Salvaged from #35716 by @ryo-solo (earliest submitter); the PR's second half (Mistral reasoning_content strip) is already handled on main and dropped. Co-authored-by: Hermes Agent <noreply@nousresearch.com>	2026-07-01 01:26:27 -07:00
teknium1	ce9d180a94	chore: add redactdeveloper to AUTHOR_MAP for PR #36897 salvage	2026-07-01 01:25:43 -07:00
teknium1	081c91c147	chore: add AUTHOR_MAP entry for PR #40773 salvage (rrevenanttt)	2026-07-01 01:25:24 -07:00
petrichor-op	f2a528fb59	fix(agent): never persist empty-response recovery scaffolding Ephemeral empty-response/prefill recovery scaffolding (the synthetic assistant "(empty)" turn, the user nudge, the terminal "(empty)" sentinel, and the thinking-only prefill placeholder) exists only to drive the next API retry; the in-memory loop pops it before appending the real response. The append-only flush did not mirror that, so a mid-turn persist could commit scaffolding to the SQLite session store (and JSON log), and a resumed session would replay synthetic "(empty)"/nudge turns as genuine context — re-poisoning the empty-retry boundary forever. Filter ephemeral scaffolding at both durable-write sites (_flush_messages_to_session_db + _save_session_log), by flag not position, so buried scaffolding (an answered nudge leaves the synthetic pair mid-list) is skipped too. Covers all three flags including _thinking_prefill. Adapted onto current main's identity-tracking flush. Cherry-picked from #41281 by petrichor-op.	2026-07-01 01:08:27 -07:00
teknium1	cf427ccf08	chore: add AUTHOR_MAP entry for PR #35130 salvage (@jnibarger01)	2026-07-01 01:05:28 -07:00
teknium1	deb4629764	chore: add AUTHOR_MAP entry for PR #30491 salvage (MattKotsenas)	2026-07-01 01:02:23 -07:00
teknium1	7136b5382a	chore: add JustinOhms to release AUTHOR_MAP for PR #24469 salvage	2026-07-01 00:45:31 -07:00
teknium1	3aebdb1d23	chore: add AUTHOR_MAP entry for PR #22523 salvage (@H2KFORGIVEN)	2026-07-01 00:27:09 -07:00
Teknium	8d78be5460	revert: back out prompt_caching.enabled toggle (#56105 ) for re-evaluation (#56126 ) * Revert "fix(caching): honor prompt_caching.enabled across model switch + fallback" This reverts commit `36f9f50145`. * Revert "fix: allow disabling prompt caching" This reverts commit `c1c1a12fe6`.	2026-07-01 00:20:32 -07:00
teknium1	36f9f50145	fix(caching): honor prompt_caching.enabled across model switch + fallback @janrenz's PR #35862 added prompt_caching.enabled=false at init only. But _anthropic_prompt_cache_policy re-derives _use_prompt_caching on every /model switch (agent_runtime_helpers) and fallback-model swap (chat_completion_helpers), which re-enabled markers and re-broke the strict proxy the toggle was meant to fix. Move the kill switch into anthropic_prompt_cache_policy so it returns (False, False) on every path. Drop the now-redundant init-time override (kept @janrenz's isinstance hardening on the cache_ttl read). Add policy-level tests + docs for the toggle. Follow-up to salvaged PR #35862.	2026-07-01 00:10:42 -07:00
syahidfrd	0198713c33	fix(security): reuse auth chain when tagging unverified senders in Slack threads Mitigates indirect prompt injection (CWE-863) in Slack thread context. When the bot is mentioned mid-thread for the first time, _fetch_thread_context pulls the full thread via conversations.replies and prepends every reply to the LLM prompt. Replies from senders not on the allowlist were rendered identically to authorised senders, letting a third party in a shared channel inject instructions the model might act on when answering the next authorised message. - BasePlatformAdapter.set_authorization_check / _is_sender_authorized, registered by GatewayRunner._make_adapter_auth_check() with a closure over the existing _is_user_authorized chain (platform/global/group allowlists, allow-all flags, pairing store all stay the single source of truth — no env-var re-parsing). - Tags non-bot thread messages whose sender fails the auth check with an [unverified] prefix; strengthens the header with soft guidance only when at least one unverified message is present, so setups without an allowlist see no behaviour change. - Wired into all three adapter-init sites in run.py (start, reconnect watcher, restart) so the reconnect path is covered too. Softened wording: adapted from the original [untrusted] tag to [unverified] and non-accusatory header framing — the label reflects allowlist status, not a judgment about the person. Adapter relocated to plugins/platforms/slack/ since the PR was authored. Salvaged from #17059.	2026-06-30 18:05:43 -07:00
teknium1	7cb85733b8	chore(release): add AUTHOR_MAP entries for #54609 , #54912 salvage	2026-06-30 17:45:45 -07:00
teknium1	698c287fd0	chore(release): add AUTHOR_MAP entry for CRWuTJ (PR #17082 salvage)	2026-06-30 17:39:30 -07:00
teknium1	7de485703b	fix(gateway): preserve media + reply payload when /queue defers a turn /queue rebuilt the queued MessageEvent with only text/type/source/ message_id/channel_prompt, silently dropping any photo, document, voice, or reply context attached to the command. The deferred turn then ran with the attachment lost. Carry the full payload through, and accept a /queue that has media but no prompt text (e.g. "/queue" as an image caption). Salvaged from #13913 by @ypwcharles — the gateway busy-session/queue infrastructure was rewritten since that PR (Telegram moved to plugins/platforms/, /queue now uses the FIFO chain), so the media fix is reimplemented against the current handler; the PR's batching and busy-bypass changes targeted code paths that no longer exist. Co-authored-by: ypwcharles <92324143+ypwcharles@users.noreply.github.com>	2026-06-30 17:32:35 -07:00
Scott Gabel	4a7a6fd401	fix(approval): redact secrets in user-facing approval prompts The dangerous-command approval prompt renders the flagged command so the user can decide whether to approve. If the agent constructed it with a credential (curl -H 'Authorization: Bearer sk-...', psql postgres://user:pw@host, an execute_code script with api_key = 'sk-...'), that secret hit stdout and, via the gateway notify payload, Discord/Slack messages — which are screenshottable and forwardable. Apply the existing agent.redact.redact_sensitive_text() to every user-facing approval surface. Redaction is display-only: the raw command still executes after approval, and approval persistence keys off pattern_key (not the command text), so the allowlist is unaffected. Decision context (URL, flags, command structure) is preserved; only the secret value masks. Covers all surfaces, including the execute_code path the original PR missed: - prompt_dangerous_approval(): callback + stdout fallback - check_all_command_guards(): gateway approval_data + cron/batch pending fallback - check_execute_code_guard(): gateway approval_data + no-notifier pending fallback (script body can embed credentials) Adds TestApprovalPromptRedaction covering callback redaction, no-over-redaction of clean commands, and the execute_code pending fallback. Salvaged from PR #13139 by @sgabel; extended to the execute_code surface.	2026-06-30 17:29:11 -07:00
teknium1	caa2034f88	chore(release): map codexGW noreply email for PR #12302 salvage	2026-06-30 16:38:31 -07:00
teknium1	638d2e7bfc	fix(memory/holographic): apply FTS5 sanitizer to search_facts sibling The store-level search_facts() shared the same raw-MATCH bug class as _fts_candidates (FTS5 AND-joins tokens, zeroing prose recall). Route it through FactRetriever._sanitize_fts_query via a lazy import to keep the store->retrieval layering acyclic. Also add cyb3rwr3n to release AUTHOR_MAP.	2026-06-30 15:55:11 -07:00
teknium1	86200e7583	chore(release): map kyssta-exe id-prefixed noreply email for PR #55657 salvage	2026-06-30 15:49:36 -07:00
teknium1	0b3752eede	chore(release): add AUTHOR_MAP entry for lEWFkRAD (#53848 salvage)	2026-06-30 12:07:01 -07:00
Neo	c969090878	fix(cli): clear input-blocking overlays when interrupting a running agent Interrupting the agent while an approval/clarify/sudo/secret prompt is up left the overlay state dict set with no thread servicing it. The prompt's worker thread is torn down on interrupt, but read_only (gated on _command_running) plus the keypress filter kept the CLI input locked until the prompt's own timeout expired — the terminal appeared frozen. Drain and clear all four input-blocking overlays on interrupt via a single helper (_clear_active_overlays_for_interrupt): approval -> deny, clarify/sudo/secret -> cancel, each guarded so a dead queue can't block the others; sudo restores the pre-modal draft. Wired into all three interrupt paths — new-message interrupt, Ctrl+C, and Ctrl+Q. Blocking overlays now clear AND fall through so one keypress both clears a stale overlay and interrupts a still-running agent; the /model picker and slash-confirm foreground prompts keep their cancel-and-return behavior. Closes #13618.	2026-06-30 04:49:29 -07:00
teknium1	8e6fd4cfa6	chore(release): add AUTHOR_MAP entry for londo161 (#15795 salvage)	2026-06-30 04:38:43 -07:00
teknium1	6148a9a3fe	chore(release): map nnnet author email for PR #25142 salvage	2026-06-30 04:23:03 -07:00
fayenix	d6c53dcdcb	fix(gateway): stop per-turn agent-cache eviction from model + message_id signature churn Two independent bugs evicted the cached gateway AIAgent on every turn, preventing the prompt cache from ever warming: 1. Model normalization mismatch: the post-run fallback-eviction check compared _agent.model (stripped in AIAgent.__init__) against the raw _resolve_gateway_model() config string. For vendor-prefixed config on native providers (e.g. 'deepseek/deepseek-v4-pro' vs 'deepseek-v4-pro') this was always unequal, so the agent was evicted after every successful run. Normalize _cfg_model the same way (skip aggregators). 2. Discord triggering message_id leaked into the cached system prompt via build_session_context_prompt()'s Discord IDs block. message_id changes every turn, so the agent-cache signature (computed from the ephemeral prompt) changed every Discord turn -> rebuild every message. The id is now injected per-turn into the user message (where per-turn content belongs and does not touch the cache signature); the cached IDs block carries a static pointer to it, preserving reply/react/pin via the discord tools. Adapted from #28846. Bug #1 fix is the contributor's; bug #2 reworked to be non-destructive (keeps the triggering-id capability instead of deleting it). Redundant auto-reset eviction (already on main via #9893/#48031) and the wrong-premise reset_context_note plumbing from the original PR were dropped. Co-authored-by: Hermes Agent <hermes@nousresearch.com>	2026-06-30 04:22:41 -07:00
Zane Ding	ac380050ea	fix(credential-pool): distinguish OpenRouter upstream 429s from account 429s OpenRouter returns 429 in two shapes: an account-level throttle on the user's key, and an upstream-provider throttle (DeepSeek/Anthropic/etc. rate-limiting OpenRouter's aggregate traffic). The classifier treated both identically and rotated/exhausted OPENROUTER_API_KEY on every 429 — burning the key for ~24min and silently disabling auxiliary features (compression, summarization, vision) on an upstream throttle where the key was healthy. Add a FailoverReason.upstream_rate_limit classified from OpenRouter's unambiguous wrapper message "Provider returned error" (the same signal the metadata-raw parser already trusts). Recovery skips credential rotation and defers to the fallback chain to switch models instead. Co-authored-by: Hermes Agent <127238744+teknium1@users.noreply.github.com>	2026-06-30 03:57:14 -07:00
Teknium	abca77615a	chore(release): map Jeffgithub0029 author email for #28558 salvage	2026-06-30 03:51:08 -07:00
teknium1	c510f48680	chore(release): add jasonQin6 to AUTHOR_MAP for PR #15093 salvage	2026-06-30 03:42:25 -07:00
teknium1	2ae9e222f0	chore: AUTHOR_MAP entry for PR #27123 salvage (jimmyjohansson84)	2026-06-30 03:42:20 -07:00
teknium1	ea95fdd6d7	chore(release): add nikshepsvn to AUTHOR_MAP for PR #27426 salvage	2026-06-30 03:41:46 -07:00
teknium1	3ecc58a8da	chore: map trevorgordon981 in AUTHOR_MAP for #50590 co-authorship	2026-06-30 03:27:41 -07:00
teknium1	bf2dc18f84	test+chore: real-path regression test for #15157 model_extra guard + AUTHOR_MAP Adds tests/agent/test_model_extra_type_guard.py exercising the real ChatCompletionsTransport.normalize_response path with string/list/None/dict model_extra; adds the AUTHOR_MAP entry for the contributor.	2026-06-30 03:27:12 -07:00
Tao Yan	b8ebe32866	fix(agent): flatten multi-part user_message in codex intermediate-ack detector Vision requests routed through the OpenAI-compat API server forward the raw multi-part content list ([{type:"text"}, {type:"image_url"}, ...]) straight through as user_message. The codex intermediate-ack detector flattened it with (user_message or "").strip(), so a truthy list survived and .strip() raised AttributeError — killing any Codex-routed vision turn that took the require_workspace path. Route through the existing _summarize_user_message_for_log helper (which already backs the logging/banner previews on main), and widen the param type hint from str to Any to match how the function is actually called. The two logging-preview sites the original PR also touched were fixed independently on main by the conversation-loop refactor. Co-authored-by: Hermes Agent <agent@nousresearch.com>	2026-06-30 03:20:11 -07:00
Markus Phan	cd9f5cc671	fix(delegate): route subagent progress lines through _safe_print for ACP stdio delegate_task's per-task completion display emitted lines like "✓ [1/3] Research done (17.92s)" via a bare print(). Under ACP (and any headless JSON-RPC stdio host where AIAgent routes human output to stderr via a custom _print_fn), these landed on stdout and corrupted the protocol frame stream, surfacing as "Failed to parse JSON message: ✓ [3/3] …" in the ACP adapter. Add _emit_parent_console() which prefers parent_agent._safe_print (the same hook AIAgent uses for every other user-facing print) and falls back to print() only when no router is wired up or it raises. CLI behavior is unchanged. The PR's other fix (preset toolset expansion) is already covered on main by _expand_parent_toolsets(), so only the stdio-safe printing change is salvaged here.	2026-06-30 03:16:22 -07:00
teknium1	db880186f2	chore(release): add AUTHOR_MAP entries for #51841 and #54287 salvage	2026-06-30 03:11:13 -07:00

1 2 3 4 5 ...

1083 commits