hermes-agent

Author	SHA1	Message	Date
teknium1	cfbc7ed1f9	fix(browser): narrow credential-query denylist to unambiguous names Follow-up on the salvaged #49830 hardening. The contributor's sensitive query-param set included bare English words (code, key, auth, session, sig) that double as ordinary page facets — ?code= on promo/challenge pages, ?key= as a search facet, ?session= on blogs — so web_extract and cloud browser_navigate would refuse a large slice of normal browsing. Narrow the set to unambiguously credential-named params (access_token, authorization, client_secret, password, token, x-amz-signature, ...). Prefix-based vendor-key redaction (is_safe_url) still catches recognizable key shapes; this set is the belt-and-suspenders for opaque secrets carried under an explicit credential-named parameter. Also fixes two intra-PR-staleness test breakages surfaced by salvaging onto current main: - web_extract_tool() no longer accepts use_llm_processing= (signature changed since the PR was authored) — dropped the invalid kwarg. - agent.redact now fully masks keyed 'token=<secret>' to 'token=***' instead of partial 'sk-...'; the console-redaction test now asserts the real invariant (secret body gone) rather than the exact mask format. Added a regression test that generic English-word query params are NOT blocked by the credential guard.	2026-07-01 05:04:41 -07:00
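A minimal sketch of the narrowed guard described in this commit, assuming the check is a case-insensitive match on query parameter names. The set contents and the function name `has_credential_query_param` are illustrative, not the repository's actual identifiers.

```python
from urllib.parse import parse_qsl, urlsplit

# Hypothetical denylist: only names that unambiguously denote a credential.
# Bare English words such as "code", "key", or "session" are deliberately absent.
CREDENTIAL_QUERY_PARAMS = frozenset({
    "access_token", "authorization", "client_secret",
    "password", "token", "x-amz-signature",
})


def has_credential_query_param(url: str) -> bool:
    """Return True if any query parameter name is on the denylist."""
    query = urlsplit(url).query
    return any(
        name.lower() in CREDENTIAL_QUERY_PARAMS
        for name, _ in parse_qsl(query, keep_blank_values=True)
    )
```

Under these assumptions a promo URL such as `?code=SAVE10&key=shoes` passes, while `?access_token=...` is refused before the URL reaches a third-party backend.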
yongyong	937e56be92	fix(browser): block bracketed sensitive eval primitives	2026-07-01 05:04:41 -07:00
yongjin	a0beb52a50	fix(browser): harden browser tool safety boundaries Add policy gates and output redaction for browser/CDP surfaces, strengthen session ownership tracking, and block credential-like query parameters before third-party browser/web backends receive URLs. Inspired by the agbrowse review: keep local browser magic-link flows possible while preventing cloud reader/browser escalation from receiving opaque token, code, signature, or key query parameters.	2026-07-01 05:04:41 -07:00
kyssta-exe	7eb9716ad7	fix(agent): apply persist override to the DB row only, never the live list (#48677) The persist user-message override was applied in place to the live messages list. On the early crash-resilience persist (which runs BEFORE api_messages is built), that stripped observed group-chat context off the live user message and silently dropped it when observe_unmentioned_group_messages was enabled. Fix at the single chokepoint: _flush_messages_to_session_db resolves the override (idx/content/timestamp) locally and applies it ONLY to the row written to the DB — the live dict is never mutated, so EVERY persist caller (early persist, mid tool-loop flush, /resume, /branch) is protected uniformly. This supersedes the earlier shallow-copy approach, which broke the intrinsic _DB_PERSISTED_MARKER idempotency (copies never propagated the marker back to the live dicts → duplicate rows) and closes the sibling class tracked in #56303. Trailing empty-response scaffolding is still dropped from the live list in _persist_session (unchanged behavior). Salvaged from #48817; chokepoint reworked to coexist with the marker-based dedup (#50372). Co-authored-by: kyssta-exe <kyssta-exe@users.noreply.github.com>	2026-07-01 17:28:04 +05:30
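The chokepoint idea can be sketched as follows, assuming messages are plain dicts and the DB is a list of rows. The `flush` function and the `_db_persisted` key are hypothetical stand-ins for `_flush_messages_to_session_db` and `_DB_PERSISTED_MARKER`.

```python
from typing import Any, Optional

_PERSISTED = "_db_persisted"  # hypothetical stand-in for _DB_PERSISTED_MARKER


def flush(
    messages: list[dict[str, Any]],
    db_rows: list[dict[str, Any]],
    override: Optional[dict[str, Any]] = None,
) -> None:
    """Write unpersisted messages, applying the override to the DB row only."""
    for idx, msg in enumerate(messages):
        if msg.get(_PERSISTED):
            continue  # already written; repeated flushes stay idempotent
        content = msg["content"]
        if override is not None and override["idx"] == idx:
            content = override["content"]  # changes the stored row, not the live dict
        db_rows.append({"role": msg["role"], "content": content})
        msg[_PERSISTED] = True  # the marker lives on the live dict; its content is untouched
```

Because the marker is set on the live dict rather than on a copy, a second flush writes nothing, which is the property the shallow-copy approach lost.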
teknium1	34de127200	fix(auth): widen portal_base_url allowlist guard to runtime credential path The salvaged PR guarded only resolve_nous_access_token; the primary resolve_nous_runtime_credentials path also POSTs the refresh token to portal_base_url on refresh with no allowlist check. Mirror the guard there so a poisoned host can't receive the bearer, and drop the stray duplicated allowlist comment. Adds a sibling-site regression test.	2026-07-01 04:57:40 -07:00
szzhoujiarui	f3c5327e67	fix(auth): validate portal_base_url and migrate stale api.nousresearch.com (#44710)	2026-07-01 04:57:40 -07:00
teknium1	3b41df6d46	test(gateway): regression for multi-profile node symlink leak; AUTHOR_MAP Add tmp_path symlink regression tests for both generate_systemd_unit and generate_launchd_plist (~/.local/bin/node -> profile node install must not leak the profile target into the generated unit PATH). Register jearnest11's AUTHOR_MAP entry for the salvage cherry-pick.	2026-07-01 04:57:21 -07:00
Jack Earnest	9138176dcd	fix(gateway): don't resolve node symlink into profile dir generate_systemd_unit() and generate_launchd_plist() used Path(shutil.which('node')).resolve().parent to find the node bin dir. When ~/.local/bin/node is a symlink into a specific profile's node install (e.g. ~/.hermes/profiles/<p>/node/bin/node), .resolve() chases it and bakes that one profile's path into EVERY profile's service definition. This breaks profile isolation and makes systemd_unit_is_current() perpetually False: each gateway rewrites its unit + daemon-reload on every boot, destabilizing multi-profile setups into a ~5-minute restart loop (observed NRestarts ~1600 across two gateways). Fix: use Path(resolved_node).parent — the directory where node is found on PATH — instead of chasing the symlink to its resolved target. This keeps generated service definitions profile-agnostic. Affects both the systemd (Linux) and launchd (macOS) unit generators.	2026-07-01 04:57:21 -07:00
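The difference between the two expressions can be sketched like this. The helper names are illustrative only; `resolved_node` stands for whatever `shutil.which('node')` returned.

```python
from pathlib import Path


def node_bin_dir_old(resolved_node: str) -> Path:
    """Pre-fix behavior: chase symlinks, then take the parent directory."""
    return Path(resolved_node).resolve().parent


def node_bin_dir_new(resolved_node: str) -> Path:
    """Post-fix behavior: the directory where node was found on PATH."""
    return Path(resolved_node).parent
```

With `~/.local/bin/node` symlinked into one profile's install, the old helper returns that profile's `node/bin` directory, while the new one returns `~/.local/bin` for every profile.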
ygd58	50aaa426c1	fix(gateway): pairing store cannot bypass configured allowlist A user who tapped Always on an approval button gets a pairing-store entry. _is_user_authorized() checked the pairing store BEFORE the allowlist and returned True unconditionally, so a paired-but-not-allowed user permanently bypassed TELEGRAM_ALLOWED_USERS (or equivalent) even after being removed from the allowlist (#23778). Record pairing membership but only honor it in the no-allowlist branch. When an allowlist IS configured, the paired user must appear in the canonical allowed_ids set (the same set that resolves WhatsApp aliases, SimpleX names, group allowlists, and the '*' wildcard), so pairing grants no extra access. Cherry-picked/rebased from #47736 (#23805) by ygd58; membership check rewritten to reuse the existing allowlist logic. Adds regression tests.	2026-07-01 04:56:25 -07:00
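A simplified sketch of the corrected ordering, assuming IDs are strings and both stores are plain sets. The real `_is_user_authorized()` also resolves aliases and group allowlists, which are omitted here.

```python
def is_user_authorized(user_id: str, allowed_ids: set[str], paired_ids: set[str]) -> bool:
    """Pairing is honored only when no allowlist is configured."""
    if allowed_ids:
        # An allowlist exists: pairing grants nothing extra.
        return "*" in allowed_ids or user_id in allowed_ids
    # No allowlist configured: fall back to the pairing store.
    return user_id in paired_ids
```

A paired user who has been removed from the allowlist is therefore rejected, which is the regression the commit guards against.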
rrevenanttt	03bbd37dd7	fix(mcp): stop EventBridge silently dropping sessions.json-only changes The MCP serve event bridge polls two files to decide whether there is new conversation activity to surface to MCP clients: the gateway sessions.json index and state.db. Its skip-when-unchanged guard was self-defeating — it refreshed self._sessions_json_mtime with the current value before comparing against it, so the sessions.json term was always true and the guard collapsed to a state.db-only check. The impact is silent message loss on the event stream. The gateway commonly persists a message to state.db on one tick and registers the owning conversation in sessions.json a moment later. On that later tick only sessions.json has changed, so the broken guard takes the early return and never processes the freshly-registered chat. Its messages are withheld from every connected MCP client (events_poll / events_wait) until state.db happens to change again — which, for an otherwise-idle conversation, may be never. A polling bridge that quietly swallows new conversations is exactly the failure mode this watcher exists to prevent. The fix is minimal and low-risk: capture the previously-seen sessions.json mtime before the cache refresh and compare against that, so the guard skips only when NEITHER file changed since the last poll. The hot-path mtime optimization is fully preserved (a genuinely idle tick still short-circuits), and all existing EventBridge polling tests continue to pass unchanged. ## What does this PR do? Fixes a logic error in `EventBridge._poll_once` (`mcp_serve.py`) where the "nothing changed, skip this poll" guard compared `sj_mtime` against `self._sessions_json_mtime` after that attribute had already been overwritten with `sj_mtime`. The comparison was therefore always true, reducing the intended "skip only if both files are unchanged" check to a state.db-only check and discarding any tick in which only sessions.json changed. 
The guard now compares against the mtime observed on the previous poll, restoring the intended behavior. ## Changes Made - `mcp_serve.py`: in `EventBridge._poll_once`, snapshot `prev_sessions_json_mtime = self._sessions_json_mtime` before refreshing the cached index, and use it in the skip guard (`sj_mtime == prev_sessions_json_mtime`) so a sessions.json-only change no longer triggers the early return. Added a comment explaining the seam. - `tests/test_mcp_serve.py`: added `TestEventBridgePollE2E::test_poll_picks_up_new_conversation_when_only_sessions_json_changed`, a regression test that reproduces the boundary state (state.db unchanged, sessions.json newly updated) and asserts the new conversation's message is emitted. ## How to Test 1. Reproduce the failure on the old code: with the guard comparing against `self._sessions_json_mtime`, the new test fails — the freshly-registered conversation yields `0` events instead of `1`. 2. Apply the fix and run `pytest tests/test_mcp_serve.py -q` — all 46 tests pass (40 skipped require the optional `mcp` SDK), including the three pre-existing `TestEventBridgePollE2E` polling tests and the new regression guard. 3. `ruff check mcp_serve.py tests/test_mcp_serve.py` and `python scripts/check-windows-footguns.py mcp_serve.py` both report clean.	2026-07-01 04:55:50 -07:00
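The snapshot-before-refresh fix can be sketched in isolation. `PollGuard` and `_state_db_mtime` are hypothetical; only `sj_mtime`, `prev_sessions_json_mtime`, and `_sessions_json_mtime` come from the commit message.

```python
class PollGuard:
    """Tracks the last-seen mtimes of sessions.json and state.db."""

    def __init__(self) -> None:
        self._sessions_json_mtime = 0.0
        self._state_db_mtime = 0.0

    def should_skip(self, sj_mtime: float, db_mtime: float) -> bool:
        # Snapshot the previous values BEFORE refreshing the cache; comparing
        # against the refreshed attribute would always report "unchanged".
        prev_sessions_json_mtime = self._sessions_json_mtime
        prev_state_db_mtime = self._state_db_mtime
        self._sessions_json_mtime = sj_mtime
        self._state_db_mtime = db_mtime
        return (
            sj_mtime == prev_sessions_json_mtime
            and db_mtime == prev_state_db_mtime
        )
```

A tick where only sessions.json moved now returns `False`, so the poll proceeds; a genuinely idle tick still short-circuits.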
teknium1	55c8b2c81f	chore(release): add AUTHOR_MAP entry for udatny (#29433 salvage)	2026-07-01 04:55:15 -07:00
ud	c126a99fc1	fix(subdirectory_hints): catch RuntimeError from Path.expanduser() `pathlib.Path('~user').expanduser()` raises RuntimeError when the tilde-expansion can't resolve the user (e.g. `~500-700` where the LLM meant "approximately 500-700" rather than a path). The hint walker's existing `except (OSError, ValueError):` clauses do not catch RuntimeError, so it escapes through the tool dispatcher and surfaces in the conversation loop as a misleading Error during OpenAI-compatible API call #N: Could not determine home directory. Reproduced across three unrelated models (openai/gpt-5-mini, openai/gpt-5.1-codex, deepseek/deepseek-v4-flash) on terminal-tool commands containing literal tildes in non-path contexts — common in LLM output ("~500 agencies", "~45,000 CVEs", "~80/hr blended rate"). Reproduction (one-liner): >>> from pathlib import Path >>> Path("~500-700").expanduser() RuntimeError: Could not determine home directory. Fix: extend the three `except` clauses in agent/subdirectory_hints.py to also catch RuntimeError: line 138 (_add_path_candidate's outer catch around the Path().expanduser() call) lines 198+202 (_load_hints_for_directory's nested catches around hint_path.relative_to(Path.home())) Tests: tests/agent/test_subdirectory_hints_tilde.py adds three cases covering: tilde-as-approximately in heredoc commands, ~unknown_user paths, and a regression guard that legitimate ~/path expansion still works.	2026-07-01 04:55:15 -07:00
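A minimal sketch of the widened `except` clause, assuming a helper that returns `None` for unusable candidates. `expand_candidate` is a hypothetical name, not a function from `agent/subdirectory_hints.py`.

```python
from pathlib import Path
from typing import Optional


def expand_candidate(raw: str) -> Optional[Path]:
    """Expand a possible path; return None if it cannot be a real path."""
    try:
        return Path(raw).expanduser()
    except (OSError, ValueError, RuntimeError):
        # RuntimeError covers "~name" where no such user exists, e.g. "~500-700"
        # written to mean "approximately 500-700".
        return None
```

Returning `None` lets the caller drop the candidate quietly instead of letting the exception escape through the tool dispatcher.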
JabberELF	18a9467fca	fix(tui): prevent killpg suicide during MCP shutdown Root cause: gateway spawns LSP servers (jdtls/pyright/yaml-ls) and slash_worker without start_new_session=True, so they inherit the gateway process group (= TUI parent PID). When mcp_tool _snapshot_child_pids() races with these spawns during stdio MCP server startup, non-MCP children leak into _stdio_pgids with the TUI parent PGID. shutdown_mcp_servers() then killpg(tui_parent_pid, SIGTERM), killing the TUI itself. Evidence: tui_gateway_crash.log shows recurring SIGTERM stacks: shutdown_mcp_servers -> _kill_orphaned_mcp_children -> _send_signal -> killpg(pgid, sig) -> SIGTERM received Fix (3 layers): 1. agent/lsp/client.py: add start_new_session=True to LSP server spawn so each LSP server gets its own process group/session. 2. tui_gateway/server.py: same fix for slash_worker spawn, the symmetric root-cause patch so no gateway direct child shares the TUI parent pgid. 3. tools/mcp_tool.py: add _filter_mcp_children() defense-in-depth that drops non-MCP children (slash_worker, jdtls/eclipse LSP) from the PID delta before they can poison _stdio_pgids.	2026-07-01 04:54:46 -07:00
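The effect of `start_new_session=True` can be sketched on a POSIX system as follows. `spawn_isolated` is an illustrative helper, not the code in `agent/lsp/client.py` or `tui_gateway/server.py`.

```python
import subprocess


def spawn_isolated(argv: list[str]) -> subprocess.Popen:
    """Start a child in its own session, and therefore its own process group."""
    # start_new_session=True makes the child call setsid() before exec, so a
    # later killpg() aimed at the child's group cannot reach the parent.
    return subprocess.Popen(argv, start_new_session=True)
```

After such a spawn the child's process group ID equals its own PID instead of the parent's group, so a stray PGID collected from it no longer points at the TUI.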
teknium1	04eed932eb	test(gateway): cover auto-resume auth skip + fail-closed Two tests for the auto-resume authorization gate: an unauthorized session owner is skipped without claiming a _running_agents slot or persisting one, and a raising auth check fails closed (session skipped, not resumed).	2026-07-01 04:53:58 -07:00
ygd58	0de67ad604	fix(gateway): validate user authorization before auto-resume Auto-resume of restart-interrupted sessions bypassed auth checks. The session owner was never validated against TELEGRAM_ALLOWED_USERS (or equivalent) before the synthetic resume event was dispatched. An attacker with an active session before the allowlist was configured could receive a full agent response on gateway restart (issue #23778). Clean rebase of #23800 onto current main (egilewski flagged a merge conflict in gateway/run.py on the old branch). Fix: check _is_user_authorized() for the session owner before scheduling auto-resume. Unauthorized sessions are skipped with a warning log instead of silently resuming. Fixes #23778 (partial - auto-resume auth bypass)	2026-07-01 04:53:58 -07:00
srojk34	74e59b8b68	fix(security): close abbreviated-flag bypasses in git/sudo approval patterns git's and sudo's option parsers resolve unambiguous long-flag prefixes, so `git reset --har`, `git branch --delete --force`, and `sudo --stdi`/`--ask` execute identically to their full-flag forms while evading the exact-string DANGEROUS_PATTERNS regexes that gate them. Verified live against real git and sudo binaries. Widen the patterns to accept unambiguous abbreviations, scoped narrowly enough to avoid colliding with sibling flags (--help, --soft/--mixed/--merge/--keep, --shell/--set-home).	2026-07-01 17:17:01 +05:30
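One way to widen an exact-string pattern is sketched below for `git reset --hard` only. The regex is an assumption for illustration, not the repository's actual DANGEROUS_PATTERNS entry, and which prefixes git accepts depends on the subcommand's option table.

```python
import re

# Hypothetical pattern: accept --hard and the prefixes --ha / --har, but not
# --h, which could equally begin --help. The lookahead rejects longer words.
GIT_RESET_HARD = re.compile(r"\bgit\s+reset\b.*\s--ha(?:rd?)?(?=\s|$)")


def is_hard_reset(command: str) -> bool:
    """Return True if the command looks like a (possibly abbreviated) hard reset."""
    return GIT_RESET_HARD.search(command) is not None
```

Sibling flags such as `--soft` and `--help` do not match, which mirrors the narrow scoping the commit describes.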
kshitijk4poor	723ccda275	fix(acp): also preserve archived rows on model-switch / restore saves Follow-up widening the archived-history fix to the sibling save paths the original PR did not cover. Model switches (_cmd_model, set_session_model) and _restore mint a fresh AIAgent with _session_db_created=False, so the agent-owns-persistence guard evaluates False and the blind full-history replace_messages() fired — DELETEing the durable active=0/compacted=1 rows on any compressed ACP session (same data-loss class the PR fixes, different trigger). - hermes_state.replace_messages: add active_only=True to delete/reinsert only the live (active=1) rows, leaving soft-archived rows untouched (idea adopted from the competing PR #50306 by @mrparker0980, credited). - hermes_state.has_archived_messages: cheap existence probe for active=0 rows. - acp_adapter._persist: when the agent doesn't own persistence but the session already has archived rows on disk, replace active-only; otherwise the destructive full replace stays (fresh create/fork has nothing to lose). - Regression test: model-switch save on a compacted session keeps the archived turn discoverable via get_messages(include_inactive=True) + search_messages.	2026-07-01 17:16:51 +05:30
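The `active_only` behavior can be sketched against an in-memory SQLite table. The one-table schema (`session_id`, `content`, `active`) is a simplification assumed for illustration; the real `hermes_state` schema is not shown in this log.

```python
import sqlite3


def replace_messages(conn: sqlite3.Connection, session_id: str,
                     messages: list[str], active_only: bool = False) -> None:
    """Delete and reinsert a session's rows, optionally sparing archived ones."""
    where = "session_id = ?" + (" AND active = 1" if active_only else "")
    with conn:  # delete and reinsert commit together, or not at all
        conn.execute(f"DELETE FROM messages WHERE {where}", (session_id,))
        conn.executemany(
            "INSERT INTO messages (session_id, content, active) VALUES (?, ?, 1)",
            [(session_id, m) for m in messages],
        )


def has_archived_messages(conn: sqlite3.Connection, session_id: str) -> bool:
    """Cheap existence probe for soft-archived (active=0) rows."""
    row = conn.execute(
        "SELECT 1 FROM messages WHERE session_id = ? AND active = 0 LIMIT 1",
        (session_id,),
    ).fetchone()
    return row is not None
```

A caller in the spirit of `acp_adapter._persist` would probe with `has_archived_messages` first and pass `active_only=True` only when archived rows exist.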
sasquatch9818	897240462a	fix(acp): stop _persist from deleting compression-archived history ACP's SessionManager._persist() called db.replace_messages() on every save. That delete-then-reinsert is destructive by design. The agent backing each ACP session already persists to the same SessionDB itself: it flushes turns incrementally via append_message and, on context compression, preserves pre-compaction turns non-destructively through archive_and_compact() as searchable active=0/compacted=1 rows. So the per-save replace_messages() was a redundant double-write that deleted exactly those archived rows (and their FTS entries). Worse, after a compression-driven id rotation the agent's live head no longer equals the ACP session id, so the replace overwrote the ended parent transcript while new turns flowed to the new id — split-brain corruption of one conversation. Any ACP conversation (VS Code / Zed / JetBrains) long enough to compress lost history. Now _persist skips the destructive replace when the agent owns persistence to this DB (its _session_db is this db and its row exists), relying on the agent's own incremental + archival flush. It still falls back to the atomic replace when the agent is not self-persisting — test agent factories, and fresh create/fork sessions whose copied history the agent has not flushed yet — so the #13675 rollback guarantee holds. ## What does this PR do? Fixes silent history loss in ACP editor sessions. ACP _persist no longer destroys the compression-archived transcript the agent already wrote. Long enough conversations compress; that compression archives old turns non-destructively; ACP then hard-deleted them on the next save. After an id rotation it also clobbered the ended parent and split the conversation across two ids. This change defers to the agent's own persistence when it owns the DB and only uses the destructive replace when nothing else is writing the transcript. 
## Changes Made - `acp_adapter/session.py`: in `SessionManager._persist`, guard the `db.replace_messages()` call. Skip it when the agent owns persistence to this DB (`agent._session_db is db` and `agent._session_db_created`); otherwise keep the destructive atomic replace as the fallback. - `tests/acp/test_session.py`: add a regression test proving archived (active=0/compacted=1) rows survive a save when the agent self-persists and stay FTS-searchable; add a test confirming the replace path still runs for agents that do not own DB persistence. ## How to Test 1. Run `pytest tests/acp/test_session.py -q` — 43 pass. 2. `test_save_session_preserves_agent_archived_history`: archive a turn via `archive_and_compact`, save, and confirm it survives and is found by `search_messages` (fails before this fix — replace_messages deleted it). 3. `test_save_session_still_replaces_when_agent_not_self_persisting`: confirm history still overwrites cleanly for non-self-persisting agents.	2026-07-01 17:16:51 +05:30
kshitijk4poor	b4342a83bb	fix(approval): close bare powershell Remove-Item bypass + add ri alias (review) Rework follow-up on the Windows destructive-shell detection. The PowerShell pattern required an explicit -Command/-c before the verb, but PowerShell runs the verb as the DEFAULT POSITIONAL arg — so `powershell Remove-Item -Recurse -Force C:\x` (no -Command) slipped through, the exact case the PR body claims to close. Also missing the canonical `ri` alias. Anchor the verb to the command position (after the shell name + any leading -Flag switches + optional -Command/-c) so bare invocations are caught while a benign path arg containing 'del'/'rm' (e.g. -File c:\del-logs\run.ps1) is not. Add ri to the verb list. Mutation-verified regression tests for the bare invocation, ri alias, and the benign-path negative.	2026-07-01 17:16:08 +05:30
dsad	4b92a8cd31	fix(approval): detect Windows destructive shell commands	2026-07-01 17:16:08 +05:30
Zeheng Huang	4c2c54c78c	fix(matrix): await inbound sync handlers Register the Matrix room-message, reaction, and invite handlers with mautrix's wait_sync=True. mautrix's handle_sync() only returns the tasks for handlers registered as sync-awaited; non-waited handlers are fire-and-forget via background_task.create() and are NOT returned. Since _dispatch_sync() awaits only the returned tasks (await asyncio.gather), the inbound handlers previously had no completion point, so Tuwunel/mautrix homeservers connected and completed initial sync but dispatched zero inbound messages. Fixes #46142. Co-authored-by: Zeheng Huang <153708448+hunjaiboy@users.noreply.github.com>	2026-07-01 04:42:33 -07:00
kshitijk4poor	dc1ea005d9	fix+test(codex): self-persist projected turns; keep agent_persisted=True Follow-up correcting the salvaged fix's persistence approach to avoid a duplicate user-message write (verified via E2E — the #860/#42039 bug class the original diff aimed to avoid). Root cause: in gateway mode the AIAgent is built WITH a session_db, so the inbound user turn is already flushed at turn start (turn_context._persist_session). The original fix returned agent_persisted=False, making the gateway re-write the whole new-message slice via append_to_transcript -> append_message (a raw INSERT with no dedup), duplicating the already-flushed user turn. Corrected approach (single writer): run_codex_app_server_turn now flushes its OWN projected assistant/tool messages via _flush_messages_to_session_db (which dedups the already-persisted user turn through _DB_PERSISTED_MARKER) and returns agent_persisted=True so the gateway skips its write. Net result: session_search/distill see the full codex conversation, each message persisted exactly once. Adds regression coverage asserting exactly-once persistence on a real SessionDB, agent_persisted=True, FTS visibility, and standard-runtime skip-db behaviour preserved. Co-authored-by: Lubos Buracinsky <lubos@komfi.health>	2026-07-01 17:08:59 +05:30
Lubos Buracinsky	5558382457	fix(codex): persist app-server turns to session DB (fixes starved recall) The codex_app_server runtime path (run_codex_app_server_turn in agent/codex_runtime.py) is an early-return that bypasses conversation_loop and never calls _flush_messages_to_session_db(). Meanwhile, gateway/run.py sets: agent_persisted = self._session_db is not None # always True and passes skip_db=agent_persisted to every append_to_transcript call, assuming the agent self-persisted (correct for the standard runtime, wrong for codex). The result: codex turn messages are persisted nowhere. state.db accumulates only session_meta rows; session_search (full-text search over state.db) and conversation-distill are blind to real gateway conversations, causing 'the agent has no memory of what we discussed'. Fix (three-part, all backward-compatible): 1. agent/codex_runtime.py — run_codex_app_server_turn success return now includes 'agent_persisted': False, signalling that the codex path did NOT self-persist its turn. 2. gateway/run.py — the agent_persisted assignment now reads: agent_result.get('agent_persisted', self._session_db is not None) For the standard runtime (which does not set the key) the default (self._session_db is not None) preserves the existing skip-db behaviour so no duplicate-write regression (#860 / #42039) occurs. For the codex runtime the flag is False, so the gateway writes the new turn's messages to state.db and FTS index. 3. gateway/run.py — the rebuilt result dict (run_agent return, which becomes agent_result upstream) now includes agent_persisted passed through from result_holder[0], with a safe True default. Without this passthrough the flag set in step 1 was discarded when the result was reconstructed, causing agent_result.get('agent_persisted', ...) to always see the default True and never write codex turns.	2026-07-01 17:08:59 +05:30
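The defaulting rule in step 2 of this commit can be sketched as a small function. `resolve_agent_persisted` and its `has_session_db` parameter are hypothetical names standing in for the inline expression in `gateway/run.py`.

```python
from typing import Any


def resolve_agent_persisted(agent_result: dict[str, Any], has_session_db: bool) -> bool:
    """Prefer the runtime's explicit flag; otherwise assume the legacy default."""
    # Runtimes that do not set the key keep the old behavior, where the agent
    # is presumed to self-persist whenever a session DB is attached.
    return agent_result.get("agent_persisted", has_session_db)
```

The gateway would then pass the result as `skip_db`, so only a runtime that explicitly reports `False` triggers the gateway-side write.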
srojk34	a76aa6198c	fix(cli): flush un-persisted messages before /resume and /branch end the old session compress_context() and /new already flush un-persisted messages before calling end_session() (fixed in #47202), but /resume and /branch still call end_session() directly. When a turn is interrupted mid-flight and the user immediately runs /resume or /branch, messages generated during that turn have not yet been written to state.db and are silently lost on session rotation. Add the same best-effort _flush_messages_to_session_db() call before end_session() in both _handle_resume_command and _handle_branch_command, mirroring the pattern established in cli.py:new_session(). Regression tests verify the flush is called when an agent is present.	2026-07-01 17:08:55 +05:30
Dutch Dim	154c382d65	fix(gateway): recover from truncated responses	2026-07-01 17:08:50 +05:30
kshitijk4poor	9cf47fef54	fix(auxiliary_client): demote the 2 sibling routing fall-throughs too (review) Phase 2c review flagged that only 2 of the 4 structurally-identical resolve_provider_client routing dead-ends were demoted. Complete the bug-class: also demote+dedup the external-process ('not directly supported') and OAuth ('not directly supported, try auto') fall-throughs, keyed by provider name, so none of the four dead-ends spam WARNING on a retry loop. Add direct tests for the unhandled-auth_type and OAuth dedup paths via a monkeypatched PROVIDER_REGISTRY (the review noted these were unverified). Mutation-checked: reverting either sibling demotion fails its test.	2026-07-01 17:00:30 +05:30
kshitijk4poor	c0d3ceb17e	fix(auxiliary_client): dedup resolve_provider_client fall-through warnings The two fall-through branches in resolve_provider_client (unknown provider, unhandled auth_type) logged at WARNING on every retry of a misconfigured provider, spamming logs during retry loops. Demote both to logger.debug with per-process dedup: the first occurrence still surfaces (a provider-name typo or PROVIDER_REGISTRY/auth_type-drift bug is worth seeing once), while identical repeats are suppressed for the process lifetime. Salvaged from #56283 (extracting only the stated auxiliary_client fix; the original PR also bundled ~2800 lines of unrelated changes across 10 other files, which are dropped).	2026-07-01 17:00:30 +05:30
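Per-process dedup of a repeated log line can be sketched with a module-level set. The logger name, the key shape, and `log_fallthrough_once` are assumptions for illustration, not the code in `auxiliary_client`.

```python
import logging

logger = logging.getLogger("auxiliary_client_sketch")  # hypothetical logger name
_seen_fallthroughs: set[tuple[str, str]] = set()


def log_fallthrough_once(kind: str, provider: str) -> bool:
    """Log a routing dead-end at DEBUG the first time; suppress identical repeats."""
    key = (kind, provider)
    if key in _seen_fallthroughs:
        return False  # already reported during this process lifetime
    _seen_fallthroughs.add(key)
    logger.debug("resolve_provider_client: %s for provider %r", kind, provider)
    return True
```

Keying on both the dead-end kind and the provider name means a second misconfigured provider still gets its own first report.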
kshitijk4poor	fb7a38ad21	fix(macos): compose launchd reload retry with _launchctl_bootstrap + drain-aware window Reworks @valenteff's #53277 fix per review (Teknium's 3 findings): - Route refresh_launchd_plist_if_needed's bootstrap through the existing _launchctl_bootstrap() EIO-recovery helper (canonical since #56256), wrapped in a wall-clock retry loop, instead of an ad-hoc 5x2s loop. - Window sized to agent.restart_drain_timeout (default 180s), not a fixed ~10s: the failure happens while the old gateway is still draining (finding 1). - Retry on subprocess.TimeoutExpired too, not just CalledProcessError — a bootstrap timeout after bootout otherwise escapes and leaves the service unloaded (finding 2). - Confirm success with launchctl list, not a bare bootstrap exit 0 (finding 3); mirror verify+drain-window in the detached-helper bash path. - Shared helpers _launchd_reload_log_path / _append_launchd_reload_log / _launchctl_label_registered / _retry_launchctl_bootstrap_until_registered. 3 new tests cover retry-until-listed, TimeoutExpired-retried, deadline-exhaust. E2E: real reload log + mocked launchctl — retries CalledProcessError+TimeoutExpired, verifies via launchctl list, logs failures.	2026-07-01 16:56:14 +05:30
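The wall-clock retry described above can be sketched with injected callables, assuming `bootstrap` wraps the launchctl call and `is_registered` wraps a `launchctl list` check. All names here are hypothetical rather than the helpers listed in the commit.

```python
import subprocess
import time
from typing import Callable


def retry_until_registered(
    bootstrap: Callable[[], None],
    is_registered: Callable[[], bool],
    window_s: float,
    interval_s: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Retry bootstrap until the service is listed or the window elapses."""
    deadline = time.monotonic() + window_s
    while True:
        try:
            bootstrap()
        except (subprocess.CalledProcessError, subprocess.TimeoutExpired):
            pass  # both failure modes are retried, not propagated
        if is_registered():
            return True  # success is confirmed by listing, not by exit code
        if time.monotonic() >= deadline:
            return False
        sleep(interval_s)
```

Sizing `window_s` to the drain timeout, as the commit does, keeps retries going for as long as the old gateway may still be shutting down.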
Fabio Fernandes Valente	7a7d19e73b	fix(macos): retry launchd reload on transient bootstrap failure refresh_launchd_plist_if_needed ran `launchctl bootout` then `launchctl bootstrap` with errors silenced (`2>/dev/null` in the detached helper, `check=False` in the direct subprocess path). Under high load or a launchd race, the bootout succeeds — removing the service from launchd — but the follow-up bootstrap fails silently. The service stays unregistered; KeepAlive can't revive a service launchd no longer knows about, so the gateway stays dark until a manual `launchctl bootstrap`. Observed incident (2026-06-26): `/restart` in chat triggered a planned drain; during the drain a separate call re-triggered the plist refresh, which bootout'd the live service. Under loadavg 9.48 the bootstrap failed silently — 2h35min offline until manual recovery. Fix: retry the bootstrap up to 5 times with 2s back-off, verify with `launchctl list <label>` afterwards, and log failures to ~/.hermes/logs/launchd-reload.log so the health watchdog can detect a persistent orphan. Mirrors the contract across both the detached helper (refresh inside gateway tree) and the direct subprocess path (refresh from external CLI). Existing tests pass: - test_refresh_defers_reload_when_running_inside_gateway_tree - test_refresh_uses_direct_reload_when_not_inside_gateway_tree Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-07-01 16:56:14 +05:30
kshitij	d4e8c358c0	Merge pull request #56330 from kshitijk4poor/chore/authormap-valenteff chore: add AUTHOR_MAP entry for valenteff (#53277 salvage)	2026-07-01 16:49:16 +05:30
shawchanshek	3b739b990b	fix(title_generator): strip think blocks from LLM output before extracting title Think-enabled models (MiniMax M2.7, DeepSeek, etc.) emit inline <think>...</think> reasoning even for simple prompts like title generation, and the raw XML was leaking into session titles. Route the title-model response through the canonical strip_think_blocks scrubber before cleanup so every tag variant — closed pairs, unterminated blocks, orphan closes, mixed case — is handled, not just a single literal <think> pair. - 2 regression tests: closed <think> pair stripped, unterminated block at start yields no title. Salvaged from PR #44126 by @shawchanshek.	2026-07-01 04:18:48 -07:00
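A rough sketch of the tag variants listed above, assuming three regex passes are enough. This is not the repository's `strip_think_blocks`, which may handle orphan closes and other tags differently.

```python
import re

_CLOSED = re.compile(r"<think>.*?</think>", re.IGNORECASE | re.DOTALL)
_UNTERMINATED = re.compile(r"<think>.*\Z", re.IGNORECASE | re.DOTALL)
_ORPHAN_CLOSE = re.compile(r"</think>", re.IGNORECASE)


def strip_think_sketch(text: str) -> str:
    """Remove think blocks: closed pairs, unterminated opens, stray closes."""
    text = _CLOSED.sub("", text)        # <think>...</think>, any letter case
    text = _UNTERMINATED.sub("", text)  # an open tag that never closes
    text = _ORPHAN_CLOSE.sub("", text)  # a closing tag with no opener
    return text.strip()
```

An unterminated block at the start leaves an empty string, which matches the "yields no title" case in the regression tests.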
kshitij	037e389c4f	Merge pull request #56325 from kshitijk4poor/chore/authormap-session-persist chore: AUTHOR_MAP entries for session-persistence salvage batch	2026-07-01 16:47:13 +05:30
kshitijk4poor	314cf43d50	test(matrix): assert real device_id in query_keys, not just guard-skip Hardens the salvaged #53997 tests per review: the positive-resolution and reconnect-recovery tests now assert query_keys is awaited with the REAL resolved device id ({mxid: [<id>]}) and never [None] — the [null] body the homeserver rejects (the actual bug), plus await_count==2 to prove verification genuinely re-runs after resolution rather than just the flag looking right.	2026-07-01 16:46:40 +05:30
Gary Walker	09dbe76955	fix(matrix): reset _device_id_unverified at start of connect() Per review feedback on #53997 from @teknium1: the flag was set True on failed device_id resolution but never reset, so a same-adapter reconnect that successfully resolves a real device_id would keep skipping server-side key verification indefinitely. Reset now happens at the top of connect(), before resolution runs, so every connect() attempt starts clean. A repeat failure re-sets the flag (unchanged behavior); a recovery correctly clears it. Adds TestDeviceIdRecoveryOnReconnect to cover the transition.	2026-07-01 16:46:40 +05:30
Gary Walker	9048457eab	fix(matrix): device_id fallback prevents E2EE init failure on fresh bot accounts - Resolve device_id via query_keys({mxid: []}) when whoami() returns None - Guard _verify_device_keys_on_server and _reverify_keys_after_upload against None/unverified device_id to prevent 'device_keys values must be a list of strings' serialization failure - Disconnect existing client before reconnect to prevent dual OlmMachine instances on the same crypto store Re-targeted from #39779 (legacy gateway/platforms/matrix.py) onto the migrated plugins/platforms/matrix/adapter.py path following the 2026-06-20 adapter migration. Logic unchanged from original fix. 242 tests passing (233 upstream + 9 new).	2026-07-01 16:46:40 +05:30
SahilRakhaiya05	2d8d08cae6	fix(api-server): require auth for /health/detailed and fail closed on weak keys /health/detailed leaked runtime state (gateway state, connected platforms, active-agent counts, PID, exit reason) with no auth. Gate it behind the same Bearer auth as other API routes; plain /health stays open for liveness probes. Also refuse to start on a placeholder/too-short (<16 char) API_SERVER_KEY regardless of bind address — a guessable key on a terminal-capable endpoint is RCE-adjacent even on loopback, since any local process can reach it. The required-key check was already unconditional; this extends the strength floor to loopback binds too. Startup guards are hoisted above app/background-task creation so a rejected start leaves no partial state. Salvaged from #44073 (external-surface hardening), split into a focused PR per maintainer request. Co-authored-by: Hermes Agent <agent@nousresearch.com>	2026-07-01 04:14:33 -07:00
kshitijk4poor	9c870548e3	chore: add AUTHOR_MAP entry for valenteff (#53277 salvage)	2026-07-01 16:44:07 +05:30
shandian64	5126902f1d	fix(title): honor configured auxiliary timeout	2026-07-01 16:41:43 +05:30
kshitijk4poor	b3f55c2037	chore: add AUTHOR_MAP entries for session-persistence salvage batch Maps the two plain-email contributors whose PRs are being salvaged so contributor_audit.py passes: - info@djimit.nl -> djimit (PR #48034) - lubos@komfi.health -> lubosxyz (PR #49225) The other two PRs in the batch (#50405 sasquatch9818, #48764 srojk34) use users.noreply.github.com emails, which check-attribution auto-skips.	2026-07-01 16:38:56 +05:30
SahilRakhaiya05	5178b3f056	fix(code-exec): bind execute_code tool socket to a per-session RPC token The execute_code sandbox exposed its tool-call RPC (AF_UNIX socket and remote file-poll transports) without any caller check, so any local process that could reach the socket / rpc dir could dispatch terminal-capable tool calls through the parent. Mint a per-session HERMES_RPC_TOKEN, pass it to the sandboxed child, and require a timing-safe match on every request in both _rpc_server_loop and _rpc_poll_loop. Empty/missing/wrong token fails closed. Salvaged from #44073 (per-session RPC token). Added timing-safe secrets.compare_digest comparison and fail-closed regression tests. Co-authored-by: Hermes Agent <agent@nousresearch.com>	2026-07-01 04:08:37 -07:00
Teknium	5de65624d1	fix(moa): capture streamed aggregator output into full-turn traces (#56312) MoA full-turn traces (moa.save_traces) recorded the aggregator's acting output only on the non-streaming path, where it's captured inline at call time. On the streaming path, which every hermes chat --query run and every live gateway/CLI turn takes, the aggregator's raw token stream is handed to the live consumer, so the trace left output=null and only pointed at the session-db assistant row. An offline audit of a benchmark run (HermesBench drives --query) then couldn't see what the aggregator produced without hand-joining to state.db. Capture the resolved streamed acting text at trace-flush time (the agent already holds it in _current_streamed_assistant_text) and fold it into the trace, so the record is self-contained in both modes. New output_location value inline_from_stream marks a streamed turn whose text was captured this way; a genuinely empty acting turn (pure tool call) still points at the session db, matching state.db exactly. Touches only the trace side-channel: no change to the acting path, message history, role alternation, or prompt cache. - agent/moa_loop.py: consume_and_save_trace(..., aggregator_output_fallback) on both the facade and the MoAClient wrapper; prefer inline capture, fall back to the resolved streamed text. - agent/moa_trace.py: embed the fallback; add inline_from_stream location. - agent/conversation_loop.py: pass _current_streamed_assistant_text at flush. - tests: 5 cases across streaming / non-streaming / empty-fallback / no-double-write.	2026-07-01 04:07:46 -07:00
Teknium	81595cd588	fix(dashboard): run plugin gate after auth + enable example fixture Follow-up on the salvaged #47491 commits: - Register _plugin_api_runtime_gate BEFORE the auth middlewares so it executes AFTER them, and add an explicit auth check: unauthenticated requests to /api/plugins/<name>/ fall through to auth's 401 instead of this gate's 404. Prevents the gate from becoming a plugin-name oracle (an unauthenticated caller could otherwise fingerprint installed/enabled plugins by status code). Keeps test_non_kanban_plugin_route_requires_auth green. - Enable the 'example' user plugin in the _install_example_plugin test fixture so the auth / static-asset-allowlist tests still reach the real serving paths now that user plugins are gated on plugins.enabled. - Mark the runtime-gate unit-test scopes as authenticated so they exercise the enabled/disabled policy under the new auth-first ordering.	2026-07-01 04:05:15 -07:00
manusjs	b2e0086f1b	fix(dashboard): enforce plugin disabled gate at request time and for bundled assets Address two residual bypasses identified in review: 1. Add _plugin_api_runtime_gate middleware that checks plugins.enabled/ plugins.disabled on every request to /api/plugins/{name}/... routes. Previously, disabling a plugin at runtime had no effect on its already- mounted API routes until a restart. 2. Extend serve_plugin_asset to check plugins.disabled for bundled plugins. Previously, only user plugins were gated — a bundled plugin in plugins.disabled would still serve assets from the unauthenticated /dashboard-plugins/{name}/... endpoint. Both fixes ensure the enabled/disabled policy is evaluated live at request time, not just at startup. Adds regression tests covering: - Middleware blocks disabled user plugin API routes (404) - Middleware blocks user plugin removed from enabled set (404) - Middleware passes enabled user plugin API routes - Middleware blocks disabled bundled plugin API routes (404) - Bundled plugin assets return 404 when disabled - Bundled plugin assets served normally when not disabled - User plugin asset gating still works correctly	2026-07-01 04:05:15 -07:00
manusjs	7cff95644d	fix(dashboard): gate plugin asset serving and API mount on plugins.enabled User-installed dashboard plugins had their assets served and Python backend code imported without checking the plugins.enabled allowlist. This meant a plugin installed in the plugins directory but not enabled could still execute code at dashboard startup and serve arbitrary files. Changes: - get_dashboard_plugins API: filter out user plugins not in enabled set - serve_plugin_asset: reject requests for disabled/non-enabled user plugins - _mount_plugin_api_routes: skip Python import for non-enabled user plugins - Bundled plugins still load by default but respect explicit disables Fixes #46435	2026-07-01 04:05:15 -07:00
kshitij	8415c4703a	Merge pull request #56317 from kshitijk4poor/chore/authormap-bitcryptic chore: add AUTHOR_MAP entry for bitcryptic-gw (#53997 salvage)	2026-07-01 16:33:47 +05:30
Tao Chen	d3c8667462	fix(slack): authorize bot/workflow senders before the no-user-id guard Slack Workflow Builder posts (and other app/bot messages) arrive as subtype=bot_message with user=None. _is_user_authorized rejected them at the `if not user_id: return False` guard, which runs before the #4466 {PLATFORM}_ALLOW_BOTS bypass — so @mentioning the bot from a Slack workflow silently did nothing, even with SLACK_ALLOW_BOTS (or SLACK_ALLOW_ALL_USERS) set. The chat-scoped allowlist for Telegram/QQ already runs before that guard for the same reason (channel broadcasts with no from_user); Slack was both missing from the bot-bypass map and had the bypass running too late. - gateway/authz_mixin: move the {PLATFORM}_ALLOW_BOTS bypass ahead of the no-user-id guard and add Platform.SLACK -> SLACK_ALLOW_BOTS. - plugins/platforms/slack/adapter: set is_bot=True on inbound bot_message events so the gateway can identify workflow/app senders (they carry no user_id to match against the allowlist). Tested: new tests/gateway/test_slack_bot_auth_bypass.py plus the existing Discord/Feishu bot-auth and gateway authz/gating suites all pass.	2026-07-01 16:32:32 +05:30
kshitijk4poor	fcbf850f33	chore: add AUTHOR_MAP entry for bitcryptic-gw (#53997 salvage)	2026-07-01 16:28:15 +05:30
teknium1	27347b2239	fix(gateway): align resume safety-net note with canonical recovery wording Follow-up on the salvaged resume_pending fix: the empty-turn safety net now emits the same reason-aware recovery note as the _is_resume_pending branch (reason phrase + 'session restored' guidance + no-re-execute instruction) instead of a second, differently-worded note. Also adds the AUTHOR_MAP entry for the salvaged commit.	2026-07-01 03:57:44 -07:00
Adam Chiaravalle	c2db3ed7d8	fix(gateway): recover resume_pending sessions instead of sending a blank turn A session interrupted by a gateway restart is flagged resume_pending and auto-continued on startup via _schedule_resume_pending_sessions(), which dispatches an empty-text internal MessageEvent. The recovery system note that should fill that empty turn is gated, in _run_agent(), on _interruption_is_fresh — the age of the LAST PERSISTED TRANSCRIPT ROW. For an active thread returned to after >1h of silence, that transcript clock is stale even though the interruption (last_resume_marked_at) is seconds old. The gate evaluates False, the note is not prepended, and the model receives a genuinely blank user turn — replying with confused 'that message came through blank' noise. Fix (two parts, both default-on, behavior unchanged for healthy turns): 1. resume_pending freshness now also considers last_resume_marked_at (the restart watchdog's own stamp). The branch fires when EITHER the transcript clock OR the resume mark is fresh, so the startup scheduler's freshness decision and the per-turn injection agree. 2. Empty-turn safety net: if the user turn is still blank after all injections AND the session is resume_pending, backfill a recovery note so a blank turn can never reach the model. Scoped to resume_pending so ordinary empty turns (e.g. uncaptioned image) are untouched. Adds 3 regression tests; the two core ones fail on the pre-fix logic.	2026-07-01 03:57:44 -07:00
teknium1	d1d1d81900	fix(gateway): repair sibling tests + harden _adapter_for_source after fail-closed flip Follow-up to the salvaged fail-closed defaults. The own-policy default flip (open -> pairing) and the email dispatch-level deny broke sibling tests across the suite that relied on the old fail-open behavior: - test_email.py: dispatch-mechanics tests now opt into EMAIL_ALLOW_ALL_USERS (they test formatting/attachments/threading, not authz); the two auth contract tests are rewritten to assert the new fail-closed behavior (no allowlist + no allow-all => sender dropped at the adapter). - test_whatsapp_cloud.py / test_whatsapp_formatting.py / test_whatsapp_from_owner.py: autouse fixture opts into WHATSAPP_ALLOW_ALL_USERS so dm_policy: open dispatch-mechanics tests still flow (open now requires an explicit allow-all opt-in, SECURITY.md 2.6). - _adapter_for_source: use getattr for source.platform/profile so bare SimpleNamespace test fixtures without .profile don't crash the busy/queue ingress path (AGENTS.md pitfall #17). Full tests/gateway/ + yuanbao pipeline: 8555 passed, 0 failed.	2026-07-01 03:56:28 -07:00
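
Note on commit 5178b3f056 above (per-session RPC token): the message describes minting a token per session, handing it to the sandboxed child, and requiring a timing-safe match on every request, with empty, missing, or wrong tokens failing closed. A minimal sketch of that check follows. It is not the repository's code; the function names `mint_session_token` and `token_matches` are hypothetical, and only `HERMES_RPC_TOKEN` and the use of `secrets.compare_digest` come from the commit message.

```python
import secrets
from typing import Optional


def mint_session_token() -> str:
    """Hypothetical helper: create a fresh random token for one session.

    The parent would export this to the child (the commit names the
    variable HERMES_RPC_TOKEN); how it is passed is not shown here.
    """
    return secrets.token_urlsafe(32)


def token_matches(expected: Optional[str], presented: object) -> bool:
    """Return True only when the presented token equals the expected one.

    Fails closed: an unset expected token, a missing or empty presented
    token, or a non-string value are all rejected before comparison.
    """
    if not expected:
        return False
    if not isinstance(presented, str) or not presented:
        return False
    # Compare as bytes so non-ASCII input cannot raise TypeError, and use
    # a constant-time comparison so timing does not leak a matching prefix.
    return secrets.compare_digest(
        expected.encode("utf-8"), presented.encode("utf-8")
    )
```

A request handler would call `token_matches(session_token, request.get("token"))` before dispatching any tool call and drop the request when it returns False; the `request` dict shape is likewise an assumption for illustration.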

14093 commits