hermes-agent

Author	SHA1	Message	Date
Teknium	522a5e93b2	chore(release): map x9x9x9x9x9x91 for #49247 salvage	2026-07-01 02:18:56 -07:00
itenev	f981d47cb0	fix(gateway): prevent Discord disconnects from blocking event loop models_dev.py's fetch uses a synchronous requests.get(timeout=15). Called from the async gateway message handlers, it blocked the event loop for up to 15s, starving Discord heartbeats and causing ClientConnectionResetError disconnects. Adds get_model_context_length_async() which offloads the entire sync resolution chain to a worker thread via asyncio.to_thread(), and switches the two async gateway call sites (_prepare_inbound_message_text, _handle_message_with_agent) to await it. The loop stays responsive; the sync path remains the single source of truth for the cache. Salvaged from PR #22753 by @itenev. Follow-up: dropped the unused fetch_models_dev_async/lookup_models_dev_context_async aiohttp variants from the original PR (dead code with zero callers that had drifted from the sync cache logic) — the to_thread wrapper already runs the sync path off-loop, so they were redundant.	2026-07-01 02:17:35 -07:00
Teknium	ea533e7f41	chore(release): map justin-cyhuang contributor email for #31960 salvage	2026-07-01 02:12:25 -07:00
Teknium	259e6b87a7	fix(teams-pipeline): reject dot-only recording display_name Path(raw).name reduces '..'/'.'/'' to themselves, so basename extraction alone still let a Graph-provided display_name of '..' or '../' escape the temp recording directory (tmp_dir / '..' resolves to the parent). Reject the dot-only basenames explicitly and fall back to the artifact id. Extends @outsourc-e's regression coverage with the dot-only cases.	2026-07-01 02:03:48 -07:00
teknium1	6d30f8c0ab	chore: add AUTHOR_MAP entry for PR #52534 salvage (@qWaitCrypto)	2026-07-01 02:03:40 -07:00
teknium1	49cb06c07a	chore(release): map sasquatch9818 for PR #41198 salvage	2026-07-01 01:54:45 -07:00
kshitijk4poor	58ea7f9071	chore(release): map claudlos contributor email for #52351 salvage	2026-07-01 14:23:01 +05:30
Teknium	f70abae606	chore(release): map kernel-t1 for .env sanitizer salvage (#41349 )	2026-07-01 01:50:32 -07:00
teknium1	db2ac840c1	chore(release): map kyzcreig@gmail.com in AUTHOR_MAP	2026-07-01 01:44:40 -07:00
kshitijk4poor	843a3be7d6	chore(attribution): map baris@writeme.com -> isair for salvaged #50124	2026-07-01 14:09:15 +05:30
Teknium	a56bfeb2cb	chore(release): map approval-bypass PR contributors AUTHOR_MAP entries for the salvaged shell-bypass fixes: xy200303 (#40663), YLChen-007 (#26965), egilewski (co-author #40663). necoweb3 (#55653) already mapped.	2026-07-01 01:39:10 -07:00
teknium1	907cbba885	chore(release): add Vesna-9 to AUTHOR_MAP for #41274 salvage	2026-07-01 01:38:59 -07:00
teknium1	5e64dd9a98	chore: map charleneleong84 email to AUTHOR_MAP for #11736 salvage	2026-07-01 01:36:34 -07:00
Teknium	12556a9a77	chore(scripts): drop Open WebUI local bootstrap script (#56178 ) Remove scripts/setup_open_webui.sh and its 'one-command local bootstrap' doc sections (EN + zh-Hans). The script pip-installed the third-party Open WebUI frontend into ~/.local and managed a launchd/systemd user service — a maintenance liability for downstream software we don't own, and the source of the LAN first-admin signup footgun in #36121. The Open WebUI integration via the OpenAI-compatible API server is unaffected: the Docker/Docker-Compose setup, multi-user profile guide, and troubleshooting in open-webui.md stay, and Open WebUI remains a listed supported frontend. Only the install-and-service bootstrapper is gone.	2026-07-01 01:30:40 -07:00
teknium1	80d0ff8da5	chore: add AUTHOR_MAP entry for PR #40978 salvage (@friendshipisover)	2026-07-01 01:27:26 -07:00
ryo-solo	d578b6165d	fix(api_server): pop fallback model kwarg to prevent AIAgent collision When the primary provider's auth fails (expired token / 429 quota cap), _resolve_runtime_agent_kwargs() falls through to the fallback provider chain, whose runtime dict carries its own 'model' key. api_server's _create_agent then did AIAgent(model=model, **runtime_kwargs), colliding on 'model' and 500ing every /v1/chat/completions request while a fallback was active. Pop the runtime model and let it override the config model, mirroring the native gateway path (_resolve_session_agent_runtime). Salvaged from #35716 by @ryo-solo (earliest submitter); the PR's second half (Mistral reasoning_content strip) is already handled on main and dropped. Co-authored-by: Hermes Agent <noreply@nousresearch.com>	2026-07-01 01:26:27 -07:00
teknium1	ce9d180a94	chore: add redactdeveloper to AUTHOR_MAP for PR #36897 salvage	2026-07-01 01:25:43 -07:00
teknium1	081c91c147	chore: add AUTHOR_MAP entry for PR #40773 salvage (rrevenanttt)	2026-07-01 01:25:24 -07:00
petrichor-op	f2a528fb59	fix(agent): never persist empty-response recovery scaffolding Ephemeral empty-response/prefill recovery scaffolding (the synthetic assistant "(empty)" turn, the user nudge, the terminal "(empty)" sentinel, and the thinking-only prefill placeholder) exists only to drive the next API retry; the in-memory loop pops it before appending the real response. The append-only flush did not mirror that, so a mid-turn persist could commit scaffolding to the SQLite session store (and JSON log), and a resumed session would replay synthetic "(empty)"/nudge turns as genuine context — re-poisoning the empty-retry boundary forever. Filter ephemeral scaffolding at both durable-write sites (_flush_messages_to_session_db + _save_session_log), by flag not position, so buried scaffolding (an answered nudge leaves the synthetic pair mid-list) is skipped too. Covers all three flags including _thinking_prefill. Adapted onto current main's identity-tracking flush. Cherry-picked from #41281 by petrichor-op.	2026-07-01 01:08:27 -07:00
teknium1	cf427ccf08	chore: add AUTHOR_MAP entry for PR #35130 salvage (@jnibarger01)	2026-07-01 01:05:28 -07:00
teknium1	deb4629764	chore: add AUTHOR_MAP entry for PR #30491 salvage (MattKotsenas)	2026-07-01 01:02:23 -07:00
teknium1	7136b5382a	chore: add JustinOhms to release AUTHOR_MAP for PR #24469 salvage	2026-07-01 00:45:31 -07:00
teknium1	3aebdb1d23	chore: add AUTHOR_MAP entry for PR #22523 salvage (@H2KFORGIVEN)	2026-07-01 00:27:09 -07:00
Teknium	8d78be5460	revert: back out prompt_caching.enabled toggle (#56105 ) for re-evaluation (#56126 ) * Revert "fix(caching): honor prompt_caching.enabled across model switch + fallback" This reverts commit `36f9f50145`. * Revert "fix: allow disabling prompt caching" This reverts commit `c1c1a12fe6`.	2026-07-01 00:20:32 -07:00
teknium1	36f9f50145	fix(caching): honor prompt_caching.enabled across model switch + fallback @janrenz's PR #35862 added prompt_caching.enabled=false at init only. But _anthropic_prompt_cache_policy re-derives _use_prompt_caching on every /model switch (agent_runtime_helpers) and fallback-model swap (chat_completion_helpers), which re-enabled markers and re-broke the strict proxy the toggle was meant to fix. Move the kill switch into anthropic_prompt_cache_policy so it returns (False, False) on every path. Drop the now-redundant init-time override (kept @janrenz's isinstance hardening on the cache_ttl read). Add policy-level tests + docs for the toggle. Follow-up to salvaged PR #35862.	2026-07-01 00:10:42 -07:00
syahidfrd	0198713c33	fix(security): reuse auth chain when tagging unverified senders in Slack threads Mitigates indirect prompt injection (CWE-863) in Slack thread context. When the bot is mentioned mid-thread for the first time, _fetch_thread_context pulls the full thread via conversations.replies and prepends every reply to the LLM prompt. Replies from senders not on the allowlist were rendered identically to authorised senders, letting a third party in a shared channel inject instructions the model might act on when answering the next authorised message. - BasePlatformAdapter.set_authorization_check / _is_sender_authorized, registered by GatewayRunner._make_adapter_auth_check() with a closure over the existing _is_user_authorized chain (platform/global/group allowlists, allow-all flags, pairing store all stay the single source of truth — no env-var re-parsing). - Tags non-bot thread messages whose sender fails the auth check with an [unverified] prefix; strengthens the header with soft guidance only when at least one unverified message is present, so setups without an allowlist see no behaviour change. - Wired into all three adapter-init sites in run.py (start, reconnect watcher, restart) so the reconnect path is covered too. Softened wording: adapted from the original [untrusted] tag to [unverified] and non-accusatory header framing — the label reflects allowlist status, not a judgment about the person. Adapter relocated to plugins/platforms/slack/ since the PR was authored. Salvaged from #17059.	2026-06-30 18:05:43 -07:00
teknium1	7cb85733b8	chore(release): add AUTHOR_MAP entries for #54609 , #54912 salvage	2026-06-30 17:45:45 -07:00
teknium1	698c287fd0	chore(release): add AUTHOR_MAP entry for CRWuTJ (PR #17082 salvage)	2026-06-30 17:39:30 -07:00
teknium1	7de485703b	fix(gateway): preserve media + reply payload when /queue defers a turn /queue rebuilt the queued MessageEvent with only text/type/source/ message_id/channel_prompt, silently dropping any photo, document, voice, or reply context attached to the command. The deferred turn then ran with the attachment lost. Carry the full payload through, and accept a /queue that has media but no prompt text (e.g. "/queue" as an image caption). Salvaged from #13913 by @ypwcharles — the gateway busy-session/queue infrastructure was rewritten since that PR (Telegram moved to plugins/platforms/, /queue now uses the FIFO chain), so the media fix is reimplemented against the current handler; the PR's batching and busy-bypass changes targeted code paths that no longer exist. Co-authored-by: ypwcharles <92324143+ypwcharles@users.noreply.github.com>	2026-06-30 17:32:35 -07:00
Scott Gabel	4a7a6fd401	fix(approval): redact secrets in user-facing approval prompts The dangerous-command approval prompt renders the flagged command so the user can decide whether to approve. If the agent constructed it with a credential (curl -H 'Authorization: Bearer sk-...', psql postgres://user:pw@host, an execute_code script with api_key = 'sk-...'), that secret hit stdout and, via the gateway notify payload, Discord/Slack messages — which are screenshottable and forwardable. Apply the existing agent.redact.redact_sensitive_text() to every user-facing approval surface. Redaction is display-only: the raw command still executes after approval, and approval persistence keys off pattern_key (not the command text), so the allowlist is unaffected. Decision context (URL, flags, command structure) is preserved; only the secret value masks. Covers all surfaces, including the execute_code path the original PR missed: - prompt_dangerous_approval(): callback + stdout fallback - check_all_command_guards(): gateway approval_data + cron/batch pending fallback - check_execute_code_guard(): gateway approval_data + no-notifier pending fallback (script body can embed credentials) Adds TestApprovalPromptRedaction covering callback redaction, no-over-redaction of clean commands, and the execute_code pending fallback. Salvaged from PR #13139 by @sgabel; extended to the execute_code surface.	2026-06-30 17:29:11 -07:00
teknium1	caa2034f88	chore(release): map codexGW noreply email for PR #12302 salvage	2026-06-30 16:38:31 -07:00
teknium1	638d2e7bfc	fix(memory/holographic): apply FTS5 sanitizer to search_facts sibling The store-level search_facts() shared the same raw-MATCH bug class as _fts_candidates (FTS5 AND-joins tokens, zeroing prose recall). Route it through FactRetriever._sanitize_fts_query via a lazy import to keep the store->retrieval layering acyclic. Also add cyb3rwr3n to release AUTHOR_MAP.	2026-06-30 15:55:11 -07:00
teknium1	86200e7583	chore(release): map kyssta-exe id-prefixed noreply email for PR #55657 salvage	2026-06-30 15:49:36 -07:00
teknium1	0b3752eede	chore(release): add AUTHOR_MAP entry for lEWFkRAD (#53848 salvage)	2026-06-30 12:07:01 -07:00
xxxigm	a40f22798e	fix(installer): reset managed clone when ff-only pull fails Bootstrap and desktop updates run install.ps1/install.sh, which aborted with exit 128 when the managed checkout had diverged from origin/main. Mirror the hermes update recovery path: reset to origin/$BRANCH instead of failing the repository stage.	2026-06-30 20:11:01 +07:00
Neo	c969090878	fix(cli): clear input-blocking overlays when interrupting a running agent Interrupting the agent while an approval/clarify/sudo/secret prompt is up left the overlay state dict set with no thread servicing it. The prompt's worker thread is torn down on interrupt, but read_only (gated on _command_running) plus the keypress filter kept the CLI input locked until the prompt's own timeout expired — the terminal appeared frozen. Drain and clear all four input-blocking overlays on interrupt via a single helper (_clear_active_overlays_for_interrupt): approval -> deny, clarify/sudo/secret -> cancel, each guarded so a dead queue can't block the others; sudo restores the pre-modal draft. Wired into all three interrupt paths — new-message interrupt, Ctrl+C, and Ctrl+Q. Blocking overlays now clear AND fall through so one keypress both clears a stale overlay and interrupts a still-running agent; the /model picker and slash-confirm foreground prompts keep their cancel-and-return behavior. Closes #13618.	2026-06-30 04:49:29 -07:00
teknium1	8e6fd4cfa6	chore(release): add AUTHOR_MAP entry for londo161 (#15795 salvage)	2026-06-30 04:38:43 -07:00
teknium1	6148a9a3fe	chore(release): map nnnet author email for PR #25142 salvage	2026-06-30 04:23:03 -07:00
fayenix	d6c53dcdcb	fix(gateway): stop per-turn agent-cache eviction from model + message_id signature churn Two independent bugs evicted the cached gateway AIAgent on every turn, preventing the prompt cache from ever warming: 1. Model normalization mismatch: the post-run fallback-eviction check compared _agent.model (stripped in AIAgent.__init__) against the raw _resolve_gateway_model() config string. For vendor-prefixed config on native providers (e.g. 'deepseek/deepseek-v4-pro' vs 'deepseek-v4-pro') this was always unequal, so the agent was evicted after every successful run. Normalize _cfg_model the same way (skip aggregators). 2. Discord triggering message_id leaked into the cached system prompt via build_session_context_prompt()'s Discord IDs block. message_id changes every turn, so the agent-cache signature (computed from the ephemeral prompt) changed every Discord turn -> rebuild every message. The id is now injected per-turn into the user message (where per-turn content belongs and does not touch the cache signature); the cached IDs block carries a static pointer to it, preserving reply/react/pin via the discord tools. Adapted from #28846. Bug #1 fix is the contributor's; bug #2 reworked to be non-destructive (keeps the triggering-id capability instead of deleting it). Redundant auto-reset eviction (already on main via #9893/#48031) and the wrong-premise reset_context_note plumbing from the original PR were dropped. Co-authored-by: Hermes Agent <hermes@nousresearch.com>	2026-06-30 04:22:41 -07:00
Zane Ding	ac380050ea	fix(credential-pool): distinguish OpenRouter upstream 429s from account 429s OpenRouter returns 429 in two shapes: an account-level throttle on the user's key, and an upstream-provider throttle (DeepSeek/Anthropic/etc. rate-limiting OpenRouter's aggregate traffic). The classifier treated both identically and rotated/exhausted OPENROUTER_API_KEY on every 429 — burning the key for ~24min and silently disabling auxiliary features (compression, summarization, vision) on an upstream throttle where the key was healthy. Add a FailoverReason.upstream_rate_limit classified from OpenRouter's unambiguous wrapper message "Provider returned error" (the same signal the metadata-raw parser already trusts). Recovery skips credential rotation and defers to the fallback chain to switch models instead. Co-authored-by: Hermes Agent <127238744+teknium1@users.noreply.github.com>	2026-06-30 03:57:14 -07:00
Teknium	abca77615a	chore(release): map Jeffgithub0029 author email for #28558 salvage	2026-06-30 03:51:08 -07:00
teknium1	c510f48680	chore(release): add jasonQin6 to AUTHOR_MAP for PR #15093 salvage	2026-06-30 03:42:25 -07:00
teknium1	2ae9e222f0	chore: AUTHOR_MAP entry for PR #27123 salvage (jimmyjohansson84)	2026-06-30 03:42:20 -07:00
teknium1	ea95fdd6d7	chore(release): add nikshepsvn to AUTHOR_MAP for PR #27426 salvage	2026-06-30 03:41:46 -07:00
Kong	6d6702ef50	fix(whatsapp-bridge): clarify FIFO outbound-id tracker semantics Rename LRU/refresh wording to match Set insertion-order eviction and reject non-positive maxSize at construction time.	2026-06-30 03:41:43 -07:00
Keira Voss	db52ad0f07	fix(whatsapp): gate owner-typed forwards on customer chatId allowlist The opt-in WHATSAPP_FORWARD_OWNER_MESSAGES path in bot mode marks fromMe inbound messages as fromOwner: true and forwards them to the Python adapter so plugins can detect "owner just typed in this chat" and trigger handover / sliding TTL flows. The previous implementation bypassed the allowlist for that path: the existing allowlist gate at the bottom of the dispatch loop is guarded by !msg.key.fromMe, so any chat the operator happened to reply to was forwarded — even ones not on WHATSAPP_ALLOWED_USERS. Concretely, on a deployment with a single allowlisted customer, an owner reply in any other chat would still wake Hermes and let the gateway-policy plugin's owner-implicit branch create a stray handover row keyed by the non-allowlisted chatId. Fix: extract the bot-mode fromMe gate into a small pure helper (`owner_message_gate.js`) that returns one of {drop_echo, drop_disabled, drop_allowlist, forward_owner, pass} so the new allowlist branch can be unit-tested without spinning up Baileys. The check runs against the customer chatId (not senderId, which is the owner's own number/LID and won't be on the allowlist by construction). matchesAllowedUser already short-circuits true on an empty allowlist or "*", so deployments without an allowlist see no behavior change. Self-chat mode is untouched — its existing isSelfChat pin is the correct guard there. Tests: scripts/whatsapp-bridge/owner_message_gate.test.mjs covers echo drop, disabled drop, the new allowlist drop, the forward path, the open-allowlist short-circuit, and the precedence of echo/disabled checks over the allowlist check (so logs stay honest).	2026-06-30 03:41:43 -07:00
keiravoss94	84f350efe0	feat(whatsapp): opt-in forwarding of owner-typed messages in bot mode In `WHATSAPP_MODE=bot` the bridge currently drops every fromMe inbound message — they are all assumed to be echoes of our own /send calls. That makes it impossible for plugins / agents to detect when a human owner has typed directly into a customer chat from the same WhatsApp Business account (e.g. via a linked phone or WhatsApp Web). This adds an opt-in `WHATSAPP_FORWARD_OWNER_MESSAGES` env var. When true, the bridge classifies fromMe inbound by looking up `key.id` in a bounded LRU of recently-sent message IDs (the existing 50-entry echo suppressor, bumped to 512 and extracted to a testable `outbound_ids.js` helper). Hits in the LRU are still dropped (echoes); misses are forwarded to the Python adapter with `fromOwner: true`. The Python adapter lifts that flag onto `MessageEvent.metadata["whatsapp_from_owner"]`. `metadata` is a new free-form dict on the event so future per-platform signals don't each need their own field. Default behaviour is unchanged: with the env flag unset, bot mode still drops every fromMe message exactly as before. Use cases for downstream consumers: - Implicit handover activation when the owner replies manually - Sliding TTL on owner activity (keep an active session alive while the owner is engaged) - Audit trails of owner interventions - Analytics on human-vs-bot reply ratios Heuristic limitation (documented in code): the LRU is in-memory. After a bridge restart, in-flight delivery receipts of pre-restart sends will briefly look like owner-typed for a few seconds until the set is repopulated. Persisting isn't worth the disk churn — downstream consumers should treat the flag as best-effort. Tests: - tests/gateway/test_whatsapp_from_owner.py (new): adapter sets the metadata flag iff the bridge payload has `fromOwner: true`; absent otherwise. - scripts/whatsapp-bridge/outbound_ids.test.mjs (new): LRU bounds, eviction order, falsy-id handling. Backwards compatibility: with the env flag unset, every code path is identical to before. No existing deployment is affected.	2026-06-30 03:41:43 -07:00
teknium1	3ecc58a8da	chore: map trevorgordon981 in AUTHOR_MAP for #50590 co-authorship	2026-06-30 03:27:41 -07:00
teknium1	bf2dc18f84	test+chore: real-path regression test for #15157 model_extra guard + AUTHOR_MAP Adds tests/agent/test_model_extra_type_guard.py exercising the real ChatCompletionsTransport.normalize_response path with string/list/None/dict model_extra; adds the AUTHOR_MAP entry for the contributor.	2026-06-30 03:27:12 -07:00
Tao Yan	b8ebe32866	fix(agent): flatten multi-part user_message in codex intermediate-ack detector Vision requests routed through the OpenAI-compat API server forward the raw multi-part content list ([{type:"text"}, {type:"image_url"}, ...]) straight through as user_message. The codex intermediate-ack detector flattened it with (user_message or "").strip(), so a truthy list survived and .strip() raised AttributeError — killing any Codex-routed vision turn that took the require_workspace path. Route through the existing _summarize_user_message_for_log helper (which already backs the logging/banner previews on main), and widen the param type hint from str to Any to match how the function is actually called. The two logging-preview sites the original PR also touched were fixed independently on main by the conversation-loop refactor. Co-authored-by: Hermes Agent <agent@nousresearch.com>	2026-06-30 03:20:11 -07:00

1 2 3 4 5 ...

1382 commits