hermes-agent

Author	SHA1	Message	Date
kshitij	7b45a22ddf	Merge pull request #56382 from kshitijk4poor/chore/authormap-gromykoss-56372 chore: add Gromykoss to AUTHOR_MAP for PR #56372 salvage	2026-07-01 17:59:06 +05:30
Steve Lawton	c73e74386b	feat(vertex): add Google Vertex AI provider for Gemini (OAuth2) Adds Vertex AI as a first-class provider for Gemini models via Vertex's OpenAI-compatible endpoint. Vertex authenticates with short-lived OAuth2 access tokens (service-account JSON or ADC), not a static API key — the missing piece behind the recurring requests (#13484, #12639, #56259). - agent/vertex_adapter.py: OAuth2 token minting + refresh-on-expiry (5-min margin), ADC->service-account fallback, global vs regional endpoint URLs. Config precedence: env var > config.yaml > default. - plugins/model-providers/vertex/: provider profile (auth_type=vertex), reuses Gemini's extra_body.google.thinking_config translation. - runtime_provider: vertex short-circuit BEFORE the credential pool so a credentials-file path is never mistaken for a static API key; mints a fresh token + computes base_url per resolve. - run_agent + conversation_loop: _try_refresh_vertex_client_credentials() re-mints the token and rebuilds the client on a mid-session 401, so a long-lived gateway agent survives token expiry (~1h). - auxiliary_client: vertex auth_type branch for side-LLM tasks. - config.yaml: vertex.project_id / vertex.region (non-secret, bridged to env); credential path stays in .env (VERTEX_CREDENTIALS_PATH). - setup wizard + model picker: dedicated _model_flow_vertex; curated google/gemini-* model list; --provider choices. - pricing/metadata: Vertex prices off the gemini docs snapshot; endpoint host auto-maps to the vertex provider (no probe spam). - lazy_deps + pyproject [vertex] extra: google-auth, opt-in only. - docs: guides/google-vertex.md + providers page; tests for adapter + runtime resolution. Salvages and modernizes #8427 by @slawt onto current main: rewired from the legacy PROVIDER_REGISTRY path to the provider-profile architecture, moved non-secret config out of .env into config.yaml, and added the per-turn 401 token-refresh the original lacked.	2026-07-01 05:25:33 -07:00
kshitijk4poor	0ebbfbcc84	chore: add Gromykoss to AUTHOR_MAP for PR #56372 salvage Maps gromyko.ss83@gmail.com -> Gromykoss so the contributor-attribution audit passes for the #56372 salvage (context_compressor END MARKER reorder).	2026-07-01 17:52:57 +05:30
teknium1	2f167a2b84	fix: comment accuracy + AUTHOR_MAP for salvaged PR #50204 - Correct the exit-75 comment: Hermes-generated units set StartLimitIntervalSec=0 (rate limiting disabled), so StartLimitBurst does not bound loops. The real bound is that genuine crashes exit non-zero-but-not-75, and RestartForceExitStatus=75 only whitelists the planned code. - Add randomuser2026x AUTHOR_MAP entry (CI blocks unmapped emails).	2026-07-01 05:13:03 -07:00
Brett	9f03095044	fix(telegram): cap initialize() with per-attempt timeout so unreachable fallback IPs can't hang startup Wrap each Telegram initialize() attempt in asyncio.wait_for(HERMES_TELEGRAM_INIT_TIMEOUT, default 30s). When api.telegram.org and all fallback IPs are unreachable, the connect chain has no outer bound, so a single initialize() blocks for minutes and the retry-on-exception loop never fires — the gateway appears to hang after the banner. The timeout guarantees each attempt is bounded, then retries with backoff, then fails with an actionable error. Also adds WARNING-level progress logs before DoH discovery and each connect attempt (visible at default log level). Salvaged onto plugins/platforms/telegram/adapter.py (Telegram moved from gateway/platforms/ since the PR was opened). Adds env var to docs + AUTHOR_MAP. Co-authored-by: Hermes Agent <127238744+teknium1@users.noreply.github.com>	2026-07-01 05:07:10 -07:00
teknium1	050e602de3	chore(release): map HODLCLONE author for PR #49351 salvage	2026-07-01 05:06:00 -07:00
teknium1	cfbc7ed1f9	fix(browser): narrow credential-query denylist to unambiguous names Follow-up on the salvaged #49830 hardening. The contributor's sensitive query-param set included bare English words (code, key, auth, session, sig) that double as ordinary page facets — ?code= on promo/challenge pages, ?key= as a search facet, ?session= on blogs — so web_extract and cloud browser_navigate would refuse a large slice of normal browsing. Narrow the set to unambiguously credential-named params (access_token, authorization, client_secret, password, token, x-amz-signature, ...). Prefix-based vendor-key redaction (is_safe_url) still catches recognizable key shapes; this set is the belt-and-suspenders for opaque secrets carried under an explicit credential-named parameter. Also fixes two intra-PR-staleness test breakages surfaced by salvaging onto current main: - web_extract_tool() no longer accepts use_llm_processing= (signature changed since the PR was authored) — dropped the invalid kwarg. - agent.redact now fully masks keyed 'token=<secret>' to 'token=***' instead of partial 'sk-...'; the console-redaction test now asserts the real invariant (secret body gone) rather than the exact mask format. Added a regression test that generic English-word query params are NOT blocked by the credential guard.	2026-07-01 05:04:41 -07:00
teknium1	3b41df6d46	test(gateway): regression for multi-profile node symlink leak; AUTHOR_MAP Add tmp_path symlink regression tests for both generate_systemd_unit and generate_launchd_plist (~/.local/bin/node -> profile node install must not leak the profile target into the generated unit PATH). Register jearnest11's AUTHOR_MAP entry for the salvage cherry-pick.	2026-07-01 04:57:21 -07:00
teknium1	55c8b2c81f	chore(release): add AUTHOR_MAP entry for udatny (#29433 salvage)	2026-07-01 04:55:15 -07:00
Zeheng Huang	4c2c54c78c	fix(matrix): await inbound sync handlers Register the Matrix room-message, reaction, and invite handlers with mautrix's wait_sync=True. mautrix's handle_sync() only returns the tasks for handlers registered as sync-awaited; non-waited handlers are fire-and-forget via background_task.create() and are NOT returned. Since _dispatch_sync() awaits only the returned tasks (await asyncio.gather), the inbound handlers previously had no completion point, so Tuwunel/ mautrix homeservers connected and completed initial sync but dispatched zero inbound messages. Fixes #46142. Co-authored-by: Zeheng Huang <153708448+hunjaiboy@users.noreply.github.com>	2026-07-01 04:42:33 -07:00
kshitij	d4e8c358c0	Merge pull request #56330 from kshitijk4poor/chore/authormap-valenteff chore: add AUTHOR_MAP entry for valenteff (#53277 salvage)	2026-07-01 16:49:16 +05:30
shawchanshek	3b739b990b	fix(title_generator): strip think blocks from LLM output before extracting title Think-enabled models (MiniMax M2.7, DeepSeek, etc.) emit inline <think>...</think> reasoning even for simple prompts like title generation, and the raw XML was leaking into session titles. Route the title-model response through the canonical strip_think_blocks scrubber before cleanup so every tag variant — closed pairs, unterminated blocks, orphan closes, mixed case — is handled, not just a single literal <think> pair. - 2 regression tests: closed <think> pair stripped, unterminated block at start yields no title. Salvaged from PR #44126 by @shawchanshek.	2026-07-01 04:18:48 -07:00
kshitijk4poor	9c870548e3	chore: add AUTHOR_MAP entry for valenteff (#53277 salvage)	2026-07-01 16:44:07 +05:30
kshitijk4poor	b3f55c2037	chore: add AUTHOR_MAP entries for session-persistence salvage batch Maps the two plain-email contributors whose PRs are being salvaged so contributor_audit.py passes: - info@djimit.nl -> djimit (PR #48034) - lubos@komfi.health -> lubosxyz (PR #49225) The other two PRs in the batch (#50405 sasquatch9818, #48764 srojk34) use users.noreply.github.com emails, which check-attribution auto-skips.	2026-07-01 16:38:56 +05:30
kshitij	8415c4703a	Merge pull request #56317 from kshitijk4poor/chore/authormap-bitcryptic chore: add AUTHOR_MAP entry for bitcryptic-gw (#53997 salvage)	2026-07-01 16:33:47 +05:30
Tao Chen	d3c8667462	fix(slack): authorize bot/workflow senders before the no-user-id guard Slack Workflow Builder posts (and other app/bot messages) arrive as subtype=bot_message with user=None. _is_user_authorized rejected them at the `if not user_id: return False` guard, which runs before the #4466 {PLATFORM}_ALLOW_BOTS bypass — so @mentioning the bot from a Slack workflow silently did nothing, even with SLACK_ALLOW_BOTS (or SLACK_ALLOW_ALL_USERS) set. The chat-scoped allowlist for Telegram/QQ already runs before that guard for the same reason (channel broadcasts with no from_user); Slack was both missing from the bot-bypass map and had the bypass running too late. - gateway/authz_mixin: move the {PLATFORM}_ALLOW_BOTS bypass ahead of the no-user-id guard and add Platform.SLACK -> SLACK_ALLOW_BOTS. - plugins/platforms/slack/adapter: set is_bot=True on inbound bot_message events so the gateway can identify workflow/app senders (they carry no user_id to match against the allowlist). Tested: new tests/gateway/test_slack_bot_auth_bypass.py plus the existing Discord/Feishu bot-auth and gateway authz/gating suites all pass.	2026-07-01 16:32:32 +05:30
kshitijk4poor	fcbf850f33	chore: add AUTHOR_MAP entry for bitcryptic-gw (#53997 salvage)	2026-07-01 16:28:15 +05:30
teknium1	27347b2239	fix(gateway): align resume safety-net note with canonical recovery wording Follow-up on the salvaged resume_pending fix: the empty-turn safety net now emits the same reason-aware recovery note as the _is_resume_pending branch (reason phrase + 'session restored' guidance + no-re-execute instruction) instead of a second, differently-worded note. Also adds the AUTHOR_MAP entry for the salvaged commit.	2026-07-01 03:57:44 -07:00
teknium1	49a87bcd1e	chore(release): map SahilRakhaiya05 contributor email for #44073 salvage	2026-07-01 03:56:28 -07:00
Swissly	242c9639a8	fix(cron): prevent multi-target delivery loop crash on per-target failure The standalone thread-pool fallback in _deliver_result() runs inside the `except RuntimeError:` block (taken when asyncio.run() sees a running loop). When future.result() raised there (SMTP ConnectionError, timeout, etc.), the exception was NOT caught by the sibling `except Exception:` — it escaped _deliver_result() and crashed the whole delivery loop, silently skipping every remaining target. Multi-target delivery (e.g. deliver: 'email:a,email:b') is a documented feature, so this broke a promised contract. Wrap the fallback in its own try/except so a per-target failure is logged with exc_info and the loop continues to the next target. Fixes #47163	2026-07-01 03:48:37 -07:00
teknium1	5c2dccd06f	chore(release): map kangsoo-bit author for PR #47508 salvage	2026-07-01 03:42:32 -07:00
teknium1	9dd6451c80	chore(release): add WXBR to AUTHOR_MAP for #46183 salvage	2026-07-01 03:34:49 -07:00
YLChen-007	e23f723389	fix: make streaming reasoning-tag filter case-insensitive The streaming think-tag suppressors in cli.py (_stream_delta) and gateway/stream_consumer.py (_filter_and_accumulate) matched tag names with case-sensitive str.find(), so only the exact-case literals in the tag tuples were caught. Mixed-case variants a model may emit — <Think>, <ThInK>, <REASONING>, <Thought> — slipped through and leaked raw reasoning into the user-visible stream. Match against a lowercased view of the buffer with lowercased tag names at all three sites (open-tag boundary search, partial-tag hold-back, close-tag search) in both paths. Only KNOWN tag names are matched — no substring matching — and the block-boundary gating that protects prose mentions of <think> is preserved. - 6 parametrized case-insensitive regression tests in each of tests/gateway/test_stream_consumer.py and tests/cli/test_stream_delta_think_tag.py. Salvaged from PR #27289 by @YLChen-007.	2026-07-01 03:25:02 -07:00
teknium1	1c350728ec	chore(release): map Lazymonter into AUTHOR_MAP for PR #42914 salvage	2026-07-01 03:21:20 -07:00
Teknium	2b8adb8683	chore(release): map tgmerritt author for PR #43553 salvage	2026-07-01 03:17:48 -07:00
testingbuddies24	e07768a53f	fix(gateway): strip orphan think-tag close tags in progressive stream When a model emits an inline <think>...</think> block but the opening tag is dropped upstream (thinking-mode toggle, truncated stream, or incomplete upstream filtering), the bare </think> close tag leaked through to the user in the live progressive edit. The agent-side final scrubber (agent/think_scrubber.py) already had _strip_orphan_close_tags; this ports the same logic into GatewayStreamConsumer so the streaming display stays clean too. - _filter_and_accumulate: strip orphan close tags before appending the 'no-opening-tag' branch text to _accumulated. - _flush_think_buffer: same on stream end for held-back partials. - 14 regression tests (TestStripOrphanCloseTags): all 6 close-tag variants, multi-tag, partial-tag-untouched, trailing whitespace, and end-to-end through _filter_and_accumulate / _flush_think_buffer. Only strips KNOWN close-tag names (case-insensitive) — never arbitrary tag-shaped substrings — so comparison operators and unrelated prose are preserved. Salvaged from PR #43192 by @testingbuddies24.	2026-07-01 03:04:01 -07:00
Wing Huang	828f33e6b1	fix(ci): map contributor email for attribution check scripts/release.py AUTHOR_MAP is greped by the Contributor Attribution Check to resolve a commit author's email -> GitHub username. Add huangsen365@gmail.com -> huangsen365 so this PR's commits pass the check. (This commit originally also carried a gateway race-test flake fix; that edit is now dropped because main independently hardened the same test with a superior server._sessions snapshot/restore isolation, making ours redundant.)	2026-07-01 02:51:45 -07:00
teknium1	d15a288812	chore(release): map arthurzhang author for PR #34718 salvage	2026-07-01 02:45:22 -07:00
mrparker0980	10a54ccc2c	fix(security): anchor @file context refs to canonical read deny-list `@file` / `@folder` context-reference expansion enforced its own narrow deny-list (`_ensure_reference_path_allowed` in `agent/context_references.py`) that only covered `~/.ssh` keys, a handful of shell dotfiles, `~/.hermes/.env`, and `skills/.hub`. It never blocked the credential stores that the canonical read guard (`agent/file_safety.get_read_block_error`) protects: provider API keys (`~/.hermes/auth.json`), Anthropic OAuth tokens (`~/.hermes/.anthropic_oauth.json`), MCP OAuth material (`~/.hermes/mcp-tokens/`), webhook HMAC secrets, and project-local `.env` files. This matters because the messaging gateway feeds untrusted remote text straight into reference expansion: `gateway/run.py` calls `preprocess_context_references_async(..., allowed_root=_msg_cwd)` where `_msg_cwd` defaults to the operator's HOME when `TERMINAL_CWD` is unset. A chat peer (Telegram/Discord/Slack/...) could send `@file:~/.hermes/auth.json`, pass the `allowed_root` check (it resolves under HOME), slip past the narrow list, and have the operator's live keys read into the agent's context — where the model would typically echo or act on them. Rather than duplicate and re-sync a second secret list, this routes the guard through the existing single source of truth. A reviewer might ask "why not just add `auth.json` to the local list?" — because the local list has already drifted once (a prior commit had to add `.config/gh`); anchoring to `get_read_block_error` means every future addition there protects this path too. The narrow checks are kept as a fallback since they also cover dirs that guard does not (`.aws`, `.gnupg`, `.kube`, etc.), and the canonical lookup is wrapped so it can never crash reference expansion. N/A - [x] 🔒 Security fix - `agent/context_references.py`: `_ensure_reference_path_allowed` now also consults `agent.file_safety.get_read_block_error` after its existing checks and refuses the reference when that canonical guard flags the resolved path. The lookup is wrapped so guard-resolution failures fall back to the explicit checks instead of breaking expansion. - `tests/agent/test_context_references.py`: added `test_blocks_canonical_read_denylist_credential_stores`, asserting that `@file` attaches for `auth.json`, `.anthropic_oauth.json`, `mcp-tokens/`, and a project-local `.env` are all refused and their secret bodies never reach the expanded message. - `scripts/release.py`: added the contributor email to `AUTHOR_MAP` (release gate). 1. `scripts/run_tests.sh tests/agent/test_context_references.py` — all 15 tests pass, including the new credential-store case. 2. Regression proof: stash `agent/context_references.py`, run the suite with `-- -k canonical`, and confirm the new test fails (secrets leak into the message) without the fix; restore and confirm it passes. 3. `ruff check agent/context_references.py tests/agent/test_context_references.py` and `python scripts/check-windows-footguns.py agent/context_references.py tests/agent/test_context_references.py` both pass. - [x] I've read the Contributing Guide - [x] My commit messages follow Conventional Commits (`fix(scope):`, etc.) - [x] I searched for existing PRs to make sure this isn't a duplicate - [x] My PR contains only* changes related to this fix (plus the AUTHOR_MAP release gate) - [x] I've run the test suite for the touched area and all tests pass - [x] I've added tests for my changes (required for bug fixes) - [x] I've tested on my platform: macOS 15 (Darwin 25.5) - [x] I've updated relevant documentation (README, `docs/`, docstrings) — or N/A - [x] I've updated `cli-config.yaml.example` if I added/changed config keys — or N/A - [x] I've updated `CONTRIBUTING.md` or `AGENTS.md` if I changed architecture or workflows — or N/A - [x] I've considered cross-platform impact (Windows, macOS) — or N/A - [x] I've updated tool descriptions/schemas if I changed tool behavior — or N/A	2026-07-01 02:43:49 -07:00
kshitijk4poor	8f1d22d7ed	chore(release): map r266-tech contributor noreply email for #55780 salvage	2026-07-01 15:01:33 +05:30
teknium1	116a63d3a0	chore(release): map jcjc81 + Tranquil-Flow in AUTHOR_MAP for #54947 cluster salvage	2026-07-01 02:29:24 -07:00
Teknium	522a5e93b2	chore(release): map x9x9x9x9x9x91 for #49247 salvage	2026-07-01 02:18:56 -07:00
itenev	f981d47cb0	fix(gateway): prevent Discord disconnects from blocking event loop models_dev.py's fetch uses a synchronous requests.get(timeout=15). Called from the async gateway message handlers, it blocked the event loop for up to 15s, starving Discord heartbeats and causing ClientConnectionResetError disconnects. Adds get_model_context_length_async() which offloads the entire sync resolution chain to a worker thread via asyncio.to_thread(), and switches the two async gateway call sites (_prepare_inbound_message_text, _handle_message_with_agent) to await it. The loop stays responsive; the sync path remains the single source of truth for the cache. Salvaged from PR #22753 by @itenev. Follow-up: dropped the unused fetch_models_dev_async/lookup_models_dev_context_async aiohttp variants from the original PR (dead code with zero callers that had drifted from the sync cache logic) — the to_thread wrapper already runs the sync path off-loop, so they were redundant.	2026-07-01 02:17:35 -07:00
Teknium	ea533e7f41	chore(release): map justin-cyhuang contributor email for #31960 salvage	2026-07-01 02:12:25 -07:00
Teknium	259e6b87a7	fix(teams-pipeline): reject dot-only recording display_name Path(raw).name reduces '..'/'.'/'' to themselves, so basename extraction alone still let a Graph-provided display_name of '..' or '../' escape the temp recording directory (tmp_dir / '..' resolves to the parent). Reject the dot-only basenames explicitly and fall back to the artifact id. Extends @outsourc-e's regression coverage with the dot-only cases.	2026-07-01 02:03:48 -07:00
teknium1	6d30f8c0ab	chore: add AUTHOR_MAP entry for PR #52534 salvage (@qWaitCrypto)	2026-07-01 02:03:40 -07:00
teknium1	49cb06c07a	chore(release): map sasquatch9818 for PR #41198 salvage	2026-07-01 01:54:45 -07:00
kshitijk4poor	58ea7f9071	chore(release): map claudlos contributor email for #52351 salvage	2026-07-01 14:23:01 +05:30
Teknium	f70abae606	chore(release): map kernel-t1 for .env sanitizer salvage (#41349 )	2026-07-01 01:50:32 -07:00
teknium1	db2ac840c1	chore(release): map kyzcreig@gmail.com in AUTHOR_MAP	2026-07-01 01:44:40 -07:00
kshitijk4poor	843a3be7d6	chore(attribution): map baris@writeme.com -> isair for salvaged #50124	2026-07-01 14:09:15 +05:30
Teknium	a56bfeb2cb	chore(release): map approval-bypass PR contributors AUTHOR_MAP entries for the salvaged shell-bypass fixes: xy200303 (#40663), YLChen-007 (#26965), egilewski (co-author #40663). necoweb3 (#55653) already mapped.	2026-07-01 01:39:10 -07:00
teknium1	907cbba885	chore(release): add Vesna-9 to AUTHOR_MAP for #41274 salvage	2026-07-01 01:38:59 -07:00
teknium1	5e64dd9a98	chore: map charleneleong84 email to AUTHOR_MAP for #11736 salvage	2026-07-01 01:36:34 -07:00
Teknium	12556a9a77	chore(scripts): drop Open WebUI local bootstrap script (#56178 ) Remove scripts/setup_open_webui.sh and its 'one-command local bootstrap' doc sections (EN + zh-Hans). The script pip-installed the third-party Open WebUI frontend into ~/.local and managed a launchd/systemd user service — a maintenance liability for downstream software we don't own, and the source of the LAN first-admin signup footgun in #36121. The Open WebUI integration via the OpenAI-compatible API server is unaffected: the Docker/Docker-Compose setup, multi-user profile guide, and troubleshooting in open-webui.md stay, and Open WebUI remains a listed supported frontend. Only the install-and-service bootstrapper is gone.	2026-07-01 01:30:40 -07:00
teknium1	80d0ff8da5	chore: add AUTHOR_MAP entry for PR #40978 salvage (@friendshipisover)	2026-07-01 01:27:26 -07:00
ryo-solo	d578b6165d	fix(api_server): pop fallback model kwarg to prevent AIAgent collision When the primary provider's auth fails (expired token / 429 quota cap), _resolve_runtime_agent_kwargs() falls through to the fallback provider chain, whose runtime dict carries its own 'model' key. api_server's _create_agent then did AIAgent(model=model, **runtime_kwargs), colliding on 'model' and 500ing every /v1/chat/completions request while a fallback was active. Pop the runtime model and let it override the config model, mirroring the native gateway path (_resolve_session_agent_runtime). Salvaged from #35716 by @ryo-solo (earliest submitter); the PR's second half (Mistral reasoning_content strip) is already handled on main and dropped. Co-authored-by: Hermes Agent <noreply@nousresearch.com>	2026-07-01 01:26:27 -07:00
teknium1	ce9d180a94	chore: add redactdeveloper to AUTHOR_MAP for PR #36897 salvage	2026-07-01 01:25:43 -07:00
teknium1	081c91c147	chore: add AUTHOR_MAP entry for PR #40773 salvage (rrevenanttt)	2026-07-01 01:25:24 -07:00
petrichor-op	f2a528fb59	fix(agent): never persist empty-response recovery scaffolding Ephemeral empty-response/prefill recovery scaffolding (the synthetic assistant "(empty)" turn, the user nudge, the terminal "(empty)" sentinel, and the thinking-only prefill placeholder) exists only to drive the next API retry; the in-memory loop pops it before appending the real response. The append-only flush did not mirror that, so a mid-turn persist could commit scaffolding to the SQLite session store (and JSON log), and a resumed session would replay synthetic "(empty)"/nudge turns as genuine context — re-poisoning the empty-retry boundary forever. Filter ephemeral scaffolding at both durable-write sites (_flush_messages_to_session_db + _save_session_log), by flag not position, so buried scaffolding (an answered nudge leaves the synthetic pair mid-list) is skipped too. Covers all three flags including _thinking_prefill. Adapted onto current main's identity-tracking flush. Cherry-picked from #41281 by petrichor-op.	2026-07-01 01:08:27 -07:00

1 2 3 4 5 ...

1413 commits