Where does Hermes Agent store its memory, and will I lose it when I migrate to a Mac?

Core state lives in the ~/.hermes/ directory, including state.db and Markdown memory files. Pack that directory before migration and restore it on the new machine.

Should I pick a 16GB or 32GB Mac Mini M4 for Hermes?

Gateway plus cloud API only: start with 16GB. If you plan local Hermes-3 inference and a large long-lived Skill library, 32GB is the safer choice.

Is renting a Mac Mini or buying one cheaper over 24 months?

Over a 24-month horizon, rental usually gives more predictable OpEx and upgrade flexibility. Buying makes sense when you already know you need three or more years of dedicated use and can absorb depreciation.

Three Months of Hermes Agent on a VPS — Why I Switched to a Rented Mac Mini M4: 2026 UMA & Cost Guide

Three months on a VPS: what worked, and what pushed me to switch

In February 2026, Nous Research open-sourced Hermes Agent on GitHub and it spread fast. This is not a chat sidebar that forgets you between sessions. Hermes lives on your machine, remembers preferences across conversations, and turns repeated workflows into reusable Skill documents. I deployed it on a typical 4 vCPU / 8 GB Linux VPS: one curl install, Gateway always on, Telegram as the control plane. For the first two weeks it felt like magic.

By month three, three problems stacked on top of each other. Latency first. From my phone, a simple instruction to the moment Hermes started executing often took 200–400 ms round trip. Long tasks with multiple tool calls made that gap obvious. Cost second. The VPS itself was cheap, but OpenRouter and other cloud APIs bill per token. As Skills multiplied and automation ran more often, the monthly API line item grew steeper than I had modeled. Platform gap third. I wanted to try local Hermes-3 inference and wire in Xcode-side scripts. On Linux that meant Docker workarounds and hours of debugging for things that are one-liners on native macOS.

The breaking point was a maintenance window. The cloud provider rebooted my instance. Gateway came back, but a snapshot rollback had touched the disk where ~/.hermes/state.db lived. Episodic retrieval quality dropped for roughly two weeks of accumulated context. That was when I stopped treating Hermes as "can it run?" and started asking "can it run continuously, with memory that compounds?" The answer, for my use case, was a rented Mac Mini M4 running 24/7 on my desk network — silent, native, and easy to back up.

None of this means VPS deployment is wrong. It is the fastest way to validate Gateway wiring, channel tokens, and your first handful of Skills. The timeline below is what I actually experienced; your mileage will vary with provider, region, and how aggressively you automate.

01
Weeks 1–2: VPS was enough for Gateway + Telegram. Ideal for "get it running and poke at it."
02
Weeks 3–4: Skill library grew; disk I/O and SQLite FTS queries slowed. I started manually trimming logs.
03
Month 2: API spend tracked automation frequency almost linearly. Long-context jobs hurt the most.
04
Late month 2: Tried local model paths; x86 VPS has no Metal, so inference stayed on remote API.
05
Month 3: Provider reboot plus snapshot rollback broke memory continuity. Started evaluating a dedicated Mac.
06
After the switch: Desktop Mac ran silent 24/7; Telegram latency became negligible; ~/.hermes backed up with Time Machine.

If you are still in weeks 1–2, stay on the VPS and iterate. If you are past week four and Hermes is part of daily operations, start pricing a Mac path now. The migration itself is a few hours once you respect the backup step — the expensive mistake is discovering memory loss after you have already deleted the old instance.

What Hermes Agent actually consumes: Gateway, Skills, and ~/.hermes

Before swapping hardware, map the workload. Nous Research docs and community write-ups agree: Hermes gets smarter over time because three layers stay resident. The Gateway process bridges Telegram, Discord, Slack, and 20+ other channels. The Skill library stores reusable procedures as Markdown. The memory layer under ~/.hermes/ holds SOUL.md, MEMORY.md, USER.md, and an FTS5-indexed state.db.

Your box is not running a one-off Python script. It is doing network I/O, spawning tool subprocesses, serving full-text and vector retrieval, and — if you enable it — local LLM inference at the same time. On my VPS, Gateway alone sat at 300–600 MB RAM. Once Skills passed a few dozen files and session history bloated, random disk I/O became the bottleneck before CPU did. When state.db crossed 2 GB, retrieval latency moved from milliseconds to hundreds of milliseconds. That is the difference between "instant recall" and "wait, let me search."

Hermes also assumes a stable filesystem. Cloud snapshots, noisy neighbors on shared VPS hosts, and aggressive log rotation all interact badly with a growing SQLite database. macOS on dedicated hardware gives you predictable local storage, native launchd service management, and backup tooling that understands user home directories. For an agent whose value is accumulated memory, that operational simplicity matters as much as raw GHz.

Dimension	Linux VPS (8 GB plan I used)	Rented Mac Mini M4 (16 GB)
Install path	Works, but macOS-specific scripts need workarounds	Official curl one-liner, launchd daemon
Local Hermes-3 / Metal	Not available	UMA + Neural Engine path for expansion
Memory directory backup	DIY rsync or snapshot policy	Time Machine / external copy of ~/.hermes
24/7 power and noise	Datacenter — invisible to you	Desktop-silent (~5–8 W idle class)
24-month hardware cost	Low base rent + volatile API spend	Fixed monthly OpEx, easier to forecast

"Hermes Agent's moat is not one brilliant reply. It is memory and Skills compounding over months — and hardware's job is to keep that compounding online without dragging retrieval."

Treat Gateway uptime and state.db integrity as first-class SLOs, not afterthoughts. If retrieval slows, you will stop trusting the agent even when the model is fine. That trust erosion is harder to fix than swapping a VPS tier.

M4 unified memory in practice: choosing 16 GB vs 32 GB

Apple Silicon unified memory (UMA) puts CPU, GPU, and Neural Engine in one high-bandwidth pool. For agent workloads that matters: when Gateway pulls a local model, tensors are not copied back and forth between "system RAM" and "VRAM" the way they are on many x86 + discrete GPU setups. Nous tuned Hermes-3 with Atropos RL for tool use and long tasks. If you mostly route through OpenRouter or another cloud API, a 16 GB M4 is usually enough for Gateway, browser tools, and a moderate state.db.

Move to 32 GB when you plan local Hermes-3 weights, multiple concurrent channel sessions, or heavy code-sandbox tooling at once. That is not luxury spec — it is headroom for SQLite growth and model KV cache. On a 16 GB machine I logged a week of production use: Gateway idle around 400 MB, peaks near 12 GB when local inference overlapped with large Skill retrieval. Headroom determines whether macOS kills side processes or whether Hermes keeps running cleanly.

UMA also simplifies capacity planning. On a VPS you might upgrade vCPU and RAM independently and still hit disk limits. On M4 you are really asking one question: how much unified memory do I need for Gateway + retrieval + optional local inference? Answer that honestly and the rest of the stack falls into place.

bash

# Official install on macOS (after your rental machine is ready)
curl -fsSL https://get.hermes-agent.org | bash

# On the old VPS before migration — pack memory
tar czf hermes-backup.tgz -C ~ .hermes

# On the new Mac — restore and restart Gateway
tar xzf hermes-backup.tgz -C ~
# Follow the setup wizard for channel tokens and start the service

warning

Warning: Do not wipe the old machine until ~/.hermes/ is backed up and verified on the new host. Skills and episodic memory live in that directory. Cloud API keys cannot reconstruct it.

After restore, run a few retrieval-heavy prompts you know from the old environment. If episodic answers reference the right past tasks, your migration succeeded. If not, stop and diff the tarball contents before you decommission the VPS.

24-month TCO: buy a Mac Mini M4 vs rent one for Hermes

The table below mixes qualitative judgment with order-of-magnitude numbers you can sanity-check against your region. Exact rental quotes change; see the Mac Mini rental rates page for current SKUs. Buying front-loads CapEx and leaves you with depreciation, power, and repair. Renting converts that into OpEx and keeps an upgrade path when the next M-series drop arrives.

API spend sits outside both columns. Hermes does not replace your model provider bill — it changes how predictable the surrounding infrastructure feels. My VPS looked cheap until API usage climbed; a fixed Mac rental line made monthly planning simpler even when tokens still varied.

Cost line (24 months)	Buy Mac Mini M4 (16 GB)	Rent Mac Mini M4
Up-front cash	Large one-time hardware outlay	Low deposit / steady monthly fee
Depreciation and refresh	Two-year-old M chip may feel dated by year three	Contract end often allows spec swap
Ops time	You own repairs, moves, desk or rack setup	Provider handles swap and baseline ops
Hermes fit	Best — native macOS	Same native path, good for try-before-buy
Who it suits	Already committed to 3+ years exclusive use	Validate agent workflows before capital lock-in

info

Tip: Teams running separate Hermes instances for dev, staging, and personal use can scale rentals per node instead of buying three Mac Minis that may sit idle between releases.

Buying wins when you have already amortized Apple hardware across other work — Xcode builds, design tools, local inference experiments — and Hermes is one more always-on service on a box you would own anyway. Renting wins when Hermes is the primary reason you need macOS 24/7 and you are not ready to bet on a three-year depreciation curve for a single agent experiment.

Reference specs and decision summary

Memory path: Default ~/.hermes/; upstream docs emphasize on-device storage with no telemetry upload (MIT license).
Channel scale: Gateway supports Telegram, Discord, Slack, and 20+ platforms; run it on a machine that stays awake — not a laptop that sleeps when you close the lid.
RAM guidance: Cloud API + Gateway only → start at 16 GB. Local inference + large Skill library → 32 GB is safer (community M4 UMA experience, not a vendor benchmark).
Install command: curl -fsSL https://get.hermes-agent.org | bash (macOS / Linux / WSL2; this article focuses on native macOS).

Looking back, the VPS was the right sandbox. Once Hermes became something I talked to every day — not a weekend project — the gaps added up: no Metal for local models, fragile memory on shared cloud disks, API bills that spiked with every new Skill. Buying a Mac Mini outright is valid if you already know the workload lasts years. For most solo builders, rent a Mac Mini M4, run Hermes 24/7 for a quarter, then decide whether to buy is the lowest-risk path.

If you also need iOS builds, Xcode automation, or SSH access for teammates on the same machine, squeezing everything onto a small VPS or a personal laptop introduces sleep, neighbor noise, and incomplete signing environments. For production setups that need Hermes Agent always on with a native macOS toolchain, NodeMini Mac Mini cloud rental is usually less painful than "good enough Linux VPS + remote API" — you focus on agent compounding, not 2 a.m. Gateway recovery.

Start with the backup command in section 03, migrate on a quiet weekday, and keep the old VPS running read-only for 48 hours as a fallback. Hermes rewards continuity. Give it hardware that treats memory as an asset, not a disposable cache, and the three-month frustration curve flattens fast.

FAQ

Frequently asked questions

Everything important lives under ~/.hermes/, including state.db and Markdown memory files. Before migration run tar czf hermes-backup.tgz -C ~ .hermes, restore on the new Mac, then verify retrieval. Before ending a rental, export the directory and wipe the device per your provider's policy.

NodeMini offers dedicated Mac Mini rentals billed monthly or quarterly. Current models and live pricing are on the rental rates page. Hermes model API charges remain separate and bill through whichever provider you configure.

Yes — for example a VPS can handle lightweight webhooks while the Mac runs Gateway and memory. For lowest latency and the official install path, keep Gateway and ~/.hermes on the same host. More setup questions are covered in the help center.

Three Months of Hermes Agent on a VPS, Why I Switched to a Rented Mac Mini M4

Three months on a VPS: what worked, and what pushed me to switch

What Hermes Agent actually consumes: Gateway, Skills, and ~/.hermes

M4 unified memory in practice: choosing 16 GB vs 32 GB

24-month TCO: buy a Mac Mini M4 vs rent one for Hermes

Reference specs and decision summary

Frequently asked questions

Three Months of Hermes Agent on a VPS,
Why I Switched to a Rented Mac Mini M4