Hermes Agent Setup Guide: Self-Improving AI Assistant on Your Own Server
Step-by-step guide to installing Hermes Agent on a Linux VPS with OpenRouter free models. Covers configuration, Telegram and Discord integration, memory, skills, voice mode, and how it compares to OpenClaw.
I’ve been testing Hermes Agent from Nous Research alongside my OpenClaw setup and OpenFang instance for about a week. The thing that hooked me is the learning loop. Hermes creates skills from tasks it completes, improves those skills during later use, and remembers who you are across sessions. It also migrates your existing OpenClaw config, memories, and skills with a single command.
Hermes Agent GitHubWhat this guide covers
- Installing Hermes Agent on a Linux VPS via the one-line installer
- Configuring OpenRouter with free models (zero API cost)
- Setting up Telegram, Discord, and other messaging platforms
- Memory system with MEMORY.md, USER.md, and session search
- Skills that the agent creates and improves on its own
- Voice mode for CLI and messaging platforms
- Scheduled tasks with natural-language cron jobs
- How Hermes Agent differs from OpenClaw, nanobot, and the rest
If you’re comparing self-hosted AI assistant options, our OpenClaw alternatives roundup covers several projects including NanoClaw, nanobot, PicoClaw, ZeroClaw, NullClaw, and OpenFang. For security considerations when running any of these, see the OpenClaw security guide. For the best web UIs and dashboards to manage your Hermes Agent from a browser, see the best Hermes dashboards roundup.
What Hermes Agent actually is
Nous Research built Hermes Agent as what they call a “self-improving” AI assistant. That label actually means something here: the agent watches what it does, extracts reusable skills from complex tasks, and refines those skills the next time it runs them. Most other assistants in this category forget everything between sessions. Hermes carries forward what it learned.
The architecture:
You (CLI / Telegram / Discord / Slack / WhatsApp / Signal / Email)
↓
Hermes Gateway (single background process)
↓
AI Agent (session store, memory, skills, tools)
↓
LLM Provider (OpenRouter, Nous Portal, OpenAI, Anthropic, Ollama, custom)
↓
40+ Tools (terminal, web search, browser, file editing, code execution, TTS)
Messages arrive from whatever platform you use. The gateway routes them through a per-chat session store, the agent processes them with access to 40+ built-in tools, and everything runs from a single process. No microservices, no database server.
How it compares to OpenClaw and others
| Feature | Hermes Agent | OpenClaw | nanobot | OpenFang |
|---|---|---|---|---|
| Built by | Nous Research | Community | HKUDS | RightNow AI |
| Language | Python | TypeScript | Python | Rust |
| Install | curl one-liner | curl one-liner | pip | curl one-liner |
| Learning loop | Yes (skill creation + improvement) | No | No | No |
| Channels | 12 (Telegram, Discord, Slack, WhatsApp, Signal, SMS, Email, Home Assistant, Mattermost, Matrix, DingTalk, CLI) | Telegram, WhatsApp, Slack, Discord | Telegram, Discord, WhatsApp, Slack, Feishu, DingTalk, Email, QQ | 40 adapters |
| Memory | MEMORY.md + USER.md + FTS5 session search + Honcho user modeling | File-based + semantic search | Built-in | SQLite + vector |
| Skills | Auto-created from experience, self-improving, Skills Hub | Community skills | Built-in | Agent templates |
| Voice mode | CLI mic + messaging TTS + Discord voice channels | No | No | No |
| Terminal backends | Local, Docker, SSH, Daytona, Singularity, Modal | Local | Local | Local, Docker |
| Scheduled tasks | Natural-language cron with platform delivery | Cron | Cron | Cron + autonomous Hands |
| OpenClaw migration | Built-in (hermes claw migrate) | N/A | No | No |
| Personality system | SOUL.md + 14 built-in presets + custom | System prompt | System prompt | System prompt |
| License | MIT | MIT | MIT | MIT |
The learning loop is the real differentiator. After you ask Hermes to do something complex (say, deploy a Docker service), it extracts that workflow into a skill. Next time you ask for something similar, it uses and refines that skill. The FTS5 session search means it can recall details from conversations weeks ago. And voice mode in Discord voice channels is something none of the other projects offer.
OpenClaw still has the larger community, more third-party dashboards, and a longer track record. If OpenClaw already works for you, the migration command makes switching painless whenever you’re ready.
Installation
The one-line installer handles Python, Node.js, dependencies, and the hermes command. Works on Linux, macOS, and WSL2.
The recommended approach. No prerequisites except git.
curl -fsSL https://raw.githubusercontent.com/NousResearch/hermes-agent/main/scripts/install.sh | bashAfter it finishes, reload your shell:
source ~/.bashrc # or source ~/.zshrcVerify the install:
hermes --version If you prefer to manage Python yourself:
# Make sure you have Python 3.11+
python3 --version
# Create a virtual environment (recommended)
python3 -m venv ~/.hermes-venv
source ~/.hermes-venv/bin/activate
# Install
pip install hermes-agent
# For voice mode support
pip install hermes-agent[voice]
# For all optional features
pip install hermes-agent[all] For contributors or if you want the latest development version:
git clone https://github.com/NousResearch/hermes-agent.git
cd hermes-agent
git submodule update --init mini-swe-agent
curl -LsSf https://astral.sh/uv/install.sh | sh
uv venv venv --python 3.11
source venv/bin/activate
uv pip install -e ".[all,dev]"
uv pip install -e "./mini-swe-agent" Installing on a Hetzner VPS
If you want Hermes running 24/7, a cheap VPS does the job. I use a Hetzner CX22 (2 vCPU, 4GB RAM) for €3.99/month.
Get Started with Hetzner
Get €20 credit, Hostinger VPS when you sign up through our referral link. That covers about 5 months of running Hermes Agent.
SSH into your server and run:
ssh root@YOUR_SERVER_IP
apt update && apt upgrade -y
curl -fsSL https://raw.githubusercontent.com/NousResearch/hermes-agent/main/scripts/install.sh | bash
source ~/.bashrc
Setting up OpenRouter with free models
OpenRouter routes to 200+ models through a single API key. They also have a free tier with several models at zero cost, which is the cheapest way to get started.
Get your API key
- Go to openrouter.ai and create an account
- Navigate to Keys in your dashboard
- Click Create Key and copy the key
Configure Hermes Agent
You can configure interactively or by editing the config files directly.
Interactive setup (recommended):
hermes setup
This walks you through model provider, API key, tools, and platform configuration. When it asks for a provider, select OpenRouter and paste your key.
Manual setup:
Add your API key to ~/.hermes/.env:
echo "OPENROUTER_API_KEY=sk-or-v1-your-key-here" >> ~/.hermes/.env
Set the model in ~/.hermes/config.yaml:
model:
provider: "openrouter"
default: "openrouter/free"
The openrouter/free model routes to the best available free model. You can also specify a model directly:
hermes config set model openrouter/free
Switch models on the fly
You don’t need to restart anything to change models. Use the interactive picker:
hermes model
Or switch mid-conversation with a slash command:
/model openrouter/free
Using paid models later
When you’re ready to move beyond free models, just change the model name:
hermes config set model anthropic/claude-sonnet-4.6
# or
hermes config set model openai/gpt-5.4
OpenRouter handles the routing. Your API key stays the same.
First conversation
Start chatting:
hermes
You’ll see a welcome banner showing your model, available tools, and skills. Type a message and press Enter:
❯ What can you help me with?
Some things to try right away:
❯ What's my disk usage? Show the top 5 largest directories.
❯ Search the web for the latest Docker release and summarize it.
❯ Create a Python script that monitors CPU usage and alerts me above 80%.
The agent runs terminal commands, searches the web, reads and edits files, and executes code. No extra configuration needed.
Useful slash commands
Type / to see an autocomplete dropdown. The ones you’ll use most:
| Command | What it does |
|---|---|
/help | Show all available commands |
/model | Switch models interactively |
/tools | List available tools |
/personality pirate | Try a fun personality |
/new | Start a fresh conversation |
/save | Save the conversation |
/compress | Compress context when it gets long |
/usage | Show token usage for this session |
/skills | Browse installed skills |
/voice on | Enable voice mode |
Multi-line input and interrupts
Press Alt+Enter or Ctrl+J to add a new line. Useful for pasting code blocks or writing detailed prompts.
If the agent is taking too long, type a new message and press Enter — it interrupts the current task and switches to your new instructions. Ctrl+C also works.
Resume a session
When you exit, Hermes prints a resume command:
hermes --continue # Resume the most recent session
hermes -c # Short form
Memory system
Hermes has three layers of memory that work together.
MEMORY.md — Agent’s personal notes
The agent writes its own notes about your environment, conventions, and things it learned. Stored in ~/.hermes/memories/MEMORY.md. Limited to 2,200 characters (~800 tokens) to keep the system prompt bounded.
The agent manages this automatically. When it discovers that your server runs Ubuntu 22.04 with Docker installed, it saves that. When it learns you prefer concise responses, it saves that too. You don’t need to tell it to remember — it watches for useful facts and stores them.
USER.md — Your profile
Information about you: name, preferences, communication style, timezone. Stored in ~/.hermes/memories/USER.md. Limited to 1,375 characters (~500 tokens).
Session search
Beyond the two markdown files, Hermes stores all past conversations in SQLite with FTS5 full-text search. It can search through weeks of old conversations to find something you discussed before:
❯ What did we discuss about the Nginx configuration last week?
The agent searches its session history, summarizes the relevant parts, and gives you the answer. No other assistant in the OpenClaw family does this.
Configure memory
In ~/.hermes/config.yaml:
memory:
memory_enabled: true
user_profile_enabled: true
memory_char_limit: 2200
user_char_limit: 1375
Skills system
This is the feature I keep coming back to. Skills are not pre-built plugins you install from a hub. The agent creates them from tasks it completes.
How skill creation works
- You ask Hermes to do something complex (deploy a service, set up a CI pipeline, configure Nginx)
- Hermes completes the task using its tools
- After finishing, it extracts the workflow into a reusable skill
- Next time you ask for something similar, it uses and refines that skill
Skills also self-improve. Each time one runs, the agent checks whether the steps could be better and updates the skill if so.
Browse and install community skills
hermes skills search kubernetes
hermes skills search react --source skills-sh
hermes skills install openai/skills/k8s
Or use the /skills slash command inside chat.
Skills Hub
Community-contributed skills live at agentskills.io. You can browse and install them, or publish your own.
Setting up messaging platforms
Hermes talks to Telegram, Discord, Slack, WhatsApp, Signal, SMS, Email, Home Assistant, Mattermost, Matrix, and DingTalk through a single gateway process.
Quick setup
hermes gateway setup
The interactive wizard walks you through each platform. It shows what’s already configured and offers to start the gateway when done.
- Talk to @BotFather on Telegram and create a new bot
- Copy the bot token
- Add it to
~/.hermes/.env:
TELEGRAM_BOT_TOKEN=your-bot-token-here
TELEGRAM_ALLOWED_USERS=your-telegram-user-idTo find your user ID, talk to @userinfobot on Telegram.
- Start the gateway:
hermes gateway start - Go to the Discord Developer Portal
- Create a new application and add a Bot
- Under Privileged Gateway Intents, enable Message Content Intent
- Copy the bot token and add it to
~/.hermes/.env:
DISCORD_BOT_TOKEN=your-bot-token-here
DISCORD_ALLOWED_USERS=your-discord-user-id- Invite the bot to your server with this URL (replace
YOUR_APP_ID):
https://discord.com/oauth2/authorize?client_id=YOUR_APP_ID&scope=bot+applications.commands&permissions=274878286912- Start the gateway:
hermes gateway start WhatsApp and Signal both work through the gateway. Run the setup wizard:
hermes gateway setupSelect the platform you want and follow the prompts. WhatsApp uses the Baileys library for web client pairing. Signal requires the Signal CLI.
Run the gateway as a service
On a VPS, you want the gateway running at boot:
# Install as a systemd service
hermes gateway install
# Start it
hermes gateway start
# Check status
hermes gateway status
# View logs
journalctl --user -u hermes-gateway -f
# Enable lingering so it survives logout
sudo loginctl enable-linger $USER
On a headless VPS, use the system service instead:
sudo hermes gateway install --system
sudo hermes gateway start --system
Security
The gateway denies all users by default. Only users in the allowlist can interact:
# In ~/.hermes/.env
TELEGRAM_ALLOWED_USERS=123456789
DISCORD_ALLOWED_USERS=123456789012345678
As an alternative to allowlists, you can use DM pairing. Unknown users get a one-time pairing code when they message the bot:
hermes pairing approve telegram XKGH5N7P
hermes pairing list
hermes pairing revoke telegram 123456789
Terminal backends
By default, Hermes runs commands directly on your machine. For security, you can isolate command execution in a container or send it to a remote server.
# In ~/.hermes/config.yaml
terminal:
backend: local # or: docker, ssh, singularity, modal, daytona
timeout: 180
| Backend | What it does | Best for |
|---|---|---|
local | Runs on your machine (default) | Development, trusted tasks |
docker | Isolated containers | Security, reproducibility |
ssh | Remote server | Keeping the agent away from its own code |
daytona | Cloud sandbox workspace | Persistent remote dev environments |
singularity | HPC containers | Cluster computing |
modal | Serverless cloud | Scale-to-zero, pay-per-use |
Docker isolation
terminal:
backend: docker
docker_image: python:3.11-slim
container_cpu: 1
container_memory: 5120
container_disk: 51200
container_persistent: true
Containers run with a read-only root filesystem, all Linux capabilities dropped, no privilege escalation, PID limits, and namespace isolation.
SSH backend
Run commands on a separate machine entirely:
terminal:
backend: ssh
# In ~/.hermes/.env
TERMINAL_SSH_HOST=my-server.example.com
TERMINAL_SSH_USER=myuser
TERMINAL_SSH_KEY=~/.ssh/id_rsa
Voice mode
Hermes has voice support across CLI and messaging. You can talk to it with your mic in the terminal, get spoken replies in Telegram and Discord, and have live voice conversations in Discord voice channels.
Prerequisites
# Install voice dependencies
pip install hermes-agent[voice]
# Optional: free local speech-to-text (recommended)
pip install faster-whisper
# System dependencies (Ubuntu/Debian)
sudo apt install portaudio19-dev ffmpeg libopus0
CLI voice mode
Start the CLI and enable voice:
hermes
/voice on
Press Ctrl+B to record. Speak, and when you stop, it auto-detects silence after 3 seconds and transcribes your audio. If TTS is enabled (/voice tts), the agent speaks its reply back.
Messaging voice replies
In Telegram or Discord, send:
/voice tts
The agent now sends spoken audio alongside text for every response.
Discord voice channels
The agent can join a Discord voice channel, listen to you speak, and reply with spoken audio:
/voice join
This requires additional Discord bot permissions (Connect, Speak, Use Voice Activity) and the Opus codec on your server.
TTS providers
| Provider | Cost | Quality | Setup |
|---|---|---|---|
| Edge TTS | Free | Good | Works out of the box |
| NeuTTS | Free | Good | pip install neutts[all] |
| ElevenLabs | Paid | Premium | Set ELEVENLABS_API_KEY |
| OpenAI TTS | Paid | Good | Set VOICE_TOOLS_OPENAI_KEY |
Configure in ~/.hermes/config.yaml:
tts:
provider: "edge"
edge:
voice: "en-US-AriaNeural"
stt:
provider: "local"
local:
model: "base"
Personality and SOUL.md
Hermes uses a file called SOUL.md as its identity. It goes into the system prompt first, before anything else, and shapes how the agent talks and thinks.
Edit it at ~/.hermes/SOUL.md:
# Personality
You are a pragmatic senior engineer with strong taste.
You optimize for truth, clarity, and usefulness over politeness theater.
## Style
- Be direct without being cold
- Prefer substance over filler
- Push back when something is a bad idea
- Keep explanations compact unless depth is useful
## What to avoid
- Sycophancy
- Hype language
- Overexplaining obvious things
Built-in personalities
Switch personalities on the fly with /personality:
| Name | Description |
|---|---|
helpful | Friendly, general-purpose assistant |
concise | Brief, to-the-point responses |
technical | Detailed technical expert |
creative | Innovative thinking |
teacher | Patient educator with examples |
pirate | Tech-savvy buccaneer |
noir | Hard-boiled detective narration |
/personality concise
/personality pirate
SOUL.md is your baseline. /personality is a session-level overlay.
Scheduled tasks
Hermes has a built-in cron scheduler. Just tell it what you want in plain English:
❯ Every morning at 9am, check Hacker News for AI news and send me a summary on Telegram.
❯ Every Friday at 5pm, back up the PostgreSQL database and report the size.
❯ Every hour, check if nginx is running and restart it if not.
The agent creates cron jobs that run through the gateway. Results land on whatever platform you configured.
Migrating from OpenClaw
If you’re coming from OpenClaw, Hermes can import your settings, memories, skills, and API keys.
During first-time setup: The setup wizard (hermes setup) auto-detects ~/.openclaw and offers to migrate.
Anytime after install:
hermes claw migrate # Interactive migration
hermes claw migrate --dry-run # Preview what would be migrated
hermes claw migrate --preset user-data # Migrate without secrets
hermes claw migrate --overwrite # Overwrite existing conflicts
What gets imported:
- SOUL.md — persona file
- Memories — MEMORY.md and USER.md entries
- Skills — user-created skills go to
~/.hermes/skills/openclaw-imports/ - Command allowlist — approval patterns
- Messaging settings — platform configs, allowed users, working directory
- API keys — Telegram, OpenRouter, OpenAI, Anthropic, ElevenLabs tokens
- TTS assets — workspace audio files
How Hermes Agent differs from OpenClaw
I’ve used both extensively. Here are the differences that actually matter day to day:
The learning loop changes how you work with it. In OpenClaw, every complex task starts from scratch. In Hermes, once you’ve walked the agent through deploying a Docker service, it creates a skill for that workflow. Next time you say “deploy the staging API,” it already knows the steps.
Session search gives it long-term recall. OpenClaw’s memory is limited to what fits in its context files. Hermes stores every conversation in SQLite with full-text search. Ask “what port did we configure for Redis last month?” and it finds the answer in old sessions.
The personality system is more structured. OpenClaw uses a system prompt you edit by hand. Hermes has SOUL.md as a durable identity file, session-level /personality overlays, and 14 built-in presets.
Voice mode actually works. Mic input in the CLI, TTS replies on messaging, and live Discord voice channel conversations. Nothing else in the OpenClaw family does this.
Terminal backend isolation. OpenClaw runs commands on your local machine. Hermes can run them in Docker containers, on remote servers via SSH, or in serverless environments like Modal and Daytona. If the agent breaks something, it breaks the sandbox, not your server.
Migration is painless. hermes claw migrate imports everything from OpenClaw — memories, skills, API keys, platform configs. You don’t start over.
Where OpenClaw still wins: bigger community, more third-party dashboards and skins, longer track record. If you want the more battle-tested option, OpenClaw is still it.
CLI command reference
| Command | Description |
|---|---|
hermes | Start chatting |
hermes model | Choose your LLM provider and model |
hermes tools | Configure which tools are enabled |
hermes setup | Full setup wizard |
hermes config set KEY VAL | Set a config value |
hermes config edit | Open config.yaml in your editor |
hermes gateway setup | Configure messaging platforms |
hermes gateway install | Install as a system service |
hermes gateway start | Start the messaging gateway |
hermes gateway status | Check gateway status |
hermes skills search QUERY | Search for skills |
hermes skills install SKILL | Install a skill |
hermes claw migrate | Migrate from OpenClaw |
hermes update | Update to latest version |
hermes doctor | Diagnose issues |
hermes --continue | Resume last session |
Configuration reference
The full directory structure:
~/.hermes/
├── config.yaml # Settings (model, terminal, TTS, compression, etc.)
├── .env # API keys and secrets
├── auth.json # OAuth provider credentials
├── SOUL.md # Agent identity
├── memories/ # MEMORY.md, USER.md
├── skills/ # Agent-created skills
├── cron/ # Scheduled jobs
├── sessions/ # Gateway sessions
└── logs/ # Logs (secrets auto-redacted)
Key config options in config.yaml:
model:
provider: "openrouter"
default: "openrouter/free"
terminal:
backend: local
timeout: 180
memory:
memory_enabled: true
user_profile_enabled: true
tts:
provider: "edge"
stt:
provider: "local"
display:
tool_progress: all
Frequently asked questions
Do I need a GPU to run Hermes Agent?
No. Hermes Agent is the client — it sends requests to an LLM provider like OpenRouter. The provider runs the model on their hardware. Your server just needs enough RAM for the Python process (around 200MB).
Can I use Hermes Agent with Ollama for fully local inference?
Yes. Set OPENAI_BASE_URL=http://localhost:11434/v1 and OPENAI_API_KEY=ollama in ~/.hermes/.env. Or use hermes model and select “Custom endpoint.” See our Ollama Docker guide for setting up Ollama.
How does the OpenRouter free tier work?
OpenRouter offers several models at zero cost with rate limiting. You get a generous daily allowance. For heavier use, add credits to your OpenRouter account and switch to paid models. No changes needed on the Hermes side — same API key, different model name.
Can I migrate from OpenClaw?
Yes. Run hermes claw migrate and it imports your persona, memories, skills, API keys, and platform configs. Use --dry-run first to preview what gets migrated.
Does Hermes work on Windows?
Not natively. Install WSL2 and run the installer inside WSL2.
Can multiple people use one Hermes instance?
Yes, through the messaging gateway. Each platform chat gets its own session. Multiple Telegram users or Discord channels can talk to the same instance with separate conversation histories. Use allowlists to control who has access.
How does Hermes Agent compare to OpenFang?
OpenFang focuses on autonomous agents (Hands) that work on schedules without prompting, and has 40 channel adapters with 16 security layers. Hermes focuses on the learning loop, voice mode, and a deeper memory system. Different strengths. See our OpenFang setup guide for the full comparison.
How does it compare to CoPaw?
CoPaw has a built-in web console and better support for Chinese messaging apps (DingTalk, Feishu, QQ). Hermes has the learning loop, voice mode, and more terminal backend options. See our CoPaw setup guide for details.
Is my data private?
All data stays on your server. The only external calls go to your configured LLM provider. If you run local models via Ollama, nothing leaves your machine at all. Logs auto-redact secrets.
If you want an assistant that actually gets better the more you use it, Hermes Agent is the one to try. The skill creation loop and session search are things I haven’t seen in any other project in this space. And if you’re already on OpenClaw, the migration command means you can try it without losing anything.
For other self-hosted assistant options, check out our OpenClaw alternatives roundup. If you want autonomous agents running on schedules, the OpenFang setup guide covers that. For a single Go binary on minimal hardware, see the PicoClaw setup guide. And for the multi-channel web console approach, the CoPaw setup guide has the details. For the best third-party web UIs and dashboards for Hermes Agent, see the best Hermes dashboards roundup.