Enterprise MLOps Edition

Hermes Agent — Field Guide

A terminal AI agent with 40+ tools and 51 industry skills.
Built on NousResearch hermes-agent · Enterprise MLOps overlay by QbitLoop

Quick Install
curl -fsSL https://hermes.qbitloop.com/install.sh | bash

The Problem With Institutional MLOps Knowledge

Every team that runs ML in production has the same problem: the knowledge to operate it lives in people's heads. The senior engineer who knows the fraud model's blind spots, the vLLM timeout trick, the W&B artifact naming convention — that's not in any runbook.

When that person is on vacation, it's a 2am page and a four-hour war room. When they leave, the knowledge walks out with them.

Hermes captures that knowledge as skills — plain-text SKILL.md files that describe tools, workflows, and domain rules. The first deployment might take 6 hours. The tenth takes one command.

"Traditional monitoring tells you '3 towers are red.' This tells you CHI-4412 backhaul died, the same fiber that got cut in November, neighbors are absorbing traffic, and you have until 7 AM before the cluster collapses — here's the work order."
40+ Built-in Tools
51 MLOps Skills
60 Max Agent Iterations
5 Sandbox Backends

Five Verticals, One Agent

Each vertical ships with pre-built skills that encode domain rules — no prompt engineering required. Copy the commands below and run them.

Financial Services
Fraud detection MLOps · vLLM latency diagnosis · W&B governance audit trail
hermes chat -q "diagnose vLLM latency spike on fraud-model-v3"
hermes chat -q "generate W&B governance report for Q4 model runs"
Healthcare & Life Sciences
Clinical RAG over 2M documents · FHIR structured output · CLIP multimodal search
hermes chat -q "build RAG pipeline over clinical-notes/ with FHIR output"
hermes chat -q "run CLIP similarity search on radiology-images/"
Telecommunications ★ Hero Demo
Network anomaly detection at 50K towers · SAELens interpretability · real-time alerting
hermes chat -q "detect anomalies in tower-metrics/ using SAELens"
hermes chat -q "schedule hourly network health report to Telegram"
Retail & E-Commerce
Black Friday 10x traffic scaling · Qdrant product search · demand forecasting
hermes chat -q "scale inference cluster for Black Friday load test"
hermes chat -q "index product-catalog/ in Qdrant with CLIP embeddings"
Public Sector & Defense
Document intelligence · 100K forms/day · Chroma policy search · air-gap support
hermes chat -q "extract structured data from forms/ at 100K/day throughput"
hermes chat -q "search policy-docs/ for FedRAMP ATO requirements"

Capabilities

Terminal · hermes — startup
Hermes Agent TUI startup showing Available Tools and Skills panels

Hermes TUI on launch: pixel-art banner, Available Tools panel (left), Loaded Skills panel (right)

Memory System

Session context + persistent MEMORY.md (~800 tokens) + USER.md profile (~500 tokens). Agent remembers preferences across sessions.

ReAct Loop

Reason → Act → Observe, up to 60 iterations per task. Chains multiple tool calls automatically to solve complex multi-step problems.

Tool Suite

Terminal, file, web, browser, vision, image gen, code execution, cron, Slack, Discord, Telegram, WhatsApp — all built in.

Skill System

SKILL.md format: plain text files that give the agent domain knowledge. Searchable from agentskills.io. MIT licensed.

Model-Agnostic

OpenRouter (200+ models), Nous Portal, or custom endpoint. Switch providers with one command. No vendor lock-in.

Sandbox Backends

Local, Docker, E2B, Modal, or SSH remote execution. Agent code stays on your machine; commands run in the sandbox.

Terminal · "What MLOps skills do I have installed?"
Agent response listing all 24 MLOps skills with descriptions

Agent auto-categorizes 24 installed MLOps skills across Training, Inference, Vector DBs, Vision, Eval, and Industry verticals


Watching the Agent Think

Every query shows its reasoning in real time via tool calls. Here's what happened when asked to diagnose a fraud model outage:

Tool Call What It Means
📚 skill finance Loaded finance-mlops SKILL.md — now knows fraud detection patterns, SR 11-7 model risk rules, vLLM workflows
📚 skill weights-and-biases Loaded W&B skill — uses experiment tracking to find when model drift started
📚 skill vllm Loaded vLLM skill — for diagnosing the inference serving layer
📚 skill instructor Loaded Instructor skill — for structured output when extracting model diagnostics
💡 computing... (48.2s) Synthesizing all loaded skills into a coherent diagnosis plan

The key insight: the agent pulled 4 relevant skills automatically from one plain-English prompt. No one told it to load W&B or vLLM — it reasoned that a fraud detection outage would involve those tools. This is institutional knowledge capture in action.

Terminal · 50,000-tower telecom diagnosis
Chicago tower diagnosis timeline — morning rush avoided

Chicago cluster: 3 towers flagged at 2am, root cause isolated in 10 minutes, morning rush impact avoided

Telecom Hero Timeline

Time Action Tool
02:00 AMAlert fires on 3 towersMonitoring system
02:02Pull tower metadata + adjacencyTower DB
02:05KPI snapshot, identify fingerprintKPI pipeline
02:10SAELens diagnosis → "CHI-4412 backhaul failure" (15.3x)Modal + SAELens
02:15Incident RAG → "Fiber cut Nov, same tower, same crew"Qdrant
02:20Cascade prediction → "18% drop rate by 7 AM"SAELens steering
02:25Work order generated + dispatchedInstructor
02:30Microwave backup activatedNOC manual
04:15Fiber splice completeField crew
07:00+Morning rush — full capacity, no dropped callsAvoided
MetricValue
Diagnose to root cause10 minutes
Mitigate (microwave backup)30 minutes
Full resolution (fiber repair)2h 15m
Morning rush impactAVOIDED

Fraud Detection Tool Assignment

StepToolWhy
Feature drift detectionW&BLog PSI, compare distributions, governance artifacts
Model version auditW&B RegistryCheck who deployed what, when
Serving diagnosticsvLLM metricsLatency, cache usage, throughput
FN cluster analysisInstructorStructured extraction of new fraud pattern
Threshold re-tuningW&B SweepsBayesian optimization of threshold + class weights
Burst retrainingModalSpin up GPUs for fast retrain
Incident documentationW&B ArtifactsSR 11-7 compliant audit trail
Ongoing monitoringModal + W&BScheduled feature health checks + automated alerts

From Zero to Running Agent in 5 Minutes

1
Install Hermes
One command installs the agent, enterprise MLOps overlay, and all 51 skills.
curl -fsSL https://hermes.qbitloop.com/install.sh | bash
2
Add your LLM key
Get a free key at openrouter.ai — 200+ models, pay-per-use, no subscription.
# Edit ~/.hermes/.env
OPENROUTER_API_KEY=sk-or-v1-...
3
Launch the TUI
Opens the full terminal interface with tools and skills panels.
hermes
4
Try your first query
Ask it anything. For MLOps, start with your vertical:
hermes chat -q "What MLOps skills do I have installed and what can each one do?"
5
Run a vertical demo
Pick a scenario from Section 02 and run it. The agent will auto-load the relevant skills.
# Telecom example:
hermes chat -q "I manage 50,000 cell towers. Three towers in the Chicago cluster showed anomalous traffic patterns at 2am. Walk me through how you'd diagnose this using the telecom-mlops skill."