- Nightwatch is an open‑source, read‑only AI layer that sits on top of your monitoring stack.
The project bundles a local agent that watches each environment, keeps credentials on‑prem, and only contacts a central brain outbound. It clusters alerts into incidents, suppresses noisy checks, and can launch the agent to collect evidence from live services. The author built it after a botched Kubernetes upgrade forced a night‑time rollback, where figuring out the broken component took too long.
For SRE teams, the tool offers a head start on incident triage without opening new inbound ports or relying on large language models in production. All clustering runs offline; the only LLM needed is for the agent’s tool‑calling, which can be pointed at a self‑hosted model.
It’s a niche experiment, useful if you already run multiple clusters, but the read‑only guarantee means you’ll still need human verification before any remediation.
