added
Arqos now watches the gear that can't run an agent
Hypervisors, switches, and storage appliances don't allow software installs — so any Arqos agent at the same site now quietly polls them for you (ping and port checks, every minute). There's nothing to assign or babysit: agents share the work as a pool, and if one goes away another takes over on its own. If the site ever loses all eyes, you get one honest "monitoring blind" notice instead of a wall of false alarms.
For the curious:
Checks are site-scoped leases that any reporting agent can claim and renew (failover takes roughly 3–4 minutes once the whole fleet is on v0.4+). Probed devices use the same absence-based detection as agents: passing checks count as signs of life, partial failure shows as degraded, and a device that goes silent with no evidence either way turns gray ("unknown"), with a single site-level alert after a grace period. SNMP and read-only vSphere collection are in development.
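To make the lease idea concrete, here is a minimal sketch of how a pool of agents might share checks. It is not Arqos's actual implementation: the names `Lease`, `claim_or_renew`, and `device_status`, the 180-second TTL, and the "down" state are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumed TTL: picked only so a dead agent's lease lapses in about 3 minutes.
LEASE_TTL_S = 180.0


@dataclass
class Lease:
    check_id: str                 # e.g. "ping:core-switch" (made-up identifier)
    holder: Optional[str] = None  # agent currently responsible for the check
    expires_at: float = 0.0       # timestamp after which any agent may claim


def claim_or_renew(lease: Lease, agent_id: str, now: float,
                   ttl: float = LEASE_TTL_S) -> bool:
    """Return True if agent_id holds the lease after this call."""
    free = lease.holder is None or now >= lease.expires_at
    if free or lease.holder == agent_id:
        lease.holder = agent_id
        lease.expires_at = now + ttl
        return True
    return False  # another agent holds a live lease; leave it alone


def device_status(recent_checks: List[bool]) -> str:
    """Map recent check outcomes to a display state."""
    if not recent_checks:
        return "unknown"   # silent with no evidence: gray
    if all(recent_checks):
        return "healthy"
    if any(recent_checks):
        return "degraded"  # partial failure
    return "down"          # assumption: the notes above do not name this state
```

In this sketch an agent that stops renewing simply lets its lease expire, and the next agent to call `claim_or_renew` picks the check up. A real service would need to make the claim atomic (for example, a conditional database update) so two agents cannot win the same lease.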
added
Alerts now know who to tell, and when to escalate
You can route notifications by site and severity, every incident stays in one email thread from first alert to all-clear, and anyone can acknowledge with a single click — no sign-in needed. If a critical alert sits unacknowledged, Arqos walks it up the chain you define, one contact at a time.
For the curious:
Routing is per channel with severity floors, acknowledge links are signed and work in one click, and the escalation ladder is driven by each channel's unacknowledged delay. Recovery notices reply into the original incident thread, and anyone the incident was escalated to gets the all-clear too.
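The routing and acknowledge mechanics can be sketched roughly as follows. This is an illustration under stated assumptions, not the product's code: the `Channel` fields, the three severity levels, the `channels_due` helper, and the HMAC-SHA256 signing scheme are all hypothetical, and the sketch does not restrict escalation to critical alerts.

```python
import hashlib
import hmac
from dataclasses import dataclass
from typing import List

SEVERITY = {"info": 0, "warning": 1, "critical": 2}  # assumed levels


@dataclass
class Channel:
    name: str
    site: str
    floor: str            # lowest severity this channel wants to hear about
    unacked_delay_s: int  # seconds unacknowledged before this channel is told


def channels_due(channels: List[Channel], site: str, severity: str,
                 unacked_for_s: int) -> List[str]:
    """Channels that should have been notified by now for an unacked incident."""
    return [
        c.name for c in channels
        if c.site == site
        and SEVERITY[severity] >= SEVERITY[c.floor]
        and unacked_for_s >= c.unacked_delay_s
    ]


def sign_ack(secret: bytes, incident_id: str, contact: str) -> str:
    """Signature to embed in a one-click acknowledge link."""
    msg = f"{incident_id}|{contact}".encode()
    return hmac.new(secret, msg, hashlib.sha256).hexdigest()


def verify_ack(secret: bytes, incident_id: str, contact: str,
               signature: str) -> bool:
    """Accept the click only if the signature matches; no sign-in involved."""
    expected = sign_ack(secret, incident_id, contact)
    # Constant-time comparison avoids leaking how many characters matched.
    return hmac.compare_digest(expected.encode(), signature.encode())
```

Because each channel carries its own delay, the "ladder" here is just the set of channels whose delay has elapsed; sorting channels by `unacked_delay_s` gives the order in which contacts are reached.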
added
Snooze alerts for planned maintenance
Rebooting a server on purpose? Snooze its alert emails for an hour, four hours, or until 8am — right from the alert email itself. Detection never pauses and the record never has gaps; only the emails go quiet. If the machine is still down when the snooze expires, you get one final alert.
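As a rough illustration of the three snooze choices, the sketch below computes when a snooze would expire. The function names and the choice labels are hypothetical, and it assumes "until 8am" means the next 8:00 in whatever local time the `now` value represents; the notes above do not say how time zones are handled.

```python
from datetime import datetime, timedelta
from typing import Optional


def snooze_until(now: datetime, choice: str) -> datetime:
    """Return the moment alert emails resume for a given snooze choice."""
    if choice == "1h":
        return now + timedelta(hours=1)
    if choice == "4h":
        return now + timedelta(hours=4)
    if choice == "8am":
        target = now.replace(hour=8, minute=0, second=0, microsecond=0)
        if target <= now:
            target += timedelta(days=1)  # already past 8:00 today, use tomorrow
        return target
    raise ValueError(f"unknown snooze choice: {choice!r}")


def emails_muted(now: datetime, snoozed_until: Optional[datetime]) -> bool:
    """Only email delivery is gated; detection and history are unaffected."""
    return snoozed_until is not None and now < snoozed_until
```

Keeping the snooze as a single expiry timestamp checked at send time is one simple way to mute emails without touching detection or the recorded history.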
improved
The agent can no longer be wedged by a dead network drive
A disconnected mapped drive or a stuck Windows subsystem could previously freeze an agent mid-collection — alive as a process, silent as a reporter. Every system call the agent makes now has a deadline, and a watchdog restarts the agent cleanly if a collection cycle ever stalls.
For the curious:
Load, partition, and per-mount usage reads each have a per-call timeout; mounts that time out are benched for 30 minutes; and if no collection cycle completes within max(5 × interval, 5 min), the process exits so the scheduled task can restart it. Shipped as agent v0.4.2.
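The three safeguards can be sketched in a few lines. This is not the agent's source, and the agent's implementation language is not stated here; `call_with_deadline`, `MountBench`, and `should_exit` are hypothetical names, and running each call on a daemon thread is just one way to put a deadline on a blocking read.

```python
import threading
from typing import Any, Callable, Dict

BENCH_S = 30 * 60  # mounts that time out are skipped for 30 minutes


def stall_limit(interval_s: float) -> float:
    """Longest gap allowed between completed cycles: max(5 x interval, 5 min)."""
    return max(5 * interval_s, 300.0)


def should_exit(last_cycle_done: float, now: float, interval_s: float) -> bool:
    """Watchdog test; a real agent would exit the process when this is True."""
    return now - last_cycle_done > stall_limit(interval_s)


def call_with_deadline(fn: Callable[..., Any], timeout_s: float, *args: Any) -> Any:
    """Run fn(*args) on a daemon thread and give up after timeout_s seconds."""
    box: Dict[str, Any] = {}

    def run() -> None:
        try:
            box["value"] = fn(*args)
        except Exception as exc:  # hand the failure back to the caller
            box["error"] = exc

    worker = threading.Thread(target=run, daemon=True)
    worker.start()
    worker.join(timeout_s)
    if worker.is_alive():
        # The stuck call keeps running in the background; we stop waiting on it.
        raise TimeoutError(f"call exceeded {timeout_s}s")
    if "error" in box:
        raise box["error"]
    return box["value"]


class MountBench:
    """Remembers which mounts timed out so later cycles skip them for a while."""

    def __init__(self) -> None:
        self._until: Dict[str, float] = {}

    def bench(self, mount: str, now: float) -> None:
        self._until[mount] = now + BENCH_S

    def is_benched(self, mount: str, now: float) -> bool:
        return now < self._until.get(mount, 0.0)
```

A collection loop built on this would wrap each per-mount read in `call_with_deadline`, call `bench` on a `TimeoutError`, and record the time of every completed cycle for `should_exit` to compare against.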
improved
The fleet view reads like a status, not a spreadsheet
Devices are now grouped by site with problems sorted to the top, healthy fleets collapse into a one-line "all quiet" strip that shows your last incident and how fast it recovered, and a quick switcher (⌘K) jumps you to any device by name.