System Monitor

Real-time infrastructure health & performance

Live · updating every 1.2s
Global Uptime
99.1%
vs 99.5% target
Avg Latency
48ms
within SLA
Peak RPS
2,420
req/sec throughput
Error Rate
2.3%
within tolerance
Active Nodes
8
1 stressed · 1 overloaded
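To put the gap between 99.1% uptime and the 99.5% target in concrete terms, the percentages can be converted into minutes of downtime. This is only a sketch: the 30-day window and the names `WINDOW_MINUTES` and `downtime_minutes` are assumptions, since the dashboard does not state a reporting period.

```python
# Figures copied from the Global Uptime card.
UPTIME_PCT = 99.1
TARGET_PCT = 99.5

# Assumption: a 30-day reporting window; the dashboard does not state one.
WINDOW_MINUTES = 30 * 24 * 60


def downtime_minutes(uptime_pct: float, window_minutes: int = WINDOW_MINUTES) -> float:
    """Minutes of downtime implied by an uptime percentage over the window."""
    return (100.0 - uptime_pct) / 100.0 * window_minutes


actual = downtime_minutes(UPTIME_PCT)   # downtime implied by the measured uptime
budget = downtime_minutes(TARGET_PCT)   # downtime the target would allow

print(f"implied downtime: {actual:.1f} min, allowed by target: {budget:.1f} min")
print(f"over budget by: {actual - budget:.1f} min")
```

Under that assumed window, 99.1% implies roughly 389 minutes of downtime against about 216 minutes allowed by the target.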
Latency Trend
Avg + P95 over last 60s
Error Rate
% failed requests over last 60s
Throughput per Node
Requests/sec across all deployed infrastructure
Total: 14,315 RPS
Node Health
8 nodes deployed
1 overloaded
1 stressed
Name         Type           Tier  CPU  Memory  Conns  RPS    Uptime  Status
auth-svc     Microservice   T1    94%  89%     2      1,560  96.1%   overloaded
app-srv-01   Server         T1    78%  82%     1      1,050  98.2%   stressed
app-srv-02   Server         T1    61%  57%     1      980    99.4%   active
lb-primary   Load Balancer  T1    55%  44%     3      2,180  99.9%   active
pg-main      Database       T2    48%  71%     3      1,890  99.7%   active
api-gw       API Gateway    T1    42%  38%     2      2,420  99.8%   active
cdn-us       CDN            T1    28%  22%     0      615    100.0%  active
redis-01     Cache          T1    22%  45%     1      3,620  100.0%  active
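As a cross-check, the per-node RPS and uptime figures from the Node Health rows above can be aggregated in a few lines. The sketch is illustrative only: the `Node` type and variable names are hypothetical, and treating Global Uptime as an unweighted mean of node uptimes is an assumption that happens to match the 99.1% shown on the summary card.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """Hypothetical record holding the columns used in this check."""
    name: str
    rps: int
    uptime_pct: float
    status: str


# Values read from the Node Health rows.
NODES = [
    Node("auth-svc", 1560, 96.1, "overloaded"),
    Node("app-srv-01", 1050, 98.2, "stressed"),
    Node("app-srv-02", 980, 99.4, "active"),
    Node("lb-primary", 2180, 99.9, "active"),
    Node("pg-main", 1890, 99.7, "active"),
    Node("api-gw", 2420, 99.8, "active"),
    Node("cdn-us", 615, 100.0, "active"),
    Node("redis-01", 3620, 100.0, "active"),
]

# Sum of per-node throughput, to compare with the "Total" line.
total_rps = sum(node.rps for node in NODES)

# Assumption: global uptime is the plain average of node uptimes.
mean_uptime = sum(node.uptime_pct for node in NODES) / len(NODES)

# Count of nodes per status, to compare with the Node Health header.
status_counts = Counter(node.status for node in NODES)

print(f"total RPS: {total_rps:,}")
print(f"mean uptime: {mean_uptime:.1f}%")
print(dict(status_counts))
```

The summed throughput agrees with the "Total: 14,315 RPS" line, and the status counts agree with the "1 overloaded" and "1 stressed" badges in the Node Health header.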
Alert Feed
1 critical · 3 warnings
Overload Detected
auth-svc
CPU at 94%, memory at 89%. Cascade risk elevated.
12s ago
High CPU
app-srv-01
CPU sustained above 78% for 3 minutes.
1m ago
Connection Saturated
lb-primary → app-srv-01
Link saturation at 78%. Consider adding a replica.
2m ago
Traffic Spike
api-gw
RPS jumped from 1.8k to 2.4k in 30 seconds.
4m ago
Latency Spike Resolved
pg-main
P95 latency returned below 100ms SLA.
7m ago
Cache Hit Rate Low
redis-01
Hit rate dropped to 68%. Review cache TTL config.
12m ago
Memory Pressure
pg-main
Memory at 71%. Monitor for buffer pool exhaustion.
18m ago
Node Recovery
cdn-us
CDN node recovered after brief connectivity drop.
25m ago
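The alerts above suggest threshold-style rules, though the dashboard does not publish them. The following sketch shows one way such rules might look. The cut-offs `OVERLOADED_CPU` and `STRESSED_CPU` and the helpers `classify` and `pct_change` are hypothetical, chosen only so that they reproduce the statuses and the traffic-spike figure shown here.

```python
# Hypothetical thresholds: the dashboard does not state its actual rules.
OVERLOADED_CPU = 90
STRESSED_CPU = 75


def classify(cpu_pct: int) -> str:
    """Map a CPU percentage to a status label using the assumed cut-offs."""
    if cpu_pct >= OVERLOADED_CPU:
        return "overloaded"
    if cpu_pct >= STRESSED_CPU:
        return "stressed"
    return "active"


def pct_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


# CPU readings for three of the nodes listed under Node Health.
CPU_BY_NODE = {"auth-svc": 94, "app-srv-01": 78, "app-srv-02": 61}

for name, cpu in CPU_BY_NODE.items():
    print(f"{name}: {cpu}% CPU -> {classify(cpu)}")

# The api-gw Traffic Spike alert: 1.8k to 2.4k RPS in 30 seconds.
spike = pct_change(1800, 2400)
print(f"api-gw RPS change: {spike:.1f}%")
```

A real rule set would likely also weigh memory, connection counts, and how long a condition persists, as the "sustained ... for 3 minutes" wording in the High CPU alert hints.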