System Monitor
Real-time infrastructure health & performance
Live · updating every 1.2s
Global Uptime
99.1%
↓ below 99.5% target
Avg Latency
48ms
↓ within SLA
Peak RPS
2,420
↑ req/sec throughput
Error Rate
2.3%
→ within tolerance
Active Nodes
8
→ 1 stressed · 1 overloaded
Latency Trend
Avg + P95 over last 60s
Legend: Avg · P95
Error Rate
% failed requests over last 60s
Throughput per Node
Requests/sec across all deployed infrastructure
Total: 14,315 RPS
Node Health
8 nodes deployed
1 overloaded · 1 stressed
| Name | Type | Tier | CPU | Memory | Conns | RPS | Uptime | Status |
|---|---|---|---|---|---|---|---|---|
| auth-svc | Microservice | T1 | 94% | 89% | 2 | 1,560 | 96.1% | overloaded |
| app-srv-01 | Server | T1 | 78% | 82% | 1 | 1,050 | 98.2% | stressed |
| app-srv-02 | Server | T1 | 61% | 57% | 1 | 980 | 99.4% | active |
| lb-primary | Load Balancer | T1 | 55% | 44% | 3 | 2,180 | 99.9% | active |
| pg-main | Database | T2 | 48% | 71% | 3 | 1,890 | 99.7% | active |
| api-gw | API Gateway | T1 | 42% | 38% | 2 | 2,420 | 99.8% | active |
| cdn-us | CDN | T1 | 28% | 22% | 0 | 615 | 100.0% | active |
| redis-01 | Cache | T1 | 22% | 45% | 1 | 3,620 | 100.0% | active |
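
The headline figures can be cross-checked against the Node Health table. The sketch below sums the per-node RPS column and takes an unweighted mean of the Uptime column. The dashboard does not say how Global Uptime is derived, so the unweighted mean is an assumption; it happens to agree with the 99.1% shown. The names `NODES`, `total_rps`, and `mean_uptime` are hypothetical.

```python
from statistics import mean

# (name, rps, uptime_percent) copied from the Node Health table above.
NODES = [
    ("auth-svc", 1560, 96.1),
    ("app-srv-01", 1050, 98.2),
    ("app-srv-02", 980, 99.4),
    ("lb-primary", 2180, 99.9),
    ("pg-main", 1890, 99.7),
    ("api-gw", 2420, 99.8),
    ("cdn-us", 615, 100.0),
    ("redis-01", 3620, 100.0),
]


def total_rps(nodes):
    """Sum requests/sec across every node."""
    return sum(rps for _, rps, _ in nodes)


def mean_uptime(nodes):
    """Unweighted mean uptime, rounded to one decimal (assumed formula)."""
    return round(mean(uptime for _, _, uptime in nodes), 1)


total = total_rps(NODES)
uptime = mean_uptime(NODES)
print(f"Total: {total:,} RPS")   # Total: 14,315 RPS
print(f"Mean uptime: {uptime}%")  # Mean uptime: 99.1%
```

A traffic-weighted mean would give a different figure, so treat the match as a consistency check, not as confirmation of the dashboard's formula.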
Alert Feed
1 critical · 3 warnings
Live
Overload Detected
auth-svc
CPU at 94%, memory at 89%. Cascade risk elevated.
12s ago
High CPU
app-srv-01
CPU sustained above 78% for 3 minutes.
1m ago
Connection Saturated
lb-primary → app-srv-01
Link saturation at 78%. Consider adding a replica.
2m ago
Traffic Spike
api-gw
RPS jumped from 1.8k to 2.4k in 30 seconds.
4m ago
Latency Spike Resolved
pg-main
P95 latency returned below 100ms SLA.
7m ago
Cache Hit Rate Low
redis-01
Hit rate dropped to 68%. Review cache TTL config.
12m ago
Memory Pressure
pg-main
Memory at 71%. Monitor for buffer pool exhaustion.
18m ago
Node Recovery
cdn-us
CDN node recovered after brief connectivity drop.
25m ago
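
To put the Traffic Spike alert for api-gw in proportion, the sketch below computes the relative change between the two RPS values it reports. The inputs are the rounded figures from the alert text (1.8k and 2.4k), so the result is approximate. `relative_change` is a hypothetical helper, not something the dashboard exposes.

```python
def relative_change(before: float, after: float) -> float:
    """Return the fractional change from `before` to `after`."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (after - before) / before


# Rounded values from the Traffic Spike alert: 1.8k -> 2.4k RPS.
jump = relative_change(1800, 2400)
print(f"api-gw RPS change: {jump:.1%}")  # api-gw RPS change: 33.3%
```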