Observability stack for validator nodes
Hands-on steps
- Simulate lag incident signal:
cd examples/validator-ops
./scripts/simulate-head-lag.sh
cat artifacts/incident-head-lag.log
- Triage checklist (write in runbook):
- local vs network-wide lag
- import latency correlation
- peer churn check
- Add a "Detection" section to:
artifacts/validator-ops-runbook.md.
Expected evidence
artifacts/incident-head-lag.logcontainsALERT=PAGE
Verification
grep -q "ALERT=PAGE" artifacts/incident-head-lag.log && echo PASS