Rule Engine
Overview
In data centre operations, a rule engine with alerts for various metrics is essential for proactive monitoring and management of critical components and services. Let's see the different types of rule engine alerts for specific metrics in a data centre environment
ASIC IPv4 Routes ASIC IPv6 Routes BGP Neighbours Down CPU Core Temperature CPU Utilization DISK Health DISK Temperature DISK Used Memory Percent Device Down Device Queue Transmit Counter Docker CPU Utilization Docker Down Docker MEM Utilization Dynamic IP Change Dynamic IP Change With Only Conflicts FAN Speed Failed FANs Failed PSUs Memory Utilization PSU Temperature Unhealthy Devices
Interface Flap Interface PFC Receive Counters Interface PFC Transmit Counters Interface Queue Transmit Counters Traffic InDiscards Traffic InErrors Traffic OutDiscards Traffic OutErrors Traffic Rx Utilization Traffic Tx Utilization Transceiver Rx Power Transceiver Temperature Transceiver Tx Power Transceiver Voltage
CPU Core Temperature CPU Utilization DISK Health DISK Temperature DISK Used Memory Percent Device Down Docker Down GPU Memory Utilization GPU PSU 1 Power Draw GPU PSU 2 Power Draw GPU Temperature GPU Utilization Memory Utilization
Push Notification
Rule Engine pushes the configured rule notification in case any device breaches the threshold value configured under the rule to
Slack channel
Zendesk Support ticket
Service Now ticket
To use Rule Engine Alert feature User needs to setup first Slack channel integration, Zendesk Support integration or Service-Now integration
Last updated
Was this helpful?