Previous Topic: Override Speed In and Speed Out Values on InterfacesNext Topic: Performance Management Events


Eventing

This section contains the following topics:

Event Performance Guidelines

Performance Management Events

Baseline Averages

How to Monitor Device Performance Using Events

Monitoring Metrics with Event Rules

View Events

How to Configure Notifications from Event Manager

Event Performance Guidelines

The following configuration was used to validate and benchmark event performance:

How to Monitor Event Processing

To determine if you are doing too much eventing, you need to monitor a few key performance indicators in Data Aggregator. Eventing in Data Aggregator is performed in batches (such as, events are evaluated and generated for large groups of items at once). For this reason, we used a variety of metrics that were tracked through the Data Aggregator system’s self-monitoring mechanism to assess the health of the Data Aggregator system. To view these important metrics, add a custom IM Device MultiTrend view to a dashboard. Edit the dashboard, using the following metrics from the metric family Data Aggregator Event Calculation Times:

By watching all of these metrics over time you can judge the health of event performance on your system. Additionally, if the Karaf log on the Data Aggregator system contains database and/or other errors, this can be an indication of a system under stress. In general, these self-monitored metrics should be steady. However during the evening hours (by default between 2 and 4 AM UTC), some database intensive jobs are run which can cause fluctuations in the self-monitored metrics. If the metrics return to a steady state, the system can be considered still in good health (although events can be delayed during the time the system is busy).

We recommend that you turn on eventing slowly and judge the system health before moving forward with different rules. We also recommend that you monitor the health of the system over 24 hours after each subsequent change, as there is nightly processing that can have an impact even though eventing may appear steady through-out the day-time hours.

How to Remediate When the Threshold is Exceeded

To remediate when you exceed the threshold, follow this process:

  1. Turn off event rules one at a time. Check the performance after you turn off each rule before turning off another rule.
  2. Reduce the number of items being polled.
  3. Reduce the number of monitoring profiles with event rules that are polling items.
  4. If these steps do not improve the performance, contact CA Support.