Dashboard Explained
The Dashboard page is meant as a quick status overview of the system. While it gathers the most important health information about your devices in one place, it may take a little time to get used to. Let's explain the main elements of the page.
Global status
The global DWARFG_LONG service status is the first light on the page. You may notice that its color matches the color of the light in the top status bar - and that clicking the status light in the top bar always takes you to the Dashboard.
The different light meanings are:
- Green ... all good. Global status detailed explanation: DWARFG_LONG service is 100% healthy and none of the registered devices are misbehaving above a reasonable margin (slight connection delays are tolerated).
- Yellow ... warning, some action is pending or something is not ok. Global status detailed explanation: While DWARFG_LONG service is running fine, there are pending actions, devices that are misbehaving or late to report, raised alerts, or a problem that we expect to occur in the future. Examples:
  - There are alerts raised for some devices/groups.
  - There is a configuration update pending for a device (the device configuration does not match the desired configuration).
  - There are no free device seats available.
  - The license is close to its expiration date.
- Red ... error. Global status detailed explanation: DWARFG_LONG service is not running or is having problems, so the real status of the devices cannot be determined reliably.
- Grey (only in the top bar) ... DWARFG_LONG UI has issues determining DWARFG_LONG service state.
Please note that the Dashboard is not refreshed automatically, so the actual status of the system may change while the page is open. To see the update, you need to refresh the page. The global status icon in the top bar does refresh regularly, so you can use it to spot changes in the global state. If the color in the top bar differs from the one you see on the Dashboard, simply reload the Dashboard page.
If the Dashboard page global status is not green, DWARFG_LONG tries to help you by giving a quick indicator of what is not ok in the caption below the global status light.
Global component lights
Next to the global status light, the status is further broken down into four additional lights:
- Number of free device slots ... when there are free device slots, the light is green and shows the number of free slots. If you are at capacity and no new devices are trying to register, the light is yellow. If registrations of new devices have been declined because there are no free slots left, the light is red and shows a negative number: the count of declined registration attempts since service startup.
- Number of days left until the license expires ... red if the license has expired, yellow when the license is about to expire (<20 days left) and green otherwise.
- Tunnel status indicator ... shows the number of defined tunnels together with the worst of their statuses - green if no issue is found, red if a problem is detected with any tunnel.
- Group status indicator ... shows the number of groups and the worst status among them. Because groups are composed of devices, this effectively tells you whether the devices are performing as expected. If there is a sync error or a raised alert, this light goes red. If there is e.g. a past alert (not raised at the moment but triggered since the last manual alert cleanup), the light goes yellow. If all groups (and thus all devices) are ok, the light is green.
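The color rules for the component lights can be summarized in a short Python sketch. This is only an illustration of the rules described above, not the actual DWARFG_LONG implementation: the function names and parameters are hypothetical, treating zero days left as an expired license is an assumption, and so is letting free slots take precedence over the declined-registration counter.

```python
from typing import Tuple

# Threshold taken from the text: yellow when fewer than 20 days are left.
LICENSE_WARNING_DAYS = 20


def slot_light(free_slots: int, declined_attempts: int) -> Tuple[str, int]:
    """Return (color, number shown) for the free device slots light."""
    if free_slots > 0:
        # Free slots available: green, showing the number of free slots.
        return ("green", free_slots)
    if declined_attempts > 0:
        # At capacity and registrations were declined: red, negative counter.
        return ("red", -declined_attempts)
    # At capacity, but nobody was turned away.
    return ("yellow", 0)


def license_light(days_left: int) -> str:
    """Return the color of the license light."""
    if days_left <= 0:  # assumption: zero or fewer days left means expired
        return "red"
    if days_left < LICENSE_WARNING_DAYS:
        return "yellow"
    return "green"


def group_light(sync_errors: int, raised_alerts: int, past_alerts: int) -> str:
    """Return the color of the group status light (worst status wins)."""
    if sync_errors > 0 or raised_alerts > 0:
        return "red"
    if past_alerts > 0:
        # The text gives past alerts as one example of a yellow condition;
        # other yellow conditions may exist and are not modeled here.
        return "yellow"
    return "green"
```

For example, `slot_light(0, 3)` yields a red light showing -3, and `license_light(19)` yields yellow.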
Problem summary
If no problem has been detected, you will not see anything between the component lights and the groups breakdown. Otherwise, DWARFG_LONG tries to give a one-sentence summary for every type of problem detected, to help you identify where the problem is coming from.
Monitoring groups breakdown
Every defined monitoring group has its own entry in the Dashboard. Until you define your own groups, you will see only the special All devices system group, which simply contains all of the devices.
Every group status breaks down into three components, each shown as three lights in a semaphore fashion. Before we get to the semaphore descriptions, please note that the caption of each component light is clickable, so you can quickly access the exact list of devices in that state.
- Freshness ... shows whether some devices are lingering or late
  - Online ... the number of devices that are on time with data updates.
  - Delayed ... the number of devices that are slightly delayed with their data update. NOTE: unless all of your devices are Delayed, the semaphore intentionally shows green, as slight delays are not considered a warning state for the whole Monitoring Group.
  - Late ... the number of devices that are considered too late with their data update, which causes an error to be reported.
- Sync status ... shows whether there are pending configuration updates or update errors
  - Synced ... the number of devices that are in a fully synchronized state - that is, there is no pending configuration change waiting for them.
  - Pending ... the number of devices for which a configuration change is ready to be picked up on their next data exchange.
  - Error ... the number of devices that picked up the configuration change but did not apply it (this is detected on their next data exchange).
- Alerts ... shows the number of alerts relevant for this particular Monitoring Group
  - Active ... the number of valid and regularly evaluated alerts
  - Past+Inv ... the sum of:
    - the number of alerts triggered in the past (and not acknowledged since) but not raised at the moment
    - the number of Invalid alerts - these are alerts that cannot be evaluated
  - Raised ... the number of alerts raised at the moment
NOTE regarding alerts: alerts that are defined against individual devices (not against any group) are reported among the All devices group alerts.
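As a rough illustration of the Freshness semaphore and the Past+Inv counter described above, consider the following Python sketch. The function names are hypothetical. The mapping of Late devices to red and of "all devices Delayed" to yellow is one reading of the description, not a confirmed rule of the product.

```python
def freshness_light(online: int, delayed: int, late: int) -> str:
    """Return the Freshness semaphore color for one Monitoring Group."""
    if late > 0:
        # Late devices cause an error to be reported.
        return "red"
    total = online + delayed + late
    if total > 0 and delayed == total:
        # Assumed reading: only a group in which every device is Delayed
        # is shown as a warning.
        return "yellow"
    # Slight delays of some devices are tolerated.
    return "green"


def past_inv_count(past_alerts: int, invalid_alerts: int) -> int:
    """Return the Past+Inv counter: past (unacknowledged) plus Invalid alerts."""
    return past_alerts + invalid_alerts
```

With 8 devices Online and 2 Delayed, `freshness_light(8, 2, 0)` stays green, while a single Late device turns the semaphore red.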
Triggered alerts list
If there are any triggered alerts (alerts raised at the moment as well as alerts raised in the past but not yet cleaned up), their breakdown is given below the groups:
- For raised alerts, the following is given:
  - Name/link to the alert definition.
  - Link to the table of all devices for which the alert is raised.
  - Several of these devices are listed directly - if the list is too long, it is clipped.
- For past alerts:
  - Name/link to the alert definition.
  - Link to clear the alert trigger counter ... clicking the link resets the alert trigger counter, and the alert is no longer reported as triggered in the past.