Hello!
We started to experience alert notifications to be sent out two times consecutively upon a single polling time, but it dies not happen every polling interval. The main problem here is that I cannot find a way to troubleshoot this behavior properly, as it is completely unpredictable.
From what we have found so far:
- test-alert.php script does not send multiple alerts.
- poller.php does send multiple alerts, but approximately once per two polling intervals. Polling time does not affect this behavior - despite the time it takes to poll a device, alerts can be sent multiple times.
- The problem seems to have emerged after this commit: https://github.com/librenms/librenms/pull/9136
- The problem persists with multiple alert rules and multiple devices, this is not related.
- I think the problem is not related to transport, since multiple notification sending is logged in event log.
- If we revert to older commit, the problem perishes: https://github.com/librenms/librenms/pull/9221
Event log sample: http://prntscr.com/kyhgjc
LibreNMS central validate output:
====================================
Component | Version
--------- | -------
LibreNMS | 1.43-110-gd9b4fb5
DB Schema | 268
PHP | 5.6.37-0+deb8u1
MySQL | 5.7.20-log
RRDTool | 1.6.0
SNMP | NET-SNMP 5.7.2.1
====================================
LibreNMS poller validate output (with problem):
====================================
Component | Version
--------- | -------
LibreNMS | 1.43-102-gd9b4fb5
DB Schema | 268
PHP | 5.6.37-0+deb8u1
MySQL | 5.7.20-log
RRDTool | 1.6.0
SNMP | NET-SNMP 5.7.2.1
====================================
LibreNMS poller validate output (without problem, same poller):
====================================
Component | Version
--------- | -------
LibreNMS | 1.43-102-g2757232
DB Schema | 268
PHP | 5.6.37-0+deb8u1
MySQL | 5.7.20-log
RRDTool | 1.6.0
SNMP | NET-SNMP 5.7.2.1
====================================
Does anybody know how can I troubleshoot this behavior?