Device Down due to No ICMP Response - Every 60 seconds

Ever since version 23.6.0 for sure, but possibly 23.5.0 any time I add a new linux server to my LibreNMS instance (I have a couple and they all do the same thing), I get constant alerts for device down due to no ICMP response every 60 seconds until I run the the daily.sh script on the poller that is responsible for monitoring the device. What’s even more odd is my delay on that alert is 31 minutes, so why is it firing every 60 seconds, even if the device was down?

Standard ping and fping show normal results (<1ms latency) when I run them manually from the same poller. I have tried rebooting the main LibreNMS server and remote probe and this doesn’t help, but the alerts almost immediately stop when the ./daily.sh script get’s run, so it seems to be something happening in there that gets cleaned up or something.

This happens on the main LibreNMS Server and remote pollers. ./validate.php doesn’t show anything wrong while the issue is happening. Wondering if anyone else is experiencing the same issue, or has any ideas where to look?

===========================================
Component | Version
--------- | -------
LibreNMS  | 23.6.0 (2023-06-23T08:25:03-05:00)
DB Schema | 2023_04_27_164904_update_slas_opstatus_tinyint (252)
PHP       | 8.2.8
Python    | 3.8.10
Database  | MariaDB 10.3.38-MariaDB-0ubuntu0.20.04.1-log
RRDTool   | 1.7.2
SNMP      | 5.8
===========================================

[OK]    Composer Version: 2.5.8
[OK]    Dependencies up-to-date.
[OK]    Database connection successful
[OK]    Database Schema is current
[OK]    SQL Server meets minimum requirements
[OK]    lower_case_table_names is enabled
[OK]    MySQL engine is optimal
[OK]    Database and column collations are correct
[OK]    Database schema correct
[OK]    MySQl and PHP time match
[OK]    Distributed Polling setting is enabled globally
[OK]    Connection to memcached is ok
[OK]    Connected to rrdcached
[OK]    Active pollers found
[OK]    Dispatcher Service not detected
[OK]    Locks are functional
[OK]    Python poller wrapper is polling
[OK]    Redis is unavailable
[OK]    rrdtool version ok
[OK]    Connected to rrdcached

Are you saying after running daily.sh the devices then start responding ok?

Yeah, snmp is fine, it’s just the ICMP alerts.

I’ve done a diff between version 23.4.1 and 23.7 (latest) and can’t see anything substantial that’s changed, especially around the icmp code.

Can you roll back to 23.4.1 and try again to see?

I’ve never done a rollback, are there instructions on this?

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.