I’m wondering if anyone has an idea or can maybe explain how the “polling duration”, along with the “last polled” time affect each other (if they do at all).
I have roughly 600 devices. Of which the polling duration for each device runs from 10s - 200s. I get a “Warning” when I run validate that “Some devices have not been polled in the last 5 minutes”. Of which it shows me a count of roughly 50-60 devices that haven’t been polled in that 5min time frame.
I’m not sure how to go about troubleshooting this. I’ve disabled all un-needed modules. I’ve enabled the per-port-polling on any of the ones that tend to take a long time.
Certainly some of my devices being polled have some latency, just due to locations (remote from the librenms server), but overall latency generally doesn’t exceed 50-100ms, and no packet loss… generally.
My librenms server is not overly taxed, 8-10% of RAM in use, 20-30% CPU usage in general.
I’m just wondering if anyone has further suggestions on trying to troubleshoot this. I’ve run through all the docs and tweaks from the performance section in the documentation.
[root@nms1 librenms]# ./validate.php
Component | Version |
---|---|
LibreNMS | 1.31.03-43-gf158a56 |
DB Schema | 206 |
PHP | 7.0.22 |
MySQL | 5.5.52-MariaDB |
RRDTool | 1.4.8 |
SNMP | NET-SNMP 5.7.2 |
====================================
[OK] Database connection successful
[OK] Database schema correct
[WARN] Some devices have not been polled in the last 5 minutes.
You may have performance issues. Check your poll log and see: http://docs.librenms.org/Support/Performance/
sw0.blah.dev
and 54 more…