RRD broken image

How can I solve this problem?



[root@librenms librenms]# ./daily.sh

Re-running /opt/librenms/daily.sh as librenms user
Updating to latest codebase OK
Updating Composer packages OK
Updating SQL-Schema OK
Updating submodules OK
Cleaning up DB OK
Fetching notifications OK
Caching PeeringDB data OK
[root@librenms librenms]# ./validate.php

Component Version
LibreNMS 1.55-41-g5a8d3ae
DB Schema 2019_09_05_153524_create_notifications_attribs_index (141)
PHP 7.2.10
MySQL 5.5.60-MariaDB
RRDTool 1.4.8
SNMP NET-SNMP 5.7.2
====================================

[OK] Composer Version: 1.9.0
[OK] Dependencies up-to-date.
[OK] Database connection successful
[OK] Database schema correct
[WARN] Some devices have not been polled in the last 5 minutes. You may have performance issues.
[FIX]:
Check your poll log and see: Performance - LibreNMS Docs
Devices:
xx-nasfs1.micb2b.com
[FAIL] We have found some files that are owned by a different user than librenms, this will stop you updating automatically and / or rrd files being updated causing graphs to fail.
[FIX]:
sudo chown -R librenms:librenms /opt/librenms
sudo setfacl -d -m g::rwx /opt/librenms/rrd /opt/librenms/logs /opt/librenms/bootstrap/cache/ /opt/librenms/storage/
sudo chmod -R ug=rwX /opt/librenms/rrd /opt/librenms/logs /opt/librenms/bootstrap/cache/ /opt/librenms/storage/
Files:
/opt/librenms/.git/refs/tags/1.53
/opt/librenms/.git/refs/tags/1.53.1
/opt/librenms/.git/objects/pack/pack-4738c85e7d31e6df90019c00b042a9c5ebab1e1f.pack
/opt/librenms/.git/objects/pack/pack-4738c85e7d31e6df90019c00b042a9c5ebab1e1f.idx
/opt/librenms/LibreNMS/Alert/Transport/Ciscospark.php
/opt/librenms/LibreNMS/Alert/Transport/Dummy.php
/opt/librenms/LibreNMS/Alert/Transport/Api.php
/opt/librenms/doc/General/Changelogs/2017.md
/opt/librenms/doc/General/Changelogs/2018.md
/opt/librenms/html/images/logos/unitrends.png
/opt/librenms/html/images/os/panasonic.png
/opt/librenms/html/images/os/unitrends.png
/opt/librenms/html/plugins/bk-Weathermap
/opt/librenms/html/plugins/bk-Weathermap/.git
/opt/librenms/html/plugins/bk-Weathermap/.git/refs
and 468 more…
[root@librenms librenms]#

Is it just for that one device that your getting GAPS on all graphs?

If you are getting GAPS on all graphs for all devices, i would add localhost as a device, and see if the LibreNMS server itself has any GAPS.

also try to run those commands in the validate.php output

Hello, some machines have such problems. Not all
Is there any place to adjust?

[root@librenms librenms]# sudo chown -R librenms:librenms /opt/librenms
[root@librenms librenms]# sudo setfacl -d -m g::rwx /opt/librenms/rrd /opt/librenms/logs /opt/librenms/bootstrap/cache/ /opt/librenms/storage/
[root@librenms librenms]# sudo chmod -R ug=rwX /opt/librenms/rrd /opt/librenms/logs /opt/librenms/bootstrap/cache/ /opt/librenms/storage/
[root@librenms librenms]# ./validate.php 
====================================
Component | Version
--------- | -------
LibreNMS  | 1.55-43-gc4569a6
DB Schema | 2019_09_05_153524_create_notifications_attribs_index (141)
PHP       | 7.2.10
MySQL     | 5.5.60-MariaDB
RRDTool   | 1.4.8
SNMP      | NET-SNMP 5.7.2
====================================

[OK]    Composer Version: 1.9.0
[OK]    Dependencies up-to-date.
[OK]    Database connection successful
[OK]    Database schema correct
[WARN]  Your install is over 24 hours out of date, last update: Wed, 25 Sep 2019 06:42:45 +0000
	[FIX]: 
	Make sure your daily.sh cron is running and run ./daily.sh by hand to see if there are any errors.
[root@librenms librenms]#

Run through the performance guide https://docs.librenms.org/Support/Performance/ sometimes this can help.

Can you tie any similarities over the devices with GAPS in, do they go through a certain firewall, are they are a particular OS, do they have more ports etc… you may want to run some snmpwalks and check for packet loss at those times.

On the device, check Graphs -> Poller modules, do you see any spikes at the same time, or are they GAPS also ?

Gaps means no data was collected. This happens when you take the server down for maintenance or sometime backup routines can cause this by pausing the VM. In your case it is likely what Chas mentioned above.

Hello, everyone
There is no backup arrangement at the moment. The traffic is not particularly abnormal. The ping value is not particularly high before the exception.
Performance tuning I currently use py 8. Hardware is RAM: 16G CPU: 16CORE monitors 150 devices

yeah, the gap means no data was stored during that timeframe. RRD doesn’t support filling past data.