Cisco Catalyst switches stack ports sensor alert

Hello LibreNMS community,

Today, I suddenly noticed lots of alerts related to Cisco Catalyst stack ports, screenshot attached below:

Multiple devices were affected, with different software and hardware versions:

  • 2960X
  • 9200L

Nothing changed in the Network Infrastructure, and LibreNMS ran long before this happened.

librenms@ow-lnms01:~$ ./validate.php
===========================================
Component | Version
--------- | -------
LibreNMS  | 24.8.0-57-g30b174508 (2024-09-08T22:00:42+00:00)
DB Schema | 2024_08_27_182000_ports_statistics_table_rev_length (299)
PHP       | 8.1.10
Python    | 3.10.12
Database  | MariaDB 10.6.12-MariaDB-0ubuntu0.22.04.1
RRDTool   | 1.7.2
SNMP      | 5.9.1
===========================================

[OK]    Composer Version: 2.7.8
[OK]    Dependencies up-to-date.
[OK]    Database connection successful
[OK]    Database connection successful
[OK]    Database Schema is current
[OK]    SQL Server meets minimum requirements
[OK]    lower_case_table_names is enabled
[OK]    MySQL engine is optimal
[OK]    Database and column collations are correct
[OK]    Database schema correct
[OK]    MySQL and PHP time match
[OK]    Active pollers found
[OK]    Dispatcher Service not detected
[OK]    Locks are functional
[OK]    Python poller wrapper is polling
[OK]    Redis is unavailable
[OK]    rrdtool version ok
[OK]    Connected to rrdcached

I understand that we can hide those alerts, but I’d like to understand what exactly triggered the problem.

I saw some other topics, which could be related, but didn’t find any obvious answers, what triggered the problem and how it was fixed:

Thank you in advance

I am having the same issue. It started on Saturday at 6am Eastern.

librenms@fgfnms ~ % ./validate.php
===========================================
Component | Version
--------- | -------
LibreNMS  | 24.8.0-61-gf1e7a218f (2024-09-09T12:48:07-04:00)
DB Schema | 2024_08_27_182000_ports_statistics_table_rev_length (300)
PHP       | 8.3.11
Python    | 3.10.12
Database  | MariaDB 11.3.2-MariaDB-1:11.3.2+maria~ubu2204
RRDTool   | 1.7.2
SNMP      | 5.9.1
===========================================

[OK]    Composer Version: 2.7.9
[OK]    Dependencies up-to-date.
[OK]    Database connection successful
[OK]    Database connection successful
[OK]    Database Schema is current
[OK]    SQL Server meets minimum requirements
[OK]    lower_case_table_names is enabled
[OK]    MySQL engine is optimal
[OK]    Database and column collations are correct
[OK]    Database schema correct
[OK]    MySQL and PHP time match
[OK]    Active pollers found
[OK]    Dispatcher Service not detected
[OK]    Locks are functional
[OK]    Python poller wrapper is polling
[OK]    Redis is unavailable
[OK]    rrdtool version ok
[OK]    Connected to rrdcached

It looks like they updated /opt/librenms/includes/discovery/sensors/state/cisco.inc.php 3 days ago and it was pushed via daily. I reverted back to the old version and I am no longer getting the alerts.

I had the issue too, I manually disabled all these checks on all my switches

I’ve done a PR to fix this: Changed the cswRingRedundant failed state to a soft fail by laf · Pull Request #16379 · librenms/librenms · GitHub if someone wants to test it?

It basically changes the severity of the false state so most likely won’t trigger your rules now.

This is Cisco Stack state errors on unstacked device · Issue #16374 · librenms/librenms · GitHub

@laf I think your PR is fixing the problem in the wrong place/way.

The devices with these issues aren’t stacked so shouldn’t be tracking these at all. For stacked devices, it probably should remain as a hard fail.

I’ve got a fix suggestion in my issue above (which basically keeps the behaviour as it was 5 days ago, but with the updated code from @murrant in cisco.inc.php) but haven’t had a chance to PR the fix myself yet.