Arista threshold values not populating correctly?

When asking for help and support, please provide as much information as possible. This should include:

  • Steps to reproduce an issue.
  • The output of ./validate.php

If it’s an issue with the WebUI then please consider including a screenshot and the browser version you are using.

If you are having troubles with discovery/polling include the pastebin output of:

./discovery.php -h HOSTNAME -d | ./pbin.sh
./poller.php -h HOSTNAME -r -f -d | ./pbin.sh

If you need to post any text longer than a few lines, please use a pastebin service such as https://p.libren.ms using non-expiring pastes.

I’m trying to monitor a bunch of Arista infrastructure but am getting a tonne of alerts for RX/TX power and RX/TX bias from each switch. It’s hard to know whether these alerts are valid. Judging by GitHub issue #7078 in librenms/librenms ("Update LibreNMS to grab Arista optic high/low sensor values"), Libre appears to support pulling threshold values from Arista switches. However, when I compare the threshold values for two identical switches, they’re different, which would suggest they were calculated on the fly when the device was discovered?

Switch models include 7010T-48, 7050SX3-48YC8

Here’s my ./validate.php:

librenms@librenms01:~$ ./validate.php

Component Version
LibreNMS 23.9.1-104-g0e453e593 (2023-10-18T08:00:39+01:00)
DB Schema 2023_10_14_162234_restore_bgp_peers_cbgp_delta_fields (267)
PHP 8.1.2-1ubuntu2.14
Python 3.10.12
Database MariaDB 10.6.12-MariaDB-0ubuntu0.22.04.1
RRDTool 1.7.2
SNMP 5.9.1
===========================================

[OK] Composer Version: 2.4.4
[OK] Dependencies up-to-date.
[OK] Database connection successful
[OK] Database Schema is current
[OK] SQL Server meets minimum requirements
[OK] lower_case_table_names is enabled
[OK] MySQL engine is optimal
[OK] Database and column collations are correct
[OK] Database schema correct
[OK] MySQl and PHP time match
[OK] Active pollers found
[OK] Dispatcher Service is enabled
[OK] Locks are functional
[OK] No active python wrapper pollers found
[OK] Redis is unavailable
[OK] rrdtool version ok
[OK] Connected to rrdcached

Can you compare with the output of show int et x/y tra det and see whether the thresholds are correct?

Hey

The RX/TX power thresholds do seem to match the switch’s values upon examination.
It’s just the RX/TX bias thresholds that seem to be a bit all over the place. Do you know where these can be examined on the switch?

Thanks

Sorry to bug you - I was just wondering if you’d seen my update and question?

Hi, did you try something like sh int et x/y tra det or sh int et x/y tra dom thresh?

The values in LibreNMS don’t match the results I’m reading out from the switch…
Looks like I’m going to have to amend them manually in device health settings, unless you know of a better way?

Can you provide an example of what you have from the switch vs from librenms?
It looks like there is a special case for arista dbm sensors here: fix: Arista watts to dbm conversion · librenms/librenms@883095d · GitHub
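
For reference, the standard watts-to-dBm relationship that the commit title refers to can be sketched as below. This is only the textbook formula (dBm = 10 * log10 of the power in milliwatts), not the code from that commit, and the function name watts_to_dbm is made up for illustration.

```python
import math


def watts_to_dbm(watts: float) -> float:
    """Convert optical power in watts to dBm (decibels relative to 1 mW).

    Hypothetical helper for illustration; not taken from LibreNMS.
    """
    if watts <= 0:
        # log10 is undefined for zero or negative power, so report -infinity
        return float("-inf")
    milliwatts = watts * 1000.0
    return 10.0 * math.log10(milliwatts)


if __name__ == "__main__":
    print(round(watts_to_dbm(0.001), 2))     # 1 mW  -> 0.0
    print(round(watts_to_dbm(0.000001), 2))  # 1 uW  -> -30.0
```

If the sensor is read in watts and the thresholds are not converted the same way (or vice versa), the two would not be comparable, which is the kind of mismatch worth ruling out.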

An example would be ethernet port 18:

LON-CO3-SW-P-SFP-04-CL#show interfaces ethernet 18 transceiver dom thresholds
Ch: Channel, mA: milliamperes, dBm: decibels (milliwatts),
C: Celsius, V: Volts, NA or N/A: not applicable.

Port 18
Last update: 0:00:00 ago
                                              High Alarm  High Warn   Low Warn  Low Alarm
                                       Value   Threshold  Threshold  Threshold  Threshold  Unit  Indicator
                                 ---------------------------------------------------------------------------
   Temperature                         31.66       80.00      75.00       0.00      -5.00     C
   Voltage                              3.35        3.60       3.50       3.10       3.00     V
Ethernet18 (Channel 1)
   TX bias current                      0.00       90.00      85.00      10.00       5.00    mA      ALARM
   Optical TX power                   -30.00        0.00      -1.00      -6.00      -7.00   dBm      ALARM
   Optical RX power                   -30.00        0.00      -1.00     -14.00     -15.00   dBm      ALARM

The DOM TX Bias values in LibreNMS are:

High: 0.09
High warn: 0.085
Low warn: 0.01
Low: 0.005

Typing those values made me realise that Libre is applying a conversion to the values it reads from the switch. Judging by the link you provided, could this conversion be going wrong?

Cheers

Actually my apologies; looking at those alert thresholds, the alerts are being generated correctly by LibreNMS.
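
To spell out the comparison: the switch reports TX bias thresholds in mA, and the LibreNMS values above look like the same numbers expressed in amperes (that unit is my assumption, based only on the factor of 1000). A quick sketch using the port 18 numbers from this thread; the names ma_to_a and thresholds_match are made up for illustration:

```python
import math

# TX bias thresholds for Ethernet18 as shown by the switch, in milliamperes
switch_thresholds_ma = {"high": 90.00, "high_warn": 85.00, "low_warn": 10.00, "low": 5.00}

# TX bias thresholds as shown in LibreNMS (assumed to be amperes)
librenms_thresholds = {"high": 0.09, "high_warn": 0.085, "low_warn": 0.01, "low": 0.005}


def ma_to_a(milliamps: float) -> float:
    """Convert milliamperes to amperes."""
    return milliamps / 1000.0


def thresholds_match(switch_ma: dict, librenms_a: dict) -> bool:
    """Return True if every switch threshold, converted to A, equals the LibreNMS one."""
    return all(
        math.isclose(ma_to_a(value), librenms_a[name], rel_tol=1e-9)
        for name, value in switch_ma.items()
    )


if __name__ == "__main__":
    for name, value in switch_thresholds_ma.items():
        print(f"{name}: {value} mA -> {ma_to_a(value)} A (LibreNMS: {librenms_thresholds[name]})")
    print("all thresholds match:", thresholds_match(switch_thresholds_ma, librenms_thresholds))
```

So for this port the thresholds agree once the units are lined up.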

My problem is that these alerts are being generated for pretty much every port on every Arista switch we have here, which is incredibly noisy. I think I’m going to have to turn them off.
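
For what it’s worth, the readings on port 18 (0.00 mA bias, -30.00 dBm TX and RX) sit below the low alarm thresholds, so the switch itself flags them as ALARM. A rough sketch of that check using the numbers from the output above; the classify function is made up for illustration, and whether the switch or LibreNMS treats the boundary as inclusive is an assumption:

```python
def classify(value, high_alarm, high_warn, low_warn, low_alarm):
    """Classify a DOM reading against its four thresholds.

    Boundary handling (>= / <=) is an assumption made for this sketch.
    """
    if value >= high_alarm or value <= low_alarm:
        return "ALARM"
    if value >= high_warn or value <= low_warn:
        return "WARN"
    return "OK"


# (value, high alarm, high warn, low warn, low alarm) from the port 18 output
port18 = {
    "Temperature (C)": (31.66, 80.00, 75.00, 0.00, -5.00),
    "Voltage (V)": (3.35, 3.60, 3.50, 3.10, 3.00),
    "TX bias current (mA)": (0.00, 90.00, 85.00, 10.00, 5.00),
    "Optical TX power (dBm)": (-30.00, 0.00, -1.00, -6.00, -7.00),
    "Optical RX power (dBm)": (-30.00, 0.00, -1.00, -14.00, -15.00),
}

if __name__ == "__main__":
    for name, reading in port18.items():
        print(f"{name}: {classify(*reading)}")
```

This reproduces the Indicator column from the switch: temperature and voltage are fine, while bias and both optical power readings are in alarm. If the other noisy ports show similar readings, that might point to ports with no active link rather than bad thresholds, though that is a guess.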

You can try to keep at least the RX power alerts at first, since low RX power on a live link usually means packet loss. Do you have RX power alerts on all your ports too?
