Would anyone care to share their LibreNMS alerts & templates? Here are 24 of mine

zimmertr · 5 September 2017 17:02

LibreNMS doesn't ship with many example alerts. It took me a significant amount of time to come up with the following rules, so I thought I would create a repository to keep them in case I need to rebuild my LibreNMS server in the future.

I’m looking forward to seeing what rules the rest of the community has designed. Here are some other things I would be interested in monitoring:

Disk IOPS
CPU Contention
Disk Read/Write Latency
Memory Contention
HTTPS Endpoint Availability
ICMP response time
Successful/Failed SSH/RDP connection initiation

PS: If you have any feedback for the Alert-related issues described in the TODO section of the GitHub README I would love some insight.

Kevin_Krumm · 5 September 2017 17:26

Very nice, thank you for sharing. When I get a second I will post some of mine.

Also, LibreNMS has an alert collection: https://github.com/librenms/librenms/blob/master/misc/alert_rules.json

Kevin_Krumm · 5 September 2017 17:27

Here is the one I use for login failures.
%syslog.timestamp >= %macros.past_5m && %syslog.msg ~ "@authentication failure@"

All of my Alert Rules: kkrumm NMS Alert Rules - Pastebin.com
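
For readers new to the rule syntax, here is a rough Python sketch of what the login-failure rule above checks. It only illustrates the logic. LibreNMS evaluates rules against its database, not like this, and the function names are made up for the example. It assumes that `@` inside a `~` pattern stands for "any run of characters" and that `%macros.past_5m` means "five minutes ago".

```python
import re
from datetime import datetime, timedelta


def wildcard_to_regex(pattern: str) -> "re.Pattern[str]":
    """Turn a rule pattern where '@' means 'anything' into a regex.

    Hypothetical helper: literal parts are escaped, each '@' becomes '.*'.
    """
    literal_parts = [re.escape(part) for part in pattern.split("@")]
    return re.compile(".*".join(literal_parts))


def login_failure_matches(msg: str, timestamp: datetime, now: datetime) -> bool:
    """Mirror of: timestamp >= past_5m && msg ~ "@authentication failure@"."""
    is_recent = timestamp >= now - timedelta(minutes=5)
    pattern = wildcard_to_regex("@authentication failure@")
    return is_recent and pattern.search(msg) is not None
```

A syslog line only matches if it is both recent and contains the phrase, so old authentication failures stop triggering the rule once they fall out of the five-minute window.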

florianbeer · 5 September 2017 21:10

The last 3 of your list can be easily checked using LibreNMS’ “Services”.
check_http
check_icmp
check_ssh

An alert rule to get notifications for service warnings would be

%services.service_status = "1"

or

%services.service_status = "2"

for status critical.
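
As a small illustration of how those two values map to severities, here is a Python sketch. It assumes `service_status` follows the usual Nagios plugin exit-code convention (0 = OK, 1 = warning, 2 = critical, anything else = unknown). The names below are made up for the example and are not part of LibreNMS.

```python
# Assumed mapping, following the Nagios plugin exit-code convention.
SERVICE_STATUS_LABELS = {0: "ok", 1: "warning", 2: "critical"}


def describe_service_status(service_status: int) -> str:
    """Return a human-readable label for a service_status value."""
    return SERVICE_STATUS_LABELS.get(service_status, "unknown")


def rule_fires(service_status: int, rule_value: int) -> bool:
    """Mirror of a rule like: %services.service_status = "<rule_value>"."""
    return service_status == rule_value
```

With this reading, the rule with "1" fires only on warnings and the rule with "2" only on criticals, so you would need both rules (or one combined rule) to cover both cases.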

FTBZ · 6 September 2017 05:18

Excellent, thanks for sharing.

aldemir_a · 6 September 2017 07:46

Most important ones for me are the Disk Space checks.
So for Critical alerts:

%storage.storage_perc >= "95" && %devices.type = "Server"

and for warnings:

%storage.storage_perc >= "85" && %storage.storage_perc < "95" && %devices.type = "Server"

However, to fine-tune this, the percentages should be higher for servers with huge disks. In Nagios I would create different host groups for these kinds of servers, but never mind, we'll get there in good time.
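
To make the two bands explicit, here is a minimal Python sketch of the same logic. It is only an illustration, not how LibreNMS evaluates rules. The function name is made up, and comparing the device type case-insensitively is an assumption.

```python
from typing import Optional


def disk_severity(storage_perc: float, device_type: str,
                  warn: float = 85, crit: float = 95) -> Optional[str]:
    """Classify disk usage the way the two rules above do.

    Returns "critical", "warning", or None when neither rule would fire.
    """
    # Assumption: device type is compared case-insensitively.
    if device_type.lower() != "server":
        return None
    if storage_perc >= crit:
        return "critical"
    if storage_perc >= warn:
        # Critical was already ruled out, so this is 85 <= usage < 95.
        return "warning"
    return None
```

Raising `warn` and `crit` for servers with huge disks is the fine-tuning mentioned above. For example, `disk_severity(96, "Server", warn=95, crit=98)` would only be a warning.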

laf · 6 September 2017 08:02

@aldemir_a Use the device groups and bind each rule to that group

laf · 6 September 2017 08:03

@zimmertr It would be great if you could contribute those rules to the collections we have that @Kevin_Krumm linked to.

FTBZ · 6 September 2017 09:11

About this, it would be better to be able to list the rules without searching, because sometimes you don't know what to search for.
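
In the meantime, one workaround is to list the rule names straight from a downloaded copy of the collection file. This is a rough Python sketch, assuming the file is a JSON array of objects that each carry a "name" and a "rule" key. That layout is an assumption, so check it against the actual file. The sample entries and their names are made up from rules in this thread.

```python
import json
from typing import List


def list_rule_names(collection_json: str) -> List[str]:
    """Return the sorted rule names found in the collection JSON text.

    Assumes a JSON array of objects with a "name" key; entries without
    one are skipped.
    """
    entries = json.loads(collection_json)
    return sorted(entry["name"] for entry in entries if "name" in entry)


# Hypothetical sample in the assumed layout, not copied from the real file.
SAMPLE = """
[
  {"name": "Service critical", "rule": "%services.service_status = \\"2\\""},
  {"name": "Disk over 95%", "rule": "%storage.storage_perc >= \\"95\\""}
]
"""

for name in list_rule_names(SAMPLE):
    print(name)
```

For the real file you would read its text first, for example with `open("alert_rules.json").read()`, and pass that to `list_rule_names`.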

florianbeer · 6 September 2017 11:56

You can use

%storage.storage_perc >= %storage.storage_perc_warn && %devices.type = "server"

and then adjust the storage warning levels in the device settings individually.
That’s exactly what they are there for IMHO.

aldemir_a · 6 September 2017 12:29

Thanks for the good tip. However, I would avoid setting things per device as much as I can. It might create headaches eventually.

Kevin_Krumm · 21 January 2018 17:11

Just wanted to say that we added more alert rules to the collection. 84 in total and still growing.

Eric_Garnel · 1 October 2018 14:13

These are awesome. Is there a way to import alert rules from file or copy-paste?

Kevin_Krumm · 1 October 2018 17:09

A lot of these were added to the alerts collection. Take a gander.