LibreNMS bad performance (timeouts)

There are installs with over 50,000 devices without performance issues.

What is your /etc/php/8.3/fpm/pool.d/librenms.conf file?

The primary issue is that NGINX requests to the PHP-FPM socket /run/php-fpm-librenms.sock are timing out while waiting for a response, likely because PHP-FPM is overloaded, crashing, or misconfigured.

Hi @murrant ,

I currently have 333 devices and 42113 ports.

root@librenms:~# tail -f /var/log/mysql/error.log
2025-04-16  8:40:35 136936 [Warning] Aborted connection 136936 to db: 'librenms' user: 'librenms' host: 'localhost' (Got an error reading communication packets)
2025-04-16 12:15:08 0 [Warning] Aborted connection 0 to db: 'unconnected' user: 'unauthenticated' host: 'connecting host' (Too many connections)
2025-04-16 12:15:12 0 [Warning] Aborted connection 0 to db: 'unconnected' user: 'unauthenticated' host: 'connecting host' (Too many connections)
2025-04-16 12:20:59 168856 [Warning] Aborted connection 168856 to db: 'librenms' user: 'librenms' host: 'localhost' (Got an error writing communication packets)
2025-04-16 12:48:02 172787 [Warning] Aborted connection 172787 to db: 'librenms' user: 'librenms' host: 'localhost' (Got an error reading communication packets)
2025-04-16 17:25:35 211712 [Warning] Aborted connection 211712 to db: 'librenms' user: 'librenms' host: 'localhost' (Got an error writing communication packets)
2025-04-16 18:48:54 223443 [Warning] Aborted connection 223443 to db: 'librenms' user: 'librenms' host: 'localhost' (Got an error reading communication packets)
2025-04-17  0:48:02 272899 [Warning] Aborted connection 272899 to db: 'librenms' user: 'librenms' host: 'localhost' (Got an error writing communication packets)
2025-04-17  6:48:01 322371 [Warning] Aborted connection 322371 to db: 'librenms' user: 'librenms' host: 'localhost' (Got an error reading communication packets)
2025-04-17 12:48:01 372977 [Warning] Aborted connection 372977 to db: 'librenms' user: 'librenms' host: 'localhost' (Got an error reading communication packets)
^C
root@librenms:~# tail -f /var/log/php8.3-fpm.log
[15-Apr-2025 16:44:10] NOTICE: Terminating ...
[15-Apr-2025 16:44:10] NOTICE: exiting, bye-bye!
[15-Apr-2025 16:44:10] NOTICE: fpm is running, pid 1341261
[15-Apr-2025 16:44:10] NOTICE: ready to handle connections
[15-Apr-2025 16:44:10] NOTICE: systemd monitor interval set to 10000ms
[15-Apr-2025 16:45:53] NOTICE: Terminating ...
[15-Apr-2025 16:45:53] NOTICE: exiting, bye-bye!
[15-Apr-2025 16:45:53] NOTICE: fpm is running, pid 1360329
[15-Apr-2025 16:45:53] NOTICE: ready to handle connections
[15-Apr-2025 16:45:53] NOTICE: systemd monitor interval set to 10000ms
^C
root@librenms:~# tail -f /var/log/nginx/error.log
2025/04/17 11:04:58 [error] 758988#758988: *5754 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/dash/device-summary-horiz HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 11:06:59 [error] 758993#758993: *5788 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/dash/device-summary-horiz HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 11:07:01 [error] 758990#758990: *5794 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/table/eventlog HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 11:08:58 [error] 758993#758993: *5834 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/table/eventlog HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 11:08:58 [error] 758993#758993: *5840 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/dash/device-summary-horiz HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 11:10:58 [error] 758993#758993: *5862 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/table/eventlog HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 11:10:58 [error] 758990#758990: *5868 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/dash/device-summary-horiz HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 12:49:02 [error] 758995#758995: *6101 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/table/eventlog HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 12:50:03 [error] 758995#758995: *6101 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/table/eventlog HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
2025/04/17 12:51:04 [error] 758997#758997: *6321 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.6.6.22, server: librenms.sonangol.co.ao, request: "POST /ajax/table/eventlog HTTP/1.1", upstream: "fastcgi://unix:/run/php-fpm-librenms.sock", host: "10.6.1.230", referrer: "http://10.6.1.230/overview?dashboard=2"
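
The NGINX errors above are easier to read once they are tallied per endpoint. The following is a minimal sketch, not part of the original posts: `timeouts_by_endpoint` is a hypothetical helper name, and it assumes the standard NGINX error-log layout shown above, where the request appears as `request: "METHOD /path HTTP/1.1"`.

```shell
# Tally NGINX "upstream timed out" errors per request path.
# Reads an NGINX error log on stdin and prints "<count> <path>", busiest first.
timeouts_by_endpoint() {
    grep 'upstream timed out' |
        # Keep only the request path from: request: "POST /some/path HTTP/1.1"
        sed -n 's/.*request: "[A-Z]* \([^ ]*\) HTTP[^"]*".*/\1/p' |
        sort | uniq -c | sort -rn |
        # Normalise the padded output of uniq -c to "<count> <path>"
        awk '{ print $1, $2 }'
}

# Usage (log path as used in this thread):
#   timeouts_by_endpoint < /var/log/nginx/error.log
```

On the ten lines quoted above this would report 6 timeouts for /ajax/table/eventlog and 4 for /ajax/dash/device-summary-horiz, which suggests that both dashboard widgets are waiting on slow database queries rather than one page being broken.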


; Start a new pool named 'www'.
; the variable $pool can be used in any directive and will be replaced by the
; pool name ('www' here)
[librenms]

; Per pool prefix
; It only applies on the following directives:
; - 'access.log'
; - 'slowlog'
; - 'listen' (unixsocket)
; - 'chroot'
; - 'chdir'
; - 'php_values'
; - 'php_admin_values'
; When not set, the global prefix (or /usr) applies instead.
; Note: This directive can also be relative to the global prefix.
; Default Value: none
;prefix = /path/to/pools/$pool

; Unix user/group of the child processes. This can be used only if the master
; process running user is root. It is set after the child process is created.
; The user and group can be specified either by their name or by their numeric
; IDs.
; Note: If the user is root, the executable needs to be started with
;       --allow-to-run-as-root option to work.
; Default Values: The user is set to master process running user by default.
;                 If the group is not set, the user's group is used.
user = librenms
group = librenms

; The address on which to accept FastCGI requests.
; Valid syntaxes are:
;   'ip.add.re.ss:port'    - to listen on a TCP socket to a specific IPv4 address on
;                            a specific port;
;   '[ip:6:addr:ess]:port' - to listen on a TCP socket to a specific IPv6 address on
;                            a specific port;
;   'port'                 - to listen on a TCP socket to all addresses
;                            (IPv6 and IPv4-mapped) on a specific port;
;   '/path/to/unix/socket' - to listen on a unix socket.
; Note: This value is mandatory.
listen = /run/php-fpm-librenms.sock

; Set listen(2) backlog.
; Default Value: 511 (-1 on Linux, FreeBSD and OpenBSD)
;listen.backlog = 511

; Set permissions for unix socket, if one is used. In Linux, read/write
; permissions must be set in order to allow connections from a web server. Many
; BSD-derived systems allow connections regardless of permissions. The owner
; and group can be specified either by name or by their numeric IDs.
; Default Values: Owner is set to the master process running user. If the group
;                 is not set, the owner's group is used. Mode is set to 0660.
listen.owner = www-data
listen.group = www-data
;listen.mode = 0660

; When POSIX Access Control Lists are supported you can set them using
; these options, value is a comma separated list of user/group names.
; When set, listen.owner and listen.group are ignored
;listen.acl_users =
;listen.acl_groups =

; List of addresses (IPv4/IPv6) of FastCGI clients which are allowed to connect.
; Equivalent to the FCGI_WEB_SERVER_ADDRS environment variable in the original
; PHP FCGI (5.2.2+). Makes sense only with a tcp listening socket. Each address
; must be separated by a comma. If this value is left blank, connections will be
; accepted from any ip address.
; Default Value: any
;listen.allowed_clients = 127.0.0.1

; Set the associated the route table (FIB). FreeBSD only
; Default Value: -1
;listen.setfib = 1

; Specify the nice(2) priority to apply to the pool processes (only if set)
; The value can vary from -19 (highest priority) to 20 (lower priority)
; Note: - It will only work if the FPM master process is launched as root
;       - The pool processes will inherit the master process priority
;         unless it specified otherwise
; Default Value: no set
; process.priority = -19

; Set the process dumpable flag (PR_SET_DUMPABLE prctl for Linux or
; PROC_TRACE_CTL procctl for FreeBSD) even if the process user
; or group is different than the master process user. It allows to create process
; core dump and ptrace the process for the pool user.
; Default Value: no
; process.dumpable = yes

; Choose how the process manager will control the number of child processes.
; Possible Values:
;   static  - a fixed number (pm.max_children) of child processes;
;   dynamic - the number of child processes are set dynamically based on the
;             following directives. With this process management, there will be
;             always at least 1 children.
;             pm.max_children      - the maximum number of children that can
;                                    be alive at the same time.
;             pm.start_servers     - the number of children created on startup.
;             pm.min_spare_servers - the minimum number of children in 'idle'
;                                    state (waiting to process). If the number
;                                    of 'idle' processes is less than this
;                                    number then some children will be created.
;             pm.max_spare_servers - the maximum number of children in 'idle'
;                                    state (waiting to process). If the number
;                                    of 'idle' processes is greater than this
;                                    number then some children will be killed.
;             pm.max_spawn_rate    - the maximum number of rate to spawn child
;                                    processes at once.
;  ondemand - no children are created at startup. Children will be forked when
;             new requests will connect. The following parameter are used:
;             pm.max_children           - the maximum number of children that
;                                         can be alive at the same time.
;             pm.process_idle_timeout   - The number of seconds after which
;                                         an idle process will be killed.
; Note: This value is mandatory.
pm = dynamic

; The number of child processes to be created when pm is set to 'static' and the
; maximum number of child processes when pm is set to 'dynamic' or 'ondemand'.
; This value sets the limit on the number of simultaneous requests that will be
; served. Equivalent to the ApacheMaxClients directive with mpm_prefork.
; Equivalent to the PHP_FCGI_CHILDREN environment variable in the original PHP
; CGI. The below defaults are based on a server without much resources. Don't
; forget to tweak pm.* to fit your needs.
; Note: Used when pm is set to 'static', 'dynamic' or 'ondemand'
; Note: This value is mandatory.
pm.max_children = 25

; The number of child processes created on startup.
; Note: Used only when pm is set to 'dynamic'
; Default Value: (min_spare_servers + max_spare_servers) / 2
pm.start_servers = 8

; The desired minimum number of idle server processes.
; Note: Used only when pm is set to 'dynamic'
; Note: Mandatory when pm is set to 'dynamic'
pm.min_spare_servers = 6

; The desired maximum number of idle server processes.
; Note: Used only when pm is set to 'dynamic'
; Note: Mandatory when pm is set to 'dynamic'
pm.max_spare_servers = 15

; The number of rate to spawn child processes at once.
; Note: Used only when pm is set to 'dynamic'
; Note: Mandatory when pm is set to 'dynamic'
; Default Value: 32
;pm.max_spawn_rate = 32

; The number of seconds after which an idle process will be killed.
; Note: Used only when pm is set to 'ondemand'
; Default Value: 10s
;pm.process_idle_timeout = 10s;

; The number of requests each child process should execute before respawning.
; This can be useful to work around memory leaks in 3rd party libraries. For
; endless request processing specify '0'. Equivalent to PHP_FCGI_MAX_REQUESTS.
; Default Value: 0
;pm.max_requests = 500

; The URI to view the FPM status page. If this value is not set, no URI will be
; recognized as a status page. It shows the following information:
;   pool                 - the name of the pool;
;   process manager      - static, dynamic or ondemand;
;   start time           - the date and time FPM has started;
;   start since          - number of seconds since FPM has started;
;   accepted conn        - the number of request accepted by the pool;
;   listen queue         - the number of request in the queue of pending
;                          connections (see backlog in listen(2));
;   max listen queue     - the maximum number of requests in the queue
;                          of pending connections since FPM has started;
;   listen queue len     - the size of the socket queue of pending connections;
;   idle processes       - the number of idle processes;
;   active processes     - the number of active processes;
;   total processes      - the number of idle + active processes;
;   max active processes - the maximum number of active processes since FPM
;                          has started;
;   max children reached - number of times, the process limit has been reached,
;                          when pm tries to start more children (works only for
;                          pm 'dynamic' and 'ondemand');
; Value are updated in real time.
; Example output:
;   pool:                 www
;   process manager:      static
;   start time:           01/Jul/2011:17:53:49 +0200
;   start since:          62636
;   accepted conn:        190460
;   listen queue:         0
;   max listen queue:     1
;   listen queue len:     42
;   idle processes:       4
;   active processes:     11
;   total processes:      15
;   max active processes: 12
;   max children reached: 0
;
; By default the status page output is formatted as text/plain. Passing either
; 'html', 'xml' or 'json' in the query string will return the corresponding
; output syntax. Example:
;   http://www.foo.bar/status
;   http://www.foo.bar/status?json
;   http://www.foo.bar/status?html
;   http://www.foo.bar/status?xml
;
; By default the status page only outputs short status. Passing 'full' in the
; query string will also return status for each pool process.
; Example:
;   http://www.foo.bar/status?full
;   http://www.foo.bar/status?json&full
;   http://www.foo.bar/status?html&full
;   http://www.foo.bar/status?xml&full
; The Full status returns for each process:
;   pid                  - the PID of the process;
;   state                - the state of the process (Idle, Running, ...);
;   start time           - the date and time the process has started;
;   start since          - the number of seconds since the process has started;
;   requests             - the number of requests the process has served;
;   request duration     - the duration in µs of the requests;
;   request method       - the request method (GET, POST, ...);
;   request URI          - the request URI with the query string;
;   content length       - the content length of the request (only with POST);
;   user                 - the user (PHP_AUTH_USER) (or '-' if not set);
;   script               - the main script called (or '-' if not set);
;   last request cpu     - the %cpu the last request consumed
;                          it's always 0 if the process is not in Idle state
;                          because CPU calculation is done when the request
;                          processing has terminated;
;   last request memory  - the max amount of memory the last request consumed
;                          it's always 0 if the process is not in Idle state
;                          because memory calculation is done when the request
;                          processing has terminated;
; If the process is in Idle state, then informations are related to the
; last request the process has served. Otherwise informations are related to
; the current request being served.
; Example output:
;   ************************
;   pid:                  31330
;   state:                Running
;   start time:           01/Jul/2011:17:53:49 +0200
;   start since:          63087
;   requests:             12808
;   request duration:     1250261
;   request method:       GET
;   request URI:          /test_mem.php?N=10000
;   content length:       0
;   user:                 -
;   script:               /home/fat/web/docs/php/test_mem.php
;   last request cpu:     0.00
;   last request memory:  0
;
; Note: There is a real-time FPM status monitoring sample web page available
;       It's available in: /usr/share/php/8.3/fpm/status.html
;
; Note: The value must start with a leading slash (/). The value can be
;       anything, but it may not be a good idea to use the .php extension or it
;       may conflict with a real PHP file.
; Default Value: not set
;pm.status_path = /status

; The address on which to accept FastCGI status request. This creates a new
; invisible pool that can handle requests independently. This is useful
; if the main pool is busy with long running requests because it is still possible
; to get the status before finishing the long running requests.
;
; Valid syntaxes are:
;   'ip.add.re.ss:port'    - to listen on a TCP socket to a specific IPv4 address on
;                            a specific port;
;   '[ip:6:addr:ess]:port' - to listen on a TCP socket to a specific IPv6 address on
;                            a specific port;
;   'port'                 - to listen on a TCP socket to all addresses
;                            (IPv6 and IPv4-mapped) on a specific port;
;   '/path/to/unix/socket' - to listen on a unix socket.
; Default Value: value of the listen option
;pm.status_listen = 127.0.0.1:9001

; The ping URI to call the monitoring page of FPM. If this value is not set, no
; URI will be recognized as a ping page. This could be used to test from outside
; that FPM is alive and responding, or to
; - create a graph of FPM availability (rrd or such);
; - remove a server from a group if it is not responding (load balancing);
; - trigger alerts for the operating team (24/7).
; Note: The value must start with a leading slash (/). The value can be
;       anything, but it may not be a good idea to use the .php extension or it
;       may conflict with a real PHP file.
; Default Value: not set
;ping.path = /ping

; This directive may be used to customize the response of a ping request. The
; response is formatted as text/plain with a 200 response code.
; Default Value: pong
;ping.response = pong

; The access log file
; Default: not set
;access.log = log/$pool.access.log

; The access log format.
; The following syntax is allowed
;  %%: the '%' character
;  %C: %CPU used by the request
;      it can accept the following format:
;      - %{user}C for user CPU only
;      - %{system}C for system CPU only
;      - %{total}C  for user + system CPU (default)
;  %d: time taken to serve the request
;      it can accept the following format:
;      - %{seconds}d (default)
;      - %{milliseconds}d
;      - %{milli}d
;      - %{microseconds}d
;      - %{micro}d
;  %e: an environment variable (same as $_ENV or $_SERVER)
;      it must be associated with embraces to specify the name of the env
;      variable. Some examples:
;      - server specifics like: %{REQUEST_METHOD}e or %{SERVER_PROTOCOL}e
;      - HTTP headers like: %{HTTP_HOST}e or %{HTTP_USER_AGENT}e
;  %f: script filename
;  %l: content-length of the request (for POST request only)
;  %m: request method
;  %M: peak of memory allocated by PHP
;      it can accept the following format:
;      - %{bytes}M (default)
;      - %{kilobytes}M
;      - %{kilo}M
;      - %{megabytes}M
;      - %{mega}M
;  %n: pool name
;  %o: output header
;      it must be associated with embraces to specify the name of the header:
;      - %{Content-Type}o
;      - %{X-Powered-By}o
;      - %{Transfert-Encoding}o
;      - ....
;  %p: PID of the child that serviced the request
;  %P: PID of the parent of the child that serviced the request
;  %q: the query string
;  %Q: the '?' character if query string exists
;  %r: the request URI (without the query string, see %q and %Q)
;  %R: remote IP address
;  %s: status (response code)
;  %t: server time the request was received
;      it can accept a strftime(3) format:
;      %d/%b/%Y:%H:%M:%S %z (default)
;      The strftime(3) format must be encapsulated in a %{<strftime_format>}t tag
;      e.g. for a ISO8601 formatted timestring, use: %{%Y-%m-%dT%H:%M:%S%z}t
;  %T: time the log has been written (the request has finished)
;      it can accept a strftime(3) format:
;      %d/%b/%Y:%H:%M:%S %z (default)
;      The strftime(3) format must be encapsulated in a %{<strftime_format>}t tag
;      e.g. for a ISO8601 formatted timestring, use: %{%Y-%m-%dT%H:%M:%S%z}t
;  %u: remote user
;
; Default: "%R - %u %t \"%m %r\" %s"
;access.format = "%R - %u %t \"%m %r%Q%q\" %s %f %{milli}d %{kilo}M %C%%"

; A list of request_uri values which should be filtered from the access log.
;
; As a security precuation, this setting will be ignored if:
;     - the request method is not GET or HEAD; or
;     - there is a request body; or
;     - there are query parameters; or
;     - the response code is outwith the successful range of 200 to 299
;
; Note: The paths are matched against the output of the access.format tag "%r".
;       On common configurations, this may look more like SCRIPT_NAME than the
;       expected pre-rewrite URI.
;
; Default Value: not set
;access.suppress_path[] = /ping
;access.suppress_path[] = /health_check.php

; The log file for slow requests
; Default Value: not set
; Note: slowlog is mandatory if request_slowlog_timeout is set
;slowlog = log/$pool.log.slow

; The timeout for serving a single request after which a PHP backtrace will be
; dumped to the 'slowlog' file. A value of '0s' means 'off'.
; Available units: s(econds)(default), m(inutes), h(ours), or d(ays)
; Default Value: 0
;request_slowlog_timeout = 0

; Depth of slow log stack trace.
; Default Value: 20
;request_slowlog_trace_depth = 20

; The timeout for serving a single request after which the worker process will
; be killed. This option should be used when the 'max_execution_time' ini option
; does not stop script execution for some reason. A value of '0' means 'off'.
; Available units: s(econds)(default), m(inutes), h(ours), or d(ays)
; Default Value: 0
;request_terminate_timeout = 0

; The timeout set by 'request_terminate_timeout' ini option is not engaged after
; application calls 'fastcgi_finish_request' or when application has finished and
; shutdown functions are being called (registered via register_shutdown_function).
; This option will enable timeout limit to be applied unconditionally
; even in such cases.
; Default Value: no
;request_terminate_timeout_track_finished = no

; Set open file descriptor rlimit.
; Default Value: system defined value
;rlimit_files = 1024

; Set max core size rlimit.
; Possible Values: 'unlimited' or an integer greater or equal to 0
; Default Value: system defined value
;rlimit_core = 0

; Chroot to this directory at the start. This value must be defined as an
; absolute path. When this value is not set, chroot is not used.
; Note: you can prefix with '$prefix' to chroot to the pool prefix or one
; of its subdirectories. If the pool prefix is not set, the global prefix
; will be used instead.
; Note: chrooting is a great security feature and should be used whenever
;       possible. However, all PHP paths will be relative to the chroot
;       (error_log, sessions.save_path, ...).
; Default Value: not set
;chroot =

; Chdir to this directory at the start.
; Note: relative path can be used.
; Default Value: current directory or / when chroot
;chdir = /var/www

; Redirect worker stdout and stderr into main error log. If not set, stdout and
; stderr will be redirected to /dev/null according to FastCGI specs.
; Note: on highloaded environment, this can cause some delay in the page
; process time (several ms).
; Default Value: no
;catch_workers_output = yes

; Decorate worker output with prefix and suffix containing information about
; the child that writes to the log and if stdout or stderr is used as well as
; log level and time. This options is used only if catch_workers_output is yes.
; Settings to "no" will output data as written to the stdout or stderr.
; Default value: yes
;decorate_workers_output = no

; Clear environment in FPM workers
; Prevents arbitrary environment variables from reaching FPM worker processes
; by clearing the environment in workers before env vars specified in this
; pool configuration are added.
; Setting to "no" will make all environment variables available to PHP code
; via getenv(), $_ENV and $_SERVER.
; Default Value: yes
;clear_env = no

; Limits the extensions of the main script FPM will allow to parse. This can
; prevent configuration mistakes on the web server side. You should only limit
; FPM to .php extensions to prevent malicious users to use other extensions to
; execute php code.
; Note: set an empty value to allow all extensions.
; Default Value: .php
;security.limit_extensions = .php .php3 .php4 .php5 .php7

; Pass environment variables like LD_LIBRARY_PATH. All $VARIABLEs are taken from
; the current environment.
; Default Value: clean env
;env[HOSTNAME] = $HOSTNAME
;env[PATH] = /usr/local/bin:/usr/bin:/bin
;env[TMP] = /tmp
;env[TMPDIR] = /tmp
;env[TEMP] = /tmp

; Additional php.ini defines, specific to this pool of workers. These settings
; overwrite the values previously defined in the php.ini. The directives are the
; same as the PHP SAPI:
;   php_value/php_flag             - you can set classic ini defines which can
;                                    be overwritten from PHP call 'ini_set'.
;   php_admin_value/php_admin_flag - these directives won't be overwritten by
;                                     PHP call 'ini_set'
; For php_*flag, valid values are on, off, 1, 0, true, false, yes or no.

; Defining 'extension' will load the corresponding shared extension from
; extension_dir. Defining 'disable_functions' or 'disable_classes' will not
; overwrite previously defined php.ini values, but will append the new value
; instead.

; Note: path INI options can be relative and will be expanded with the prefix
; (pool, global or /usr)

; Default Value: nothing is defined by default except the values in php.ini and
;                specified at startup with the -d argument
;php_admin_value[sendmail_path] = /usr/sbin/sendmail -t -i -f [email protected]
;php_flag[display_errors] = off
;php_admin_value[error_log] = /var/log/fpm-php.www.log
;php_admin_flag[log_errors] = on
;php_admin_value[memory_limit] = 32M

One of your MySQL errors is "Too many connections". The default connection limit (max_connections) is 151, and most installs shouldn’t run into that limit (especially with only 333 devices). You should probably increase the MySQL connection limit, but hitting it could also be an indication of your underlying problem.
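
To see how often the limit is actually hit, the MariaDB error log can be summarised per day. This is a rough sketch rather than anything from the original posts: `too_many_by_day` is a hypothetical helper name, and it assumes the log format quoted earlier in the thread, where the date is the first field of each line.

```shell
# Count "Too many connections" warnings per day.
# Reads a MariaDB error log on stdin and prints "<date> <count>" lines.
too_many_by_day() {
    grep 'Too many connections' |
        awk '{ count[$1]++ } END { for (day in count) print day, count[day] }' |
        sort
}

# Usage (log path as used in this thread):
#   too_many_by_day < /var/log/mysql/error.log
```

The server itself also tracks the high-water mark: `SHOW GLOBAL STATUS LIKE 'Max_used_connections';` can be compared against `SHOW VARIABLES LIKE 'max_connections';` to see how close the install has come to the limit since the last restart.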

Are you using the dispatcher service? What are your worker counts set to?

Hi @murrant,

I am using the default installation; I did not install any service that was not part of the installation document.
Should the connection limit be increased in the MySQL or the PHP configuration file?
Thanks

In /etc/mysql/mariadb.conf.d/50-server.cnf, change

myisam_recover_options = BACKUP
max_connections        = 100
table_cache            = 64

to

myisam_recover_options = BACKUP
max_connections        = 300 
table_cache            = 64
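
After editing, it is worth confirming what the file really contains, since a commented-out or duplicated line is easy to miss. The sketch below is an illustration only: `configured_max_connections` is a hypothetical helper name, and it looks at a single file, whereas MariaDB merges several configuration files, so the running server remains the authority.

```shell
# Print the last uncommented max_connections value found in a config file.
# $1 = path to the config file.
configured_max_connections() {
    awk -F= '/^[ \t]*max_connections[ \t]*=/ {
        gsub(/[ \t]/, "", $2)   # strip the padding around the value
        value = $2
    } END { if (value != "") print value }' "$1"
}

# Usage:
#   configured_max_connections /etc/mysql/mariadb.conf.d/50-server.cnf
```

The change only takes effect after a restart (for example `systemctl restart mariadb`); running `SHOW VARIABLES LIKE 'max_connections';` afterwards shows the value the server is actually using.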

You didn’t answer my question about how many workers you run. Hitting the MySQL connection limit may mean your worker count is set very high (which is bad), causing a thundering herd problem: every device is polled simultaneously, and the server then sits idle for the rest of the interval.

Excuse my ignorance, how can I check?

Hi @murrant ,

I applied the recommendations in the document Performance - LibreNMS Docs, but I have not done anything recommended in the document Dispatcher Service (RC) - LibreNMS Docs.
With poller-wrapper I’m using 12, because my server has 6 cores and 24 CPUs.
Once again, I apologize if I’m doing something wrong, but I’d appreciate it if you could be more specific about the recommendations.
Many thanks

12 should be fine.

Do you have anything else connecting to that MySQL install?
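
The reasoning behind that question can be sketched as rough arithmetic. The figures come from this thread (12 poller-wrapper workers, pm.max_children = 25 in the posted pool file, max_connections = 100 in the posted 50-server.cnf). The one-connection-per-worker model is a simplifying assumption, and `estimate_peak_connections` is a hypothetical helper name.

```shell
# Rough upper bound on simultaneous database connections.
# Assumption: each poller worker and each PHP-FPM child holds at most one
# connection at a time. Other cron jobs (discovery, alerts, ...) are ignored.
estimate_peak_connections() {
    pollers=$1
    fpm_children=$2
    echo $((pollers + fpm_children))
}

peak=$(estimate_peak_connections 12 25)   # poller-wrapper workers, pm.max_children
limit=100                                 # max_connections before the change
echo "estimated peak: $peak of $limit"
```

Under these assumptions the expected peak is 37, well below even the original limit of 100, so "Too many connections" points to connections being held open far longer than normal, or to another client, rather than to the worker count alone.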

No, the server is used exclusively for LibreNMS. No other services run on it.

@Domingos_Varela I linked the specific part of that document because it answers your question about how to check.
12 seems reasonable, but you might try a slightly higher number. Watching the CPU usage (at a higher resolution than LibreNMS polls) will give you an idea of where your number should be.

Usage shaped like this ^_ (a spike followed by idle time) means the number is too high.
Usage should look more like this --- (roughly flat), but no device should miss its polling interval; if any do, the number is too low.
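
One crude way to put numbers on those two shapes is to compare the mean and the peak of a series of CPU samples. This is only an illustrative heuristic, not a LibreNMS tool: `cpu_shape` is a hypothetical helper name, and the input is assumed to be one CPU-busy percentage per line, collected every few seconds by whatever sampler is at hand.

```shell
# Summarise CPU-busy samples (one percentage per line on stdin).
# A peak far above the mean suggests short bursts followed by idle time;
# a peak close to the mean suggests steady load across the polling interval.
cpu_shape() {
    awk '{ sum += $1; if ($1 > max) max = $1 }
         END { if (NR) printf "mean=%.0f max=%.0f\n", sum / NR, max }'
}

# Usage with a hypothetical file of samples:
#   cpu_shape < cpu_samples.txt
```

For example, the samples 100, 0, 0, 0 give mean=25 max=100 (bursty), while 25, 25, 25, 25 give mean=25 max=25 (flat), even though the total work is the same.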

You still have these issues:

  1. Your database server is crashing/becoming unavailable
  2. php-fpm is crashing/becoming unavailable
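
A database that looks unavailable is sometimes just busy with statements that never finish, so it can help to list anything that has been running for a long time. The sketch below is an assumption-laden illustration: `long_running` is a hypothetical helper name, and it expects the tab-separated output of `SHOW PROCESSLIST` as the `mysql` client prints it in batch mode (`-B`), with the columns Id, User, Host, db, Command, Time, State, Info in that order.

```shell
# Print statements that have been running longer than a threshold.
# stdin: tab-separated SHOW PROCESSLIST output, header line included.
# $1   : threshold in seconds.
# Output: "<Id> <Time> <Info>" for every non-sleeping connection over the limit.
long_running() {
    awk -F'\t' -v limit="$1" \
        'NR > 1 && $5 != "Sleep" && $6 + 0 > limit + 0 { print $1, $6, $8 }'
}

# Usage (list anything running for more than five minutes):
#   mysql -B -e 'SHOW PROCESSLIST' | long_running 300
```

Idle connections in the Sleep state are skipped on purpose, because a long Time value there only means the client has been quiet, not that a query is stuck.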

MySQL PROCESSLIST

MariaDB [(none)]> SHOW PROCESSLIST;
+---------+----------+-----------------+----------+---------+-------+-----------------+------------------------------------------------------------------------------------------------------+----------+
| Id      | User     | Host            | db       | Command | Time  | State           | Info                                                                                                 | Progress |
+---------+----------+-----------------+----------+---------+-------+-----------------+------------------------------------------------------------------------------------------------------+----------+
| 2714117 | librenms | localhost:59340 | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 2714122 | librenms | localhost:40366 | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 2721305 | librenms | localhost:54972 | librenms | Sleep   |   183 |                 | NULL                                                                                                 |    0.000 |
| 2721306 | librenms | localhost:54984 | librenms | Sleep   |   183 |                 | NULL                                                                                                 |    0.000 |
| 2775875 | librenms | localhost       | librenms | Execute | 38032 | Sending data    | SELECT * FROM (SELECT `port_id` FROM `ports` WHERE `ifPhysAddress`=? LIMIT 1) p                      |    0.000 |
| 2780182 | librenms | localhost       | librenms | Execute | 38032 | Sending data    | SELECT * FROM (SELECT `port_id` FROM `ports` WHERE `ifPhysAddress`=? LIMIT 1) p                      |    0.000 |
| 2822683 | librenms | localhost:44948 | librenms | Sleep   |  6436 |                 | NULL                                                                                                 |    0.000 |
| 2990923 | librenms | localhost:57774 | librenms | Sleep   |  3776 |                 | NULL                                                                                                 |    0.000 |
| 2991355 | librenms | localhost:58236 | librenms | Sleep   |  3480 |                 | NULL                                                                                                 |    0.000 |
| 2992212 | librenms | localhost:44386 | librenms | Sleep   |  3124 |                 | NULL                                                                                                 |    0.000 |
| 3003306 | librenms | localhost:37240 | librenms | Sleep   |  2825 |                 | NULL                                                                                                 |    0.000 |
| 3003307 | librenms | localhost:37250 | librenms | Sleep   |  2825 |                 | NULL                                                                                                 |    0.000 |
| 3003723 | librenms | localhost:37788 | librenms | Sleep   |  2581 |                 | NULL                                                                                                 |    0.000 |
| 3016078 | librenms | localhost:34236 | librenms | Sleep   |  1980 |                 | NULL                                                                                                 |    0.000 |
| 3016079 | librenms | localhost:34238 | librenms | Sleep   |  1980 |                 | NULL                                                                                                 |    0.000 |
| 3016592 | librenms | localhost       | librenms | Sleep   |    38 |                 | NULL                                                                                                 |    0.000 |
| 3016594 | librenms | localhost       | librenms | Execute |    55 | Sending data    | SELECT `device_id` FROM `ports` WHERE `ifPhysAddress`=?                                              |    0.000 |
| 3016644 | librenms | localhost:37808 | librenms | Sleep   |  1660 |                 | NULL                                                                                                 |    0.000 |
| 3016645 | librenms | localhost:37816 | librenms | Sleep   |  1660 |                 | NULL                                                                                                 |    0.000 |
| 3016984 | librenms | localhost:58650 | librenms | Sleep   |  1381 |                 | NULL                                                                                                 |    0.000 |
| 3016985 | librenms | localhost:58666 | librenms | Sleep   |  1381 |                 | NULL                                                                                                 |    0.000 |
| 3017293 | librenms | localhost:58528 | librenms | Sleep   |  1080 |                 | NULL                                                                                                 |    0.000 |
| 3017294 | librenms | localhost:58538 | librenms | Sleep   |  1080 |                 | NULL                                                                                                 |    0.000 |
| 3017881 | librenms | localhost       | librenms | Sleep   |     3 |                 | NULL                                                                                                 |    0.000 |
| 3018077 | librenms | localhost       | librenms | Execute |    56 | Sending data    | SELECT * FROM (SELECT `port_id` FROM `ports` WHERE `ifPhysAddress`=? LIMIT 1) p                      |    0.000 |
| 3018200 | librenms | localhost:47624 | librenms | Query   |     0 | Updating        | UPDATE pollers SET last_polled=NOW(), devices='333', time_taken='781' WHERE poller_name='librenms'   |    0.000 |
| 3022962 | librenms | localhost:45900 | librenms | Sleep   |   473 |                 | NULL                                                                                                 |    0.000 |
| 3022963 | librenms | localhost:45902 | librenms | Sleep   |   473 |                 | NULL                                                                                                 |    0.000 |
| 3022995 | librenms | localhost       | librenms | Sleep   |   328 |                 | NULL                                                                                                 |    0.000 |
| 3023000 | librenms | localhost       | librenms | Sleep   |   328 |                 | NULL                                                                                                 |    0.000 |
| 3027580 | librenms | localhost       | librenms | Execute |   143 | Sending data    | SELECT `device_id` FROM `ports` WHERE `ifPhysAddress`=?                                              |    0.000 |
| 3027633 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT COUNT(*) FROM `links` WHERE `remote_hostname` = ? AND `local_port_id` = ? AND `protocol` = ?  |    0.000 |
| 3027637 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT COUNT(*) FROM `links` WHERE `remote_hostname` = ? AND `local_port_id` = ? AND `protocol` = ?  |    0.000 |
| 3027639 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT COUNT(*) FROM `links` WHERE `remote_hostname` = ? AND `local_port_id` = ? AND `protocol` = ?  |    0.000 |
| 3027794 | librenms | localhost:59516 | librenms | Sleep   |   182 |                 | NULL                                                                                                 |    0.000 |
| 3027795 | librenms | localhost:59526 | librenms | Sleep   |   182 |                 | NULL                                                                                                 |    0.000 |
| 3027798 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3027799 | librenms | localhost       | librenms | Sleep   |    73 |                 | NULL                                                                                                 |    0.000 |
| 3027800 | librenms | localhost       | librenms | Sleep   |    67 |                 | NULL                                                                                                 |    0.000 |
| 3027803 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3027808 | librenms | localhost       | librenms | Sleep   |    31 |                 | NULL                                                                                                 |    0.000 |
| 3027809 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3027812 | librenms | localhost       | librenms | Sleep   |     1 |                 | NULL                                                                                                 |    0.000 |
| 3027815 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3027817 | librenms | localhost       | librenms | Sleep   |    73 |                 | NULL                                                                                                 |    0.000 |
| 3027820 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3027818 | librenms | localhost       | librenms | Sleep   |    16 |                 | NULL                                                                                                 |    0.000 |
| 3027825 | librenms | localhost       | librenms | Sleep   |    31 |                 | NULL                                                                                                 |    0.000 |
| 3027841 | librenms | localhost       | librenms | Execute |     0 | init for update | UPDATE `alerts` set `state`=?,`open`=?,`note`=?,`timestamp`=? WHERE device_id = ? && rule_id = ?     |    0.000 |
| 3027881 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3027925 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3027978 | librenms | localhost       | librenms | Execute |    22 | Updating        | UPDATE `ports` set `deleted`=? WHERE `port_id` = ?                                                   |    0.000 |
| 3027982 | librenms | localhost       | librenms | Sleep   |    14 |                 | NULL                                                                                                 |    0.000 |
| 3027984 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3027985 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT COUNT(*) FROM `links` WHERE `remote_hostname` = ? AND `local_port_id` = ? AND `protocol` = ?  |    0.000 |
| 3027990 | librenms | localhost       | librenms | Sleep   |    41 |                 | NULL                                                                                                 |    0.000 |
| 3027991 | librenms | localhost       | librenms | Sleep   |    42 |                 | NULL                                                                                                 |    0.000 |
| 3027993 | librenms | localhost       | librenms | Sleep   |   119 |                 | NULL                                                                                                 |    0.000 |
| 3027998 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3027999 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028001 | librenms | localhost       | librenms | Sleep   |    37 |                 | NULL                                                                                                 |    0.000 |
| 3028013 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028020 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028031 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028032 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028041 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028043 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028045 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,ports WHERE (devices.device_id = ? AND devices.device_id = ports.device_id) AN |    0.000 |
| 3028051 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028052 | librenms | localhost       | librenms | Execute |    91 | Updating        | update `devices` set `last_polled` = ?, `last_polled_timetaken` = ? where `device_id` = ?            |    0.000 |
| 3028058 | librenms | localhost       | librenms | Execute |    21 | Updating        | UPDATE `ports` set `ifInOctets`=?,`ifInOctets_prev`=?,`ifInOctets_rate`=?,`ifInOctets_delta`=?,`ifOu |    0.000 |
| 3028066 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028067 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT COUNT(*) FROM `links` WHERE `remote_hostname` = ? AND `local_port_id` = ? AND `protocol` = ?  |    0.000 |
| 3028068 | librenms | localhost       | librenms | Sleep   |   107 |                 | NULL                                                                                                 |    0.000 |
| 3028081 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028082 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028083 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028085 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028098 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028102 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028105 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028106 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028108 | librenms | localhost       | librenms | Execute |    86 | Updating        | update `devices` set `last_polled` = ?, `last_polled_timetaken` = ? where `device_id` = ?            |    0.000 |
| 3028109 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028115 | librenms | localhost       | librenms | Execute |    95 | Sending data    | DELETE T FROM `links` T LEFT JOIN `devices` ON `devices`.`device_id` = T.`local_device_id` WHERE `de |    0.000 |
| 3028125 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028128 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028133 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028135 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028138 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028139 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028140 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028142 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028143 | librenms | localhost       | librenms | Execute |     0 | Sending data    | select * from `device_outages` where `device_outages`.`device_id` = ? and `device_outages`.`device_i |    0.000 |
| 3028144 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,sensors WHERE (devices.device_id = ? AND devices.device_id = sensors.device_id |    0.000 |
| 3028145 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028147 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028151 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028152 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028153 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028154 | librenms | localhost       | librenms | Execute |     0 | init for update | UPDATE `alerts` set `state`=?,`open`=?,`note`=?,`timestamp`=? WHERE device_id = ? && rule_id = ?     |    0.000 |
| 3028157 | librenms | localhost       | librenms | Sleep   |    90 |                 | NULL                                                                                                 |    0.000 |
| 3028162 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028166 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028165 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,ports WHERE (devices.device_id = ? AND devices.device_id = ports.device_id) AN |    0.000 |
| 3028168 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028170 | librenms | localhost       | librenms | Sleep   |    87 |                 | NULL                                                                                                 |    0.000 |
| 3028174 | librenms | localhost       | librenms | Execute |     1 | Sending data    | select * from `device_outages` where `device_outages`.`device_id` = ? and `device_outages`.`device_i |    0.000 |
| 3028179 | librenms | localhost       | librenms | Execute |     0 | Statistics      | SELECT * FROM devices,ports WHERE (devices.device_id = ? AND devices.device_id = ports.device_id) AN |    0.000 |
| 3028185 | librenms | localhost       | librenms | Execute |    87 | Sending data    | DELETE T FROM `processors` T LEFT JOIN `devices` ON `devices`.`device_id` = T.`device_id` WHERE `dev |    0.000 |
| 3028186 | librenms | localhost       | librenms | Execute |    27 | Sending data    | select `ipv4_address_id`, `ipv4_address`, `ipv4_prefixlen`, `ipv4_network_id`, `ports`.`device_id`,  |    0.000 |
| 3028187 | librenms | localhost       | librenms | Execute |    26 | Sending data    | select `ipv4_address_id`, `ipv4_address`, `ipv4_prefixlen`, `ipv4_network_id`, `ports`.`device_id`,  |    0.000 |
| 3028188 | librenms | localhost       | librenms | Execute |    12 | Sending data    | select `ipv4_address_id`, `ipv4_address`, `ipv4_prefixlen`, `ipv4_network_id`, `ports`.`device_id`,  |    0.000 |
| 3028190 | librenms | localhost       | librenms | Execute |    63 | Sending data    | SELECT `alerts`.`id` AS `alert_id`, `devices`.`hostname` AS `hostname` FROM `alerts` LEFT JOIN `devi |    0.000 |
| 3028194 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028195 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028196 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028197 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028198 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028199 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028201 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028202 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028204 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028205 | librenms | localhost       | librenms | Sleep   |     0 |                 | NULL                                                                                                 |    0.000 |
| 3028208 | librenms | localhost       | librenms | Execute |     0 | Sending data    | SELECT alerts.id, alerts.alerted, alerts.device_id, alerts.rule_id, alerts.state, alerts.note, alert |    0.000 |
| 3028209 | root     | localhost       | NULL     | Query   |     0 | starting        | SHOW PROCESSLIST                                                                                     |    0.000 |
+---------+----------+-----------------+----------+---------+-------+-----------------+------------------------------------------------------------------------------------------------------+----------+
126 rows in set (0.018 sec)

Your database server is struggling. There should be only a few or even no pending queries.

This could be an IO issue. Try looking up how to troubleshoot a slow MariaDB server.
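
To put numbers on a process list like the one above, a small script can count the active (non-Sleep) connections and pick out the long runners, such as the two `ifPhysAddress` lookups that have been executing for 38032 seconds. This is a rough sketch that parses the text table as pasted from the mariadb client; the helper names and the 30-second threshold are assumptions for illustration only.

```python
def parse_processlist(text):
    """Parse the ASCII table printed by the mysql/mariadb client into a list of dicts."""
    rows, header = [], None
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # skip +---+ borders, prompts and the "N rows in set" footer
        cells = [c.strip() for c in line.strip("|").split("|")]
        if header is None:
            header = cells  # first table line holds the column names
        elif len(cells) == len(header):
            rows.append(dict(zip(header, cells)))
        # Rows whose Info text itself contains "|" do not match the header and are skipped.
    return rows


def summarize(rows, slow_after=30):
    """Count sleeping/active connections and list active ones running >= slow_after seconds."""
    active = [r for r in rows if r["Command"] != "Sleep"]
    slow = [r for r in active if int(r["Time"]) >= slow_after]
    slow.sort(key=lambda r: int(r["Time"]), reverse=True)
    return {
        "total": len(rows),
        "sleeping": len(rows) - len(active),
        "active": len(active),
        "slow": slow,
    }

# Example: save the SHOW PROCESSLIST output to a file, then
#   report = summarize(parse_processlist(open("processlist.txt").read()))
#   for r in report["slow"]:
#       print(r["Id"], r["Time"], r["State"], r["Info"])
```

Note that the session running `SHOW PROCESSLIST` itself is counted as active. On a healthy server the "slow" list should normally be empty or close to it.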