DNS Discovery Flapping

vignesh-sp · April 29, 2022, 9:04am

Every now and then we see multiple backends flapping with this messages

is going DOWN for maintenance (entry removed from SRV record)

And sometimes when this message appears, traffic goes to BUSY for brief amount of time.

I couldn’t see any dns requests being failed, but we still see this flaps.

This is a gist of haproxy config

global
  user haproxy
  group haproxy
  nbproc 1
  nbthread 16
  cpu-map auto:1/1-16 0-15
  log /dev/log local2
  log /dev/log local0 notice
  chroot /path/to/haproxy
  pidfile /path/to/haproxy.pid
  daemon
  master-worker
  maxconn 200000
  hard-stop-after 1h
  stats socket /path/to/haproxy/stats mode 660 level admin expose-fd listeners
  tune.ssl.cachesize 3000000
  tune.ssl.lifetime 60000
  ssl-default-bind-ciphers ECDHE-ECDSA-AES256-GCM-SHA384:ECDHE-RSA-AES256-GCM-SHA384:ECDHE-ECDSA-CHACHA20-POLY1305:ECDHE-RSA-CHACHA20-POLY1305:ECDHE-ECDSA-AES128-GCM-SHA256:ECDHE-RSA-AES128-GCM-SHA256:ECDHE-ECDSA-AES256-SHA384:ECDHE-RSA-AES256-SHA384:ECDHE-ECDSA-AES128-SHA256:ECDHE-RSA-AES128-SHA256
  ssl-default-bind-options ssl-min-ver TLSv1.2 ssl-max-ver TLSv1.2
  server-state-file /path/to//haproxy_server_states
  tune.bufsize 40960

resolvers mydns
        nameserver dns1 stream+ipv4@127.0.0.3:53
        accepted_payload_size 65535 # allow larger DNS payloads
        timeout resolve 1s
        timeout retry   2s
        hold other    50000s
        hold refused  50000s
        hold timeout  50000s
        hold nx       5s
        hold valid    1s
        hold obsolete 1s

frontend listener
        bind x.x.x.x:80 mss 1440 alpn h2,http/1.1
        mode http
        default_backend server_group

backend server_group
        mode  http
        server-template template_name 30 _servers-0._tcp.servers.local init-addr last,libc,none resolvers mydns check  port 80  maxconn 2000 maxqueue 1
        errorfile 503 /path/to/static/busy-json.http

Kindly help if we are doing something wrong !

Topic		Replies	Views
Haproxy 1.8.2; 1.8.3 DNS auto discover stop working Help!	20	3709	March 28, 2018
Server is going DOWN for maintenance (DNS timeout status) Help!	3	3925	October 23, 2018
Backend goes into MAINT when DNS fails and does not come back UP Help!	2	2517	May 4, 2020
There is a problem with the DNS SRV record process Help!	2	740	August 4, 2020
DNS Resolution Sigh v1.7.1 Help!	6	2345	January 31, 2017

DNS Discovery Flapping

Related topics