Juan Alvarez

09/08/2021, 10:35 AM
Does anybody know a good way to troubleshoot this kind of error?
Sep 08 10:27:24 devo-ea-manager fleet[860]: 2021/09/08 10:27:24 http: TLS handshake error from EOF
I am doing some scale testing on an AWS t3.xlarge box with 2k agents. Everything seems to work correctly, but this error appears periodically. Any suggestions for understanding why this could be happening?
10:39 AM
I have read that this could be related to ulimit, but we have set the limits to more than 16k, and the number of open FDs at the time of the error is under 100.
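(Editor's sketch, not part of the original message.) One way to compare a process's open file descriptors against its limit on Linux is to read `/proc/<pid>/fd` and `/proc/<pid>/limits`. The helper names `parse_max_open_files` and `fd_usage` are hypothetical, and the approach assumes a Linux host with enough privileges to read the target process's `/proc` entries.

```python
import os


def parse_max_open_files(limits_text):
    """Return the soft 'Max open files' limit from /proc/<pid>/limits text.

    Returns None if the line is missing or the limit is 'unlimited'.
    """
    for line in limits_text.splitlines():
        if line.startswith("Max open files"):
            # Expected fields: Max open files <soft> <hard> files
            soft = line.split()[3]
            return None if soft == "unlimited" else int(soft)
    return None


def fd_usage(pid):
    """Return (open_fd_count, soft_limit) for a PID (Linux only)."""
    open_fds = len(os.listdir(f"/proc/{pid}/fd"))
    with open(f"/proc/{pid}/limits") as f:
        limit = parse_max_open_files(f.read())
    return open_fds, limit


# Example (assumption: 860 is the fleet PID, as in the log line above):
#   print(fd_usage(860))
```

If the open count stays far below the soft limit while the error occurs, as described here, file descriptor exhaustion is an unlikely cause.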
Benjamin Edwards

09/08/2021, 3:51 PM
There is a new debug command that might be helpful: https://github.com/fleetdm/fleet/pull/1706

Juan Alvarez

09/09/2021, 8:30 AM
Thanks for your answer. Connectivity mostly seems to work, but the error appears sporadically for some agent connections (which seem to connect properly the next time they check in). That command seems to check connectivity only from a global point of view, which I would say is working. Do you think I could still use it to get more info on what could be going on?
9:38 AM
I think I found my issue: when creating many Docker containers on the same box for testing, I was hitting an ARP overflow. I fixed that on the agents' server and the errors went away.
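(Editor's sketch, not part of the original message.) The thread does not say how the overflow was diagnosed or fixed. On a Linux host, one way to check for this condition is to compare the number of entries in `/proc/net/arp` with the kernel's neighbor-table thresholds under `/proc/sys/net/ipv4/neigh/default/`. The function names and the 90% margin below are hypothetical choices for illustration.

```python
def count_arp_entries(proc_net_arp_text):
    """Count entries in /proc/net/arp text (the first line is a header)."""
    lines = [l for l in proc_net_arp_text.splitlines() if l.strip()]
    return max(len(lines) - 1, 0)


def read_gc_thresholds(base="/proc/sys/net/ipv4/neigh/default"):
    """Read gc_thresh1..3 for the IPv4 neighbor table (Linux only)."""
    thresholds = {}
    for name in ("gc_thresh1", "gc_thresh2", "gc_thresh3"):
        with open(f"{base}/{name}") as f:
            thresholds[name] = int(f.read().strip())
    return thresholds


def is_near_overflow(entries, gc_thresh3, margin=0.9):
    """True if the entry count is within `margin` of the hard maximum."""
    return entries >= margin * gc_thresh3


# Example on the host running the agent containers (assumption: Linux):
#   with open("/proc/net/arp") as f:
#       n = count_arp_entries(f.read())
#   print(n, is_near_overflow(n, read_gc_thresholds()["gc_thresh3"]))
```

`gc_thresh3` is the hard cap on neighbor entries, so a count close to it on a box running thousands of containers would be consistent with the symptom described. Note that `/proc/net/arp` only reflects the current network namespace.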