"Optimise closure of fds before invoking scripts.
Every time before a script was invoked, closeall() was called,
which would spin through 1024 file descriptors closing them, even
though the vast majority were not open, resulting in 1024 system
calls. To avoid that, open all sockets and file descriptors
(except fd 0/1/2) with the CLOEXEC flag set, so that the fds will
be closed by the kernel when the script is exec'd."
I'm looking to setup sysdig in our native HA openshift environment, but having issues getting the agent to run on our infra nodes hosting keepalived and ha-proxy -- agent runs without issue on all the other nodes in our env.
After the agent has been running about an hour or two, the node hangs and our hypervisor reports 100% cpu utilization. A power reset is the only option to bring the node back to life. The problem may be with keepalived doing an extremely large number(around 17 million in a minute) of "close" syscall operations, and it looks like those close operations are on any available fd. Is this expected behavior of keepalived running in an OSEv3.1.1 HA environment?
users mailing list
users lists openshift redhat com