EDR: One or more Server Nodes fail to start because a stuck process remains running

Environment

  • EDR: 6.x and Higher
  • Clustered environment

Symptoms

  • One or more secondary server nodes fail to start when a cluster start command is issued.
  • While the cluster is stopped, one or more unexpected Carbon Black processes remain running.

Cause

Some EDR processes may not shut down properly when the cluster is stopped. If they are still running at the next cluster start, they can prevent one or more nodes from starting.

Resolution

  1. From the primary server, stop the EDR cluster:
# /usr/share/cb/cbcluster stop
  2. Check for stuck processes:
# ps -ef | grep cb
Example output:
cb  12345 10011  0 1 03:10 /usr/lib64/erlang/erts-9.3/bin/epmd -daemon
myusername  19846 19718  0 17:15 pts/1    00:00:00 ps -ef
myusername  19847 19718  0 17:15 pts/1    00:00:00 grep cb
  3. Stop the stuck process (PID 12345 in this example):
# kill 12345
  4. Verify that the process is no longer running:
# ps -ef | grep cb
  5. Start the cluster again from the primary server:
# /usr/share/cb/cbcluster start
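
The kill and verification steps above can be combined into one small helper. The following is a minimal sketch, not part of the EDR tooling: the function name stop_and_verify and the default 10-second timeout are assumptions chosen for illustration. It sends the same default signal as the kill command in the steps above and then polls until the PID is gone.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with EDR): stop a PID and confirm it exited.
# Usage: stop_and_verify PID [TIMEOUT_SECONDS]
# Returns 0 once the process is gone, 1 if it is still present after the timeout.
stop_and_verify() {
    pid=$1
    timeout=${2:-10}   # assumed default; adjust as needed

    # Plain `kill` sends SIGTERM, the same as `kill 12345` in the steps above.
    # A failure here (for example, the PID already exited) is checked below.
    kill "$pid" 2>/dev/null || true

    while [ "$timeout" -gt 0 ]; do
        # `kill -0` sends no signal; it only tests whether the PID still exists.
        if ! kill -0 "$pid" 2>/dev/null; then
            return 0
        fi
        sleep 1
        timeout=$((timeout - 1))
    done

    # Final check after the last sleep.
    if kill -0 "$pid" 2>/dev/null; then
        return 1
    fi
    return 0
}
```

For the example in this article, the call would be stop_and_verify 12345. A non-zero return means the process is still present and needs further investigation before the cluster is started.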

Additional Notes

  • The /usr/lib64/erlang/erts-9.3/bin/epmd process shown above is only an example.
  • A different Carbon Black process may be the one that remains running.
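
Because the stuck process is not always epmd, it can help to list leftover processes by owner rather than by name. The sketch below assumes that EDR processes run as the user cb, as in the example output in the Resolution section. The function name list_user_pids is hypothetical. It reads ps -ef style output and prints the PID of every process owned by the given user, which also avoids matching the grep command itself.

```shell
#!/bin/sh
# Hypothetical helper: print the PIDs of processes owned by one user.
# Reads `ps -ef`-style text on standard input, where column 1 is the
# owning user and column 2 is the PID.
# Usage: ps -ef | list_user_pids cb
list_user_pids() {
    awk -v user="$1" '$1 == user { print $2 }'
}
```

Run against the example output in the Resolution section, this prints only 12345. The ps and grep lines are skipped because they are owned by myusername, not cb.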

Article Information
Creation Date: 09-09-2020