7 commits

Author SHA1 Message Date
Patrik Nordwall
ee6e80d31a Add previousStatus in MemberRemoved, see #3252 2013-05-23 11:09:32 +02:00
Patrik Nordwall
7628889b43 Changed design of RemoteWatcher due to cleanup race, see #3265
* The problem was a race that occurred when a HeartbeatReq was sent
  out and the watchee terminated immediately. That left the
  RemoteWatcher peers watching each other without any other watch
  registered.
* Instead of one-way heartbeats from the side being watched, I
  changed to ping-pong style. That makes the problem go away
  and simplifies a lot of things in RemoteWatcher.
2013-05-04 17:35:12 +02:00
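
The ping-pong design described in the commit above can be pictured with a small model. This is only a sketch under stated assumptions, not the actual RemoteWatcher code: the class names `Watcher` and `Watchee`, the method names, and the integer ticks are all hypothetical. The point it illustrates is that the watching side drives every exchange, so once the last watch for an address is removed, no heartbeat state is left behind on either side.

```python
from dataclasses import dataclass, field


@dataclass
class Watchee:
    """Hypothetical watched node: it only answers pings and keeps no watcher state."""
    address: str

    def on_heartbeat(self) -> str:
        # Reply ("pong") with our own address.
        return self.address


@dataclass
class Watcher:
    """Hypothetical watching node that drives the ping-pong exchange."""
    watching: dict = field(default_factory=dict)         # address -> set of watched actor names
    last_reply_tick: dict = field(default_factory=dict)  # address -> tick of the last reply

    def watch(self, address: str, actor: str) -> None:
        self.watching.setdefault(address, set()).add(actor)

    def unwatch(self, address: str, actor: str) -> None:
        actors = self.watching.get(address, set())
        actors.discard(actor)
        if not actors:
            # Last watch gone: stop pinging and forget the address entirely.
            self.watching.pop(address, None)
            self.last_reply_tick.pop(address, None)

    def tick(self, now: int, nodes: dict) -> list:
        """Ping every watched address, record replies, return the addresses pinged."""
        pinged = sorted(self.watching)
        for address in pinged:
            node = nodes.get(address)  # None models a crashed or unreachable node
            if node is not None:
                self.last_reply_tick[node.on_heartbeat()] = now
        return pinged


watcher = Watcher()
nodes = {"nodeB": Watchee("nodeB")}
watcher.watch("nodeB", "worker-1")
watcher.tick(1, nodes)               # pings nodeB and records its reply
watcher.unwatch("nodeB", "worker-1")
watcher.tick(2, nodes)               # nothing left to ping
```

Because the watchee never initiates anything in this model, a watchee that terminates right after a request cannot leave a dangling heartbeat relationship.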
Patrik Nordwall
551e2d1321 Stop heartbeating when watching node crashes, see #3265 2013-04-25 21:25:46 +02:00
Patrik Nordwall
49744e0b0f Only quarantine removed member that was unreachable, see #2594
* For graceful leaving and removal it should still be possible to
  communicate with the node after cluster removal.
* Otherwise the handover in cluster singleton would break, for
  example.
* Also, skip selfAddress to avoid generating AddressTerminated
  for the node itself when it is removed from the cluster.
2013-04-19 08:52:27 +02:00
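
The rule in the commit above reduces to a small decision function. The following is a minimal sketch, assuming the watcher knows whether the removed member was unreachable before removal; the function name `should_quarantine` and its parameters are hypothetical and do not come from the code base.

```python
def should_quarantine(removed_address: str, was_unreachable: bool, self_address: str) -> bool:
    """Decide whether a member removed from the cluster should be quarantined."""
    if removed_address == self_address:
        # Never act on our own address.
        return False
    # A gracefully leaving member was still reachable and must stay contactable,
    # for example so that a singleton handover can complete.
    return was_unreachable


should_quarantine("nodeB", was_unreachable=True, self_address="nodeA")   # crashed member
should_quarantine("nodeB", was_unreachable=False, self_address="nodeA")  # graceful leave
should_quarantine("nodeA", was_unreachable=True, self_address="nodeA")   # own node
```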
Patrik Nordwall
58bd0a1460 Quarantine from ClusterRemoteWatcher also, see #2993
* This was an oversight in the previous pull request
2013-04-18 16:55:02 +02:00
Patrik Nordwall
ee1b5879cf Race between termination and actorFor, see #3234 2013-04-17 21:03:16 +02:00
Patrik Nordwall
4606612bd1 Reliable remote supervision and death watch, see #2993
* RemoteWatcher that monitors node failures, with heartbeats
  and failure detector
* Move RemoteDeploymentWatcher from CARP to RARP
* ClusterRemoteWatcher that handles cluster nodes
* Update documentation
* UID in Heartbeat msg to be able to quarantine;
  the actual quarantining will be implemented
  in ticket 2594
2013-04-17 19:42:51 +02:00
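
As an illustration of how heartbeats, a failure detector, and a UID in the heartbeat message fit together, here is a minimal sketch. It is not the failure detector used in the commit above: `SimpleDeadlineDetector`, its `acceptable_pause` parameter, and the explicit `now` argument are hypothetical, and the assumption that the UID identifies one incarnation of a node is mine.

```python
class SimpleDeadlineDetector:
    """Hypothetical detector: a node is unavailable once its last heartbeat is too old."""

    def __init__(self, acceptable_pause: float):
        self.acceptable_pause = acceptable_pause
        self.last_seen = {}  # address -> (uid, time of last heartbeat)

    def heartbeat(self, address: str, uid: int, now: float) -> None:
        # Remember which incarnation (uid) of the node answered, and when.
        self.last_seen[address] = (uid, now)

    def unavailable(self, now: float) -> list:
        """Return (address, uid) pairs whose last heartbeat exceeds the pause."""
        return [
            (address, uid)
            for address, (uid, seen_at) in sorted(self.last_seen.items())
            if now - seen_at > self.acceptable_pause
        ]


detector = SimpleDeadlineDetector(acceptable_pause=3.0)
detector.heartbeat("nodeB", uid=42, now=0.0)
detector.heartbeat("nodeC", uid=7, now=2.0)
suspects = detector.unavailable(now=4.0)  # only nodeB has been silent for more than 3.0
```

Returning the UID together with the address lets a caller quarantine that specific incarnation, so a restarted node at the same address would not be confused with the failed one.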