Detect failure when no heartbeats sent, see #2907
* Subscribe to InstantMemberEvent and start heartbeating when InstantMemberUp. Same for metrics. * HeartbeatNodeRing data structure for bidirectional mapping of heartbeat sender and receiver. Not using ConsistentHash anymore. Node addresses are hashed to ensure that neighbors are spread out. * HeartbeatRequest when receiver detects that it has not received expected heartbeats. * New test InitialHeartbeatSpec that simulates the problem * Add/remove some related conf properties * Add some more logging to be able to diagnose eventual problems * Explicit config of nr-of-end-heartbeats
This commit is contained in:
parent
c5685a0855
commit
8b4e903e7d
25 changed files with 466 additions and 146 deletions
|
|
@ -28,12 +28,14 @@ class ClusterConfigSpec extends AkkaSpec {
|
|||
PeriodicTasksInitialDelay must be(1 seconds)
|
||||
GossipInterval must be(1 second)
|
||||
HeartbeatInterval must be(1 second)
|
||||
NumberOfEndHeartbeats must be(4)
|
||||
NumberOfEndHeartbeats must be(8)
|
||||
MonitoredByNrOfMembers must be(5)
|
||||
HeartbeatRequestDelay must be(10 seconds)
|
||||
HeartbeatExpectedResponseAfter must be(3 seconds)
|
||||
HeartbeatRequestTimeToLive must be(1 minute)
|
||||
LeaderActionsInterval must be(1 second)
|
||||
UnreachableNodesReaperInterval must be(1 second)
|
||||
PublishStatsInterval must be(10 second)
|
||||
JoinTimeout must be(60 seconds)
|
||||
AutoJoin must be(true)
|
||||
AutoDown must be(false)
|
||||
MinNrOfMembers must be(1)
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue