pekko

Author	SHA1	Message	Date
Johan Andrén	b7cc50cdd6	2.5.10 wire protocol regression (#24625 )	2018-02-28 09:46:37 +01:00
Renato Cavalcanti	c83e4adfea	Rolling update config checker, #24009 * adds config compatibility check * doc'ed what happens when joining a cluster not supporting this feature * added extra docs over sensitive paths	2018-02-20 15:47:09 +01:00
Christopher Batey	009214ae07	Update copyright to 2018 (#24241 )	2018-01-04 17:26:29 +00:00
Christopher Batey	5a37cdc862	Cross DC gossip fixes #23803 * Adjust cross DC gossip probability for small nr of nodes in a DC When a Dc is being bootstrapped the initial node has no local peers and can not gossip if it selects a local gossip round. Start at a probability of 1.0 for a single node cluster and move down 0.25 per node until a 5 node DC is reached then use the cross-data-center-gossip-probability * Fix cross DC gossip selecting of oldest members This used to select the members based on the sort order members in Gossip (by address) rather than by upNumber	2017-11-02 09:17:24 +01:00
Arnout Engelen	9cb5849188	Accept 'Join' messages from nodes without dc (#23822 ) * Accept 'Join' messages from nodes without dc To allow a join from a 2.4 node to a 2.5.6 cluster. * Use "ClusterSettings.DefaultDataCenter" constant	2017-10-23 04:49:51 -05:00
Patrik Nordwall	6ed3295acd	Merge branch 'master' into wip-multi-dc-merge-master-patriknw	2017-08-31 10:51:12 +02:00
Sébastien Lorion	a95a94acff	Replace ClusterRouterGroup/Pool "use-role" with "use-role-set" #23496	2017-08-09 16:06:18 +02:00
Patrik Nordwall	bb9549263e	Rename team to data center, #23275	2017-07-04 17:11:21 +02:00
Arnout Engelen	58db22ca1e	Introduce missing team role if necessary (#23276 ) * Introduce missing team role if necessary (#23243) When receiving gossip from a node that did not contain any team information (such as gossip from a node running a previous version of Akka), add the default team role during deserialization. * Simpler implementation of adding default role * More efficient `rolesFromProto` Now actually outperforms the previous implementation. Still room for improvement as this probably checks for duplicates in the set on each add, but creating our own array-backed set here is probably going overboard :). * Fixes following rebase	2017-07-04 14:52:03 +02:00
Johan Andrén	164387a89e	[WIP] one leader per cluster team (#23239 ) * Guarantee no sneaky type puts more teams in the role list * Leader per team and initial tests * MiMa filters * Second iteration (not working though) * Verbose gossip logging etc. * Gossip to team-nodes even if there is inter-team unreachability * More work ... * Marking removed nodes with tombstones in Gossip * More test coverage for Gossip.remove * Bug failing other multi-node tests squashed * Multi-node test for team-split * Review fixes - only prune tombstones on leader ticks * Clean code is happy code. * All I want is for MiMa to be my friend * These constants are internal * Making the formatting gods happy * I used the wrong reachability for ignoring gossip :/ * Still hadn't quite gotten how reachability was supposed to work * Review feedback applied * Cross-team downing should still work * Actually prune tombstones in the prune tombstones method ... * Another round against reachability. Reachability leading with 15 - 2 so far.	2017-07-04 10:09:40 +02:00
Johan Andrén	3643f18ded	Protobuf serializers for remote deployment #22332	2017-03-16 15:12:35 +01:00
Patrik Nordwall	452b3f1406	remove old deprecated cluster metrics, #21423 * corresponding was moved to akka-cluster-metrics, see http://doc.akka.io/docs/akka/2.4/project/migration-guide-2.3.x-2.4.x.html#New_Cluster_Metrics_Extension	2017-01-20 13:48:36 +01:00
Patrik Nordwall	84ade6fdc3	add CoordinatedShutdown, #21537 * CoordinatedShutdown that can run tasks for configured phases in order (DAG) * coordinate handover/shutdown of singleton with cluster exiting/shutdown * phase config obj with depends-on list * integrate graceful leaving of sharding in coordinated shutdown * add timeout and recover * add some missing artery ports to tests * leave via CoordinatedShutdown.run * optionally exit-jvm in last phase * run via jvm shutdown hook * send ExitingConfirmed to leader before shutdown of Exiting to not have to wait for failure detector to mark it as unreachable before removing * the unreachable signal is still kept as a safe guard if message is lost or leader dies * PhaseClusterExiting vs MemberExited in ClusterSingletonManager * terminate ActorSystem when cluster shutdown (via Down) * add more predefined and custom phases * reference documentation * migration guide * problem when the leader order was sys2, sys1, sys3, then sys3 could not perform it's duties and move Leving sys1 to Exiting because it was observing sys1 as unreachable * exclude Leaving with exitingConfirmed from convergence condidtion	2017-01-16 09:01:57 +01:00
Philippus Baalman	6c7085252a	extended copyright into 2017	2017-01-04 17:37:15 +01:00
Johan Andrén	d6c048f59a	A simpler ActorRefProvider config #20649 (#20767 ) * Provide shorter aliases for the ActorRefProviders #20649 * Use the new actorefprovider aliases throughout code and docs * Cleaner alias replacement logic	2016-06-10 15:04:13 +02:00
Björn Antonsson	c66ce62d63	Update to a working version of Scalariform	2016-06-02 22:12:36 +02:00
Patrik Nordwall	9f659cf9b1	remove JUnitRunner annotation, #16112 * it was used for running tests from inside Eclipse, but since it caused some trouble we remove it	2016-04-05 17:06:58 +02:00
Johannes Rudolph	b6cbc7f13a	=all remove unused imports	2016-02-23 20:29:22 +01:00
Johan Andrén	62e30b3c08	Update copyrights and links to the new company name #19851	2016-02-23 12:58:39 +01:00
Prayag Verma	b7783968a0	=pro #19068 All copyrights ranges and single years updated to a range ending in 2016	2016-01-25 10:20:30 +01:00
Julian Tescher	00f6a58e7c	Changes all occurances of Typesafe copyright to extend to 2015	2015-03-10 14:12:19 -07:00
Patrik Nordwall	30df518421	=tes Use ConversionCheckedTripleEquals	2015-03-10 08:17:03 +01:00
Andrei Pozolotin	7b9f77a073	+ akka-cluster-metrics: new akka module * new akka module split from akka-cluster * provide sigar provisioning * fix ewma usage * resolve #16121 * see #16354	2015-01-19 10:23:54 -06:00
Adam Voss	cce29dfa51	Changes all occurances of Typesafe copyright to extend to 2014.	2014-02-04 21:20:09 -06:00
Björn Antonsson	003609c9c5	=pro #3759 Changed to using non-deprecated ScalaTest Matchers	2013-12-18 11:32:51 +01:00
Patrik Nordwall	eaad7ecf7e	!clu #3683 Change cluster heartbeat to req/rsp protocol * The previous one-way hearbeat was elegant, but comlicated to understand and without giving much extra value compared to this approach. * The previous one-way heartbeat have some kind of bug when joining several (10-20) nodes at approximately the same time (but not exactly the same time) with a false failure detection triggered by the extra heartbeat, which would not heal. * This ping-pong approach will increase network traffic slightly, but heartbeat messages are small and each node is limited to monitor (default) 5 peers.	2013-11-15 08:18:52 +01:00
Patrik Nordwall	7d5a3ec30b	!clu #3657 Lazy deserialization and TTL of Gossip message payload	2013-10-18 08:29:46 +02:00
Patrik Nordwall	dc9fe4f19c	!clu #2307 Allow transition from unreachable to reachable * Replace unreachable Set with Reachability table * Unreachable members stay in member Set * Downing a live member was moved it to the unreachable Set, and then removed from there by the leader. That will not work when flipping back to reachable, so a Down member must be detected as unreachable before beeing removed. Similar to Exiting. Member shuts down itself if it sees itself as Down. * Flip back to reachable when failure detector monitors it as available again * ReachableMember event * Can't ignore gossip from aggregated unreachable (see SurviveNetworkInstabilitySpec) * Make use of ReachableMember event in cluster router * End heartbeat when acknowledged, EndHeartbeatAck * Remove nr-of-end-heartbeats from conf * Full reachability info in JMX cluster status * Don't use interval after unreachable for AccrualFailureDetector history * Add QuarantinedEvent to remoting, used for Reachability.Terminated * Prune reachability table when all reachable * Update documentation * Performance testing and optimizations	2013-09-11 13:10:29 +02:00
Patrik Nordwall	a0a0f39613	Hardening of cluster member leaving path, see #3309 * Removed leader commands for Shutdown and Exit * Member shutdown itself when it sees itself as Exiting * Singleton cluster with status Exiting will shutdown itself, in case the Exiting gossip never arrives * Exiting member not part convergence check * Exiting member is removed by leader (on convergence) when the exiting member is in the unreachable set, i.e. sucessfully shutdown * Reverted the change made for #3266, i.e. Exiting is detected as unreachable again. * Adjust ClusterSingletonManager to new Exiting behaviour * Fix bug in HeartbeatSender, which caused it to continue to send heartbeats to removed nodes, instead of rebalancing * Refactoring of leaderActions method * Leaving section in docs	2013-05-17 11:39:49 +02:00
Patrik Nordwall	6635ac4032	Reduce amount of gossip data transferred in idle cluster, see #3279 * When seen same the gossip chat is initated with GossipStatus message containing the vclock only * Remove conversation flag in GossipEnvelope * Ordinary tell instead of actorSelection when replying	2013-05-02 19:17:09 +02:00
Patrik Nordwall	671ebf8909	Additional tests of ClusterMessageSerializer	2013-05-02 19:17:08 +02:00
Patrik Nordwall	9e56ab6fe5	Disallow re-joining, see #2873 * Disallow join requests when already part of a cluster * Remove wipe state when joining, since join can only be performed from empty state * When trying to join, only accept gossip from that member * Ignore gossips from unknown (and unreachable) members * Make sure received gossip contains selfAddress * Test join of fresh node with same host:port * Remove JoinTwoClustersSpec * Welcome message as reply to Join * Retry unsucessful join request * AddressUidExtension * Uid in cluster Member identifier To be able to distinguish nodes with same host:port after restart. * Ignore gossip with wrong uid * Renamed Remove command to Shutdown * Use uid in vclock identifier * Update sample, Member apply is private * Disabled config duration syntax and cleanup of io settings * Update documentation	2013-04-17 16:48:18 +02:00
Björn Antonsson	73f0f44ddb	Protobuf serialization of cluster messages. See #1910	2013-04-11 10:09:05 +02:00

33 commits