pekko

Author	SHA1	Message	Date
Patrik Nordwall	0826689c47	=clu #3603 Handle removed member in Gossip and Reachability merge * It was a regression introduced in `dc9fe4f` * Two problems: 1) Gossip merge could pop back removed member (was previously covered by the filter of unreachable) 2) Reachability merge didn't handle all cases for removed member, i.e. when node not in allowed set	2013-09-13 17:18:27 +02:00
Patrik Nordwall	dc9fe4f19c	!clu #2307 Allow transition from unreachable to reachable * Replace unreachable Set with Reachability table * Unreachable members stay in member Set * Downing a live member was moved it to the unreachable Set, and then removed from there by the leader. That will not work when flipping back to reachable, so a Down member must be detected as unreachable before beeing removed. Similar to Exiting. Member shuts down itself if it sees itself as Down. * Flip back to reachable when failure detector monitors it as available again * ReachableMember event * Can't ignore gossip from aggregated unreachable (see SurviveNetworkInstabilitySpec) * Make use of ReachableMember event in cluster router * End heartbeat when acknowledged, EndHeartbeatAck * Remove nr-of-end-heartbeats from conf * Full reachability info in JMX cluster status * Don't use interval after unreachable for AccrualFailureDetector history * Add QuarantinedEvent to remoting, used for Reachability.Terminated * Prune reachability table when all reachable * Update documentation * Performance testing and optimizations	2013-09-11 13:10:29 +02:00
Patrik Nordwall	cd2b77157c	Log dead letters, see #3453	2013-06-20 12:09:09 +02:00
Patrik Nordwall	ad1eaa6d4a	Remove auto-join config, derive from seed-nodes, see #3359	2013-05-17 13:54:51 +02:00
Roland	b3db19ee05	Merge branch 'wip-3281-NullMessage-∂π'	2013-05-03 19:40:36 +02:00
Patrik Nordwall	6635ac4032	Reduce amount of gossip data transferred in idle cluster, see #3279 * When seen same the gossip chat is initated with GossipStatus message containing the vclock only * Remove conversation flag in GossipEnvelope * Ordinary tell instead of actorSelection when replying	2013-05-02 19:17:09 +02:00
Patrik Nordwall	33a8808a6d	Enable usage of MultiJvm nrOfNodes in cluster StressSpec, see #2787 * Adjustments to StressSpec for testing large clusters * Performance improvement of mute deadLetters	2013-05-02 19:17:08 +02:00
Roland	738796c625	remove NullMessage, see #3281	2013-05-02 18:48:36 +02:00
Patrik Nordwall	2ebb2a0b9c	Solve UnreachableNodeJoinsAgainSpec problem, see #3247 * UnreachableNodeJoinsAgain failed because of gated connection * Removed default test value of retry-gate-closed-for, instead default from reference.conf is used, i.e. 0s * deadLetters logging love	2013-04-23 15:43:10 +02:00
Patrik Nordwall	428e71690f	Coordinate shutdown of ClusterDeathWatchSpec with messages, see #3255 * Added MultiNodeClusterSpec.EndActor for this purpose * Changed UnreachableNodeJoinsAgainSpec to use the same	2013-04-23 11:56:12 +02:00
Patrik Nordwall	806fc0c525	Use awaitAssert in cluster tests, see #3168	2013-03-25 13:08:06 +01:00
Björn Antonsson	5827a27b94	Make joining to the same node multiple times work, and reenable blackhole test. See #2930	2013-03-20 12:22:12 +01:00
Patrik Nordwall	a67fa18f8d	Reduce unwanted logging from remoting, see #2826 * Handle logging in EndpointManager supervisorStrategy * Added some more exception types to be able to differentiate failures	2013-03-11 13:36:00 +01:00
Rich Dougherty	a78e26ae41	Merge pull request #1211 from akka/wip-3110-test-thread-dump-rich Dump diagnostics (Coroner's Report) if a test takes too long. Fixes #3110	2013-03-07 23:21:35 -08:00
Rich Dougherty	f6a14cb5ac	Dump diagnostics (Coroner's Report) if a test takes too long. Fixes #3110	2013-03-08 12:39:19 +13:00
Patrik Nordwall	d72d48301a	Joining member should not be leader, see #3021 * Prefer members with status Up or Leaving, and as fallback use Joining, Exiting, Down * Minor scaladoc fixes	2013-03-07 14:07:17 +01:00
Patrik Nordwall	5b844ec1e6	Publish member events when state change first seen, see #3075 * Remove InstantMemberEvent	2013-03-07 14:07:17 +01:00
Patrik Nordwall	2476831705	Rename event-handlers to loggers, see #2979 * Rename config akka.event-handlers to akka.loggers * Rename config akka.event-handler-startup-timeout to akka.logger-startup-timeout * Rename JulEventHandler to JavaLogger * Rename Slf4jEventHandler to Slf4jLogger * Change all places in tests and docs * Deprecation, old still works, but with warnings * Migration guide * Test for the deprecated event-handler config	2013-02-05 11:19:02 +01:00
Patrik Nordwall	157a25bcde	Failure detector refactoring, see #2690 * Failure detector was previously copied with refactoring to akka-remote and this refactoring makes use of that and removes the failure detector in akka-cluster * Adjustments to reference.conf * Refactoring of FailureDetectorPuppet	2013-02-01 10:08:39 +01:00
Patrik Nordwall	5dc108567d	Style change of def starting with if * When a def starts with if and is not a oneliner the if should be on a new line. * The reason is that it might be easy to miss the if when reading the code.	2013-01-18 13:28:49 +01:00
Patrik Nordwall	8b4e903e7d	Detect failure when no heartbeats sent, see #2907 * Subscribe to InstantMemberEvent and start heartbeating when InstantMemberUp. Same for metrics. * HeartbeatNodeRing data structure for bidirectional mapping of heartbeat sender and receiver. Not using ConsistentHash anymore. Node addresses are hashed to ensure that neighbors are spread out. * HeartbeatRequest when receiver detects that it has not received expected heartbeats. * New test InitialHeartbeatSpec that simulates the problem * Add/remove some related conf properties * Add some more logging to be able to diagnose eventual problems * Explicit config of nr-of-end-heartbeats	2013-01-18 12:54:09 +01:00
Viktor Klang	adfeb2c1f0	#2879 - updating copyright info	2013-01-09 11:38:00 +01:00
Patrik Nordwall	f147f4d3d2	Stress / long running test of cluster, see #2786 * akka.cluster.StressSpec * Configurable number of nodes and duration for each step * Report metrics and phi periodically to see progress * Configurable payload size * Test of various join and remove scenarios * Test of watch * Exercise supervision * Report cluster stats * Test with many actors in tree structure Apart from the test this commit also solves some issues: * Avoid adding back members when downed in ClusterHeartbeatSender * Avoid duplicate close of ClusterReadView * Add back the publish of AddressTerminated when MemberDowned/Removed it was lost in merge of "publish on convergence", see #2779	2013-01-07 14:44:36 +01:00
Björn Antonsson	a03460329d	Change cluster MemberEvents to only be published on convergence. See #2692 Conflicts: akka-cluster/src/main/scala/akka/cluster/ClusterEvent.scala akka-cluster/src/main/scala/akka/cluster/ClusterJmx.scala akka-cluster/src/main/scala/akka/cluster/ClusterMetricsCollector.scala akka-cluster/src/main/scala/akka/cluster/ClusterReadView.scala akka-cluster/src/multi-jvm/scala/akka/cluster/MultiNodeClusterSpec.scala akka-docs/rst/cluster/cluster-usage-java.rst akka-docs/rst/cluster/cluster-usage-scala.rst akka-kernel/src/main/dist/bin/akka-cluster	2012-12-14 12:46:13 +01:00
Patrik Nordwall	503e992d44	Await leader in awaitUpConvergence, see #2752 It's a glitch in how ClusterReadView (used in tests) is updated. First we do awaitUpConvergence, which checks readView.members and readView.convergence. Then we do assertLeader, which checks readView.leader. The problem is that readView.leader might not have been updated yet (if there already was convergence). Solution is to await the expected leader in awaitUpConvergence, so that readView is in a consistent state after awaitUpConvergence. We will change the semantics in ticket #2692, but this change will be needed and should work with that as well.	2012-12-04 17:07:08 +01:00
Patrik Nordwall	1914be7069	Merge branch 'master' into wip-2547-metrics-router-patriknw Conflicts: akka-actor/src/main/scala/akka/actor/Deployer.scala akka-cluster/src/main/scala/akka/cluster/ClusterMetricsCollector.scala akka-cluster/src/test/scala/akka/cluster/MetricsCollectorSpec.scala	2012-11-15 12:33:11 +01:00
Viktor Klang	8f131c680f	Switching to immutable.Seq instead of Seq	2012-11-12 14:17:47 +01:00
Patrik Nordwall	c9d206764a	ClusterLoadBalancingRouter and refactoring of metrics, see #2547 * MetricsSelector, calculate capacity, weights and allocate weighted routee refs * ClusterLoadBalancingRouterSpec * Optional heap max * Constants for the metric fields * Refactoring of Metric and decay * Rewrite of DataStreamSpec * Correction of EWMA and removal of BigInt, BigDecimal * Separation of MetricsCollector into trait and two classes, SigarMetricsCollector and JmxMetricsCollector * This will reduce cost when sigar is not installed, such as avoiding throwing and catching exc for every call * Improved error handling for loading sigar * Made MetricsCollector implementation configurable * Tested with sigar	2012-11-07 20:36:24 +01:00
Björn Antonsson	977194ff8e	Improve MultiNodeSpec ifNode syntax. #2126	2012-11-01 15:16:02 +01:00
Patrik Nordwall	b52a082279	Even better assert message in case of failure of assertLeader, see #2641	2012-10-19 17:28:20 +02:00
Patrik Nordwall	2999e9a43b	Better assert message in case of failure of assertLeader, see #2641	2012-10-19 17:03:45 +02:00
Roland	0f04239f67	move Duration classes according to scala 2.10 nightly and remove casts to FiniteDuration, see #2504	2012-10-11 15:18:10 -07:00
Patrik Nordwall	8476b2195c	Mute expected log messages, see #2010	2012-10-01 20:12:36 +02:00
Patrik Nordwall	88a9123724	Move heartbeat-interval into failure-detector section	2012-09-20 15:23:18 +02:00
Patrik Nordwall	9423d37da9	Merge branch 'master' into wip-cluster-docs-patriknw Conflicts: project/AkkaBuild.scala	2012-09-20 10:40:08 +02:00
Patrik Nordwall	068335789c	Cluster config setting to disable jmx, see #2531	2012-09-20 08:09:01 +02:00
Roland	35b7a9e338	second round of FiniteDuration business, including cluster fixes - make Scheduler only accept FiniteDuration, which has quite some knock-on effects	2012-09-18 09:58:30 +02:00
Björn Antonsson	afe30e9038	Removed all dependencies to ScalaTest in the published artifacts. See #1802	2012-09-12 15:12:13 +02:00
Patrik Nordwall	018a949678	Merge branch 'master' into wip-2103-cluster-routers-patriknw Conflicts: akka-cluster/src/multi-jvm/scala/akka/cluster/ClientDowningNodeThatIsUnreachableSpec.scala akka-cluster/src/multi-jvm/scala/akka/cluster/ClientDowningNodeThatIsUpSpec.scala akka-cluster/src/multi-jvm/scala/akka/cluster/ConvergenceSpec.scala akka-cluster/src/multi-jvm/scala/akka/cluster/LeaderDowningNodeThatIsUnreachableSpec.scala akka-cluster/src/multi-jvm/scala/akka/cluster/LeaderElectionSpec.scala akka-cluster/src/multi-jvm/scala/akka/cluster/MultiNodeClusterSpec.scala akka-cluster/src/multi-jvm/scala/akka/cluster/SingletonClusterSpec.scala akka-cluster/src/multi-jvm/scala/akka/cluster/SplitBrainSpec.scala akka-cluster/src/multi-jvm/scala/akka/cluster/UnreachableNodeRejoinsClusterSpec.scala akka-cluster/src/test/scala/akka/cluster/ClusterSpec.scala akka-remote/src/main/scala/akka/routing/RemoteRouterConfig.scala	2012-09-11 10:55:47 +02:00
Patrik Nordwall	bd6c39178c	Fix leaking this in constructor of Cluster, see #2473 * Major refactoring to remove the need to use special Cluster instance for testing. Use default Cluster extension instead. Most of it is trivial changes. * Used failure-detector.implementation-class from config to swap to Puppet * Removed FailureDetectorStrategy, since it doesn't add any value * Added Cluster.joinSeedNodes to be able to test seedNodes when Addresses are unknown before startup time. * Removed ClusterEnvironment that was passed around among the actors, instead they use the ordinary Cluster extension. * Overall much cleaner design	2012-09-06 21:48:40 +02:00
Patrik Nordwall	f4cc8f8649	Make it more intuitive for tests using real Cluster(system) extension, see #2103 * We will write more tests that rely on real Cluster(system) extension, such as ClusterRoundRobinRoutedActorSpec * When not using FailureDetectorStrategy or overriding seed nodes MultiNodeClusterSpec will use the real Cluster(system) extension instead of a new Cluster instance with additional test facilities	2012-08-28 16:38:05 +02:00
Patrik Nordwall	417bdc2dfb	Prototype of cluster aware routers, see #2103 * Several FIXME that needs to be discussed * ClusterRouterConfig created via ClusterActorRefProvider	2012-08-28 11:54:52 +02:00
Patrik Nordwall	1700edb863	Merge branch 'master' into wip-2202-cluster-domain-events-patriknw Conflicts: akka-cluster/src/main/scala/akka/cluster/ClusterDaemon.scala	2012-08-16 18:54:10 +02:00
Patrik Nordwall	4e2d7b0495	Move Cluster query methods to ClusterReadView, see #2202 * Better separation of concerns * Internal API (could be made public if requested)	2012-08-16 18:28:01 +02:00
Patrik Nordwall	06f81f4373	Improve publish of domain events, see #2202 * Gossip is not exposed in user api * Better and more events * Snapshot event sent to new subscriber * Updated tests * Periodic publish only for internal stats	2012-08-15 16:47:34 +02:00
Patrik Nordwall	d7b0089d7e	Support concurrent startup of seed nodes, see #2270 * Implemented the startup sequence of seed nodes as described in #2305 * Test that verifies concurrent startup of seed nodes	2012-08-14 15:16:23 +02:00
Viktor Klang	1114da2198	Importing language features used by akka-cluster and akka-remote-tests	2012-07-26 14:47:21 +02:00
Viktor Klang	ac5b5de90a	Merging in master, huge work trying to get things to compile, tests not green at this stage	2012-07-06 17:04:04 +02:00
Patrik Nordwall	c708d2ad8a	First step in refactoring of cluster internals to actors, see #2311 * Move clustering code to ClusterCore actor * More will be done, comitting this for early review	2012-07-04 13:52:14 +02:00
Viktor Klang	54a3a44bf8	#2292 - Removing akka.util.Duration etc and replace it with scala.concurrent.util.Duration	2012-06-29 13:33:20 +02:00

1 2

87 commits