431 commits

Author SHA1 Message Date
Patrik Nordwall
c449f5afff +clu #3636 Revert join to uninitialized
* Revert the change introduced in
  https://github.com/akka/akka/pull/1738/files
* The cleanup/improvements aside from the actual
  feature are not reverted by this patch
* Clarify the documentation
2013-10-14 17:53:07 +02:00
Patrik Nordwall
d3f295e5fe Merge pull request #1738 from akka/wip-3612-join-self-patriknw
+clu #3612 Allow join to uninitialized node
2013-09-29 22:41:15 -07:00
Patrik Nordwall
cb42bf0785 +clu #3612 Allow join to uninitialized node
* join to self not needed when performing manual joining
2013-09-27 14:40:09 +02:00
Patrik Nordwall
d5b25cbbc6 !act #3583 Timer based auto-down
* Replace (deprecate) akka.cluster.auto-down config setting with
  akka.cluster.auto-down-unreachable-after
* AutoDown actor that keeps track of unreachable members
  and performs down from the leader node when they have been
  unreachable for the specified duration
* Migration guide
2013-09-27 14:32:03 +02:00
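
The timer-based policy described in the entry above can be illustrated with a small model. This is only a sketch under stated assumptions, not Akka's AutoDown actor: the class, its method names, and the use of plain float timestamps are hypothetical, and `down_after` merely stands in for the akka.cluster.auto-down-unreachable-after setting.

```python
from dataclasses import dataclass, field


@dataclass
class AutoDownTracker:
    """Toy model of a timer-based auto-down policy (hypothetical, not Akka code)."""

    # Stand-in for akka.cluster.auto-down-unreachable-after, in seconds.
    down_after: float
    # Maps member address -> time at which it was first seen unreachable.
    unreachable_since: dict = field(default_factory=dict)

    def mark_unreachable(self, member: str, now: float) -> None:
        # Keep the earliest timestamp so repeated reports do not reset the timer.
        self.unreachable_since.setdefault(member, now)

    def mark_reachable(self, member: str) -> None:
        # A member that becomes reachable again is no longer a down candidate.
        self.unreachable_since.pop(member, None)

    def members_to_down(self, now: float, is_leader: bool) -> list:
        # Only the leader performs the down; other nodes just keep tracking.
        if not is_leader:
            return []
        return sorted(
            m for m, since in self.unreachable_since.items()
            if now - since >= self.down_after
        )
```

Keeping the first-seen timestamp means a member that stays unreachable is downed after the configured duration no matter how often the unreachable report is repeated.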
Patrik Nordwall
0826689c47 =clu #3603 Handle removed member in Gossip and Reachability merge
* It was a regression introduced in dc9fe4f
* Two problems:
  1) Gossip merge could pop back removed member (was previously
     covered by the filter of unreachable)
  2) Reachability merge didn't handle all cases for removed member,
     i.e. when node not in allowed set
2013-09-13 17:18:27 +02:00
Patrik Nordwall
dc9fe4f19c !clu #2307 Allow transition from unreachable to reachable
* Replace unreachable Set with Reachability table
* Unreachable members stay in member Set
* Downing a live member previously moved it to the unreachable Set,
  from which the leader then removed it. That will not
  work when flipping back to reachable, so a Down member must
  be detected as unreachable before being removed, similar
  to Exiting. A member shuts itself down if it sees itself as
  Down.
* Flip back to reachable when failure detector monitors it as
  available again
* ReachableMember event
* Can't ignore gossip from aggregated unreachable (see SurviveNetworkInstabilitySpec)
* Make use of ReachableMember event in cluster router
* End heartbeat when acknowledged, EndHeartbeatAck
* Remove nr-of-end-heartbeats from conf
* Full reachability info in JMX cluster status
* Don't use interval after unreachable for AccrualFailureDetector history
* Add QuarantinedEvent to remoting, used for Reachability.Terminated
* Prune reachability table when all reachable
* Update documentation
* Performance testing and optimizations
2013-09-11 13:10:29 +02:00
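
The idea of replacing an unreachable Set with a table, flipping records back to reachable, and pruning the table once everything is reachable can be sketched roughly as follows. This is a toy model and not Akka's Reachability class: the (observer, subject) keying, the string statuses, and all method names are assumptions made for illustration.

```python
class ReachabilityTable:
    """Toy reachability table keyed by (observer, subject); hypothetical, not Akka code."""

    def __init__(self):
        # (observer, subject) -> "reachable" | "unreachable"
        self.records = {}

    def unreachable(self, observer, subject):
        self.records[(observer, subject)] = "unreachable"

    def reachable(self, observer, subject):
        # Flip back only if this observer had a record for the subject.
        if (observer, subject) in self.records:
            self.records[(observer, subject)] = "reachable"
        self._prune()

    def _prune(self):
        # Once every record is reachable again the table carries no information.
        if all(status == "reachable" for status in self.records.values()):
            self.records.clear()

    def is_reachable(self, subject):
        # A subject counts as unreachable while any observer still reports it so.
        return not any(
            subj == subject and status == "unreachable"
            for (_, subj), status in self.records.items()
        )

    def all_unreachable(self):
        return sorted(
            {subj for (_, subj), status in self.records.items() if status == "unreachable"}
        )
```

Unlike a plain set, the table remembers which observer made each report, so a subject only becomes reachable again when every observer that reported it has flipped back.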
Endre Sándor Varga
b566e9393d =act, rem, clu #3521: make serialize-messages work with core modules 2013-08-27 11:05:54 +02:00
Patrik Nordwall
f34909c490 unhandled msg in cluster StressSpec 2013-06-25 22:10:05 +02:00
Endre Sándor Varga
bf3180c0f3 Test was not using the view from the fresh system #3460 2013-06-24 11:54:46 +02:00
Patrik Nordwall
cd2b77157c Log dead letters, see #3453 2013-06-20 12:09:09 +02:00
Roland Kuhn
8df8541801 Merge pull request #1500 from akka/wip-3210-local-only-∂π
make LocalScope mean “purely local” and avoid Props serialization check,...
2013-05-30 08:03:32 -07:00
Roland
92db59183e make LocalScope mean “purely local” and avoid Props serialization check, see #3210 2013-05-29 23:36:39 +02:00
Patrik Nordwall
a323936299 Disable cluster stats by default, see #3348
* Add VectorClockStats
2013-05-28 16:15:57 +02:00
Patrik Nordwall
c05262f406 Change default tick-duration to 10ms, see #3387 2013-05-27 09:20:25 +02:00
Patrik Nordwall
ee6e80d31a Add previousStatus in MemberRemoved, see #3252 2013-05-23 11:09:32 +02:00
Patrik Nordwall
8f04b53ac7 Merge pull request #1443 from akka/wip-3359-auto-join-patriknw
Remove auto-join config, derive from seed-nodes, see #3359
2013-05-17 04:57:07 -07:00
Patrik Nordwall
ad1eaa6d4a Remove auto-join config, derive from seed-nodes, see #3359 2013-05-17 13:54:51 +02:00
Patrik Nordwall
a0a0f39613 Hardening of cluster member leaving path, see #3309
* Removed leader commands for Shutdown and Exit
* Member shuts itself down when it sees itself as Exiting
* Singleton cluster with status Exiting will shut itself down,
  in case the Exiting gossip never arrives
* Exiting member is not part of the convergence check
* Exiting member is removed by leader (on convergence) when the
  exiting member is in the unreachable set, i.e. successfully shut down
* Reverted the change made for #3266, i.e. Exiting is
  detected as unreachable again.
* Adjust ClusterSingletonManager to new Exiting behaviour
* Fix bug in HeartbeatSender, which caused it to continue to
  send heartbeats to removed nodes, instead of rebalancing
* Refactoring of leaderActions method
* Leaving section in docs
2013-05-17 11:39:49 +02:00
Patrik Nordwall
9ec0b7ca71 Better test timeouts in UnreachableNodeJoinsAgainSpec, see #3285 2013-05-09 21:27:23 +02:00
Björn Antonsson
e00ab533bb Wait on shutdown of extra actor systems in tests. See #3217 2013-05-07 11:02:03 +02:00
Roland
b3db19ee05 Merge branch 'wip-3281-NullMessage-∂π' 2013-05-03 19:40:36 +02:00
Patrik Nordwall
b8b65c9153 Cluster member age, and usage in singleton, see #3195
* Assign internal upNumber when member is moved to Up
* Public API Member.isOlder
* Change cluster singleton to use oldest member instead of leader
* Update samples and docs
2013-05-03 13:38:35 +02:00
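
As a rough illustration of the member-age idea in the entry above: a number is assigned when a member is moved to Up, and the singleton is placed on the oldest member rather than the leader. The Python names below (`up_number`, `move_to_up`, `oldest`) are hypothetical stand-ins; only upNumber and Member.isOlder are named in the commit message, and the global counter is a simplification.

```python
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True)
class Member:
    address: str
    up_number: int  # hypothetical counterpart of the internal upNumber

    def is_older(self, other: "Member") -> bool:
        # A lower up_number means the member was moved to Up earlier.
        return self.up_number < other.up_number


# Simplified global counter; a real cluster would derive this from shared state.
_up_counter = count(1)


def move_to_up(address: str) -> Member:
    # Assign the next number at the moment the member is moved to Up.
    return Member(address, next(_up_counter))


def oldest(members):
    # The singleton would run on the member that has been Up the longest.
    return min(members, key=lambda m: m.up_number)
```

Age is stable over the lifetime of a member, whereas the leader can change as membership changes, which is one plausible reason to prefer the oldest member for hosting a singleton.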
Patrik Nordwall
e2b0fdeff9 Move barriers of conditional tests in StressSpec, see #2787 2013-05-02 20:23:14 +02:00
Patrik Nordwall
6635ac4032 Reduce amount of gossip data transferred in idle cluster, see #3279
* When seen same, the gossip chat is initiated with a GossipStatus
  message containing the vclock only
* Remove conversation flag in GossipEnvelope
* Ordinary tell instead of actorSelection when replying
2013-05-02 19:17:09 +02:00
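
The saving described in the entry above comes from exchanging only a version vector when both sides are already in sync. The sketch below shows one way such a decision could look; it is an assumption for illustration, not the code from this commit. Vector clocks are modelled as plain `{node: counter}` dicts and the reply labels are made up.

```python
def compare(a: dict, b: dict) -> str:
    """Compare two vector clocks given as {node: counter} dicts."""
    nodes = set(a) | set(b)
    a_ahead = any(a.get(n, 0) > b.get(n, 0) for n in nodes)
    b_ahead = any(b.get(n, 0) > a.get(n, 0) for n in nodes)
    if a_ahead and b_ahead:
        return "concurrent"
    if a_ahead:
        return "after"
    if b_ahead:
        return "before"
    return "same"


def reply_to_status(local_version: dict, remote_version: dict) -> str:
    """Decide how to answer a status message that carries only the sender's clock."""
    order = compare(local_version, remote_version)
    if order == "same":
        # Both sides already agree, so no payload needs to be sent.
        return "nothing"
    if order == "before":
        # The local node is behind: answer with its own status so the sender replies in full.
        return "send-status"
    # The local node is ahead or the clocks conflict: send the full local state.
    return "send-full-gossip"
```

In an idle cluster every comparison yields "same", so only the small status message travels over the wire.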
Patrik Nordwall
33a8808a6d Enable usage of MultiJvm nrOfNodes in cluster StressSpec, see #2787
* Adjustments to StressSpec for testing large clusters
* Performance improvement of mute deadLetters
2013-05-02 19:17:08 +02:00
Roland
738796c625 remove NullMessage, see #3281 2013-05-02 18:48:36 +02:00
dario.rexin
3e8597d94b more deprecation warnings removed 2013-04-26 13:54:10 +02:00
Patrik Nordwall
b62f5f46f5 Merge pull request #1361 from akka/wip-3247-UnreachableNodeJoinsAgainSpec-patriknw
Solve UnreachableNodeJoinsAgainSpec problem, see #3247
2013-04-25 05:32:08 -07:00
Patrik Nordwall
19283207ff Additional coordination of shutdown in ClusterDeathWatchSpec, see #3255
* Added testconductor shutdown of actor system
* Renamed previous shutdown to exit
2013-04-24 11:58:07 +02:00
Patrik Nordwall
2ebb2a0b9c Solve UnreachableNodeJoinsAgainSpec problem, see #3247
* UnreachableNodeJoinsAgain failed because of gated connection
* Removed default test value of retry-gate-closed-for, instead
  default from reference.conf is used, i.e. 0s
* deadLetters logging love
2013-04-23 15:43:10 +02:00
Patrik Nordwall
428e71690f Coordinate shutdown of ClusterDeathWatchSpec with messages, see #3255
* Added MultiNodeClusterSpec.EndActor for this purpose
* Changed UnreachableNodeJoinsAgainSpec to use the same
2013-04-23 11:56:12 +02:00
Endre Sándor Varga
fa5ab15450 Ignoring test that relies on actorFor and remote watch 2013-04-19 14:51:21 +02:00
Endre Sándor Varga
7735cb87b9 Adding workaround for UnreachableNodeJoinsAgain
- see #3247
2013-04-18 20:31:31 +02:00
Patrik Nordwall
d2939fa39a Fix failing UnreachableNodeJoinsAgainSpec
* Shutting down test conductor controller is never a good idea
2013-04-17 22:22:22 +02:00
Patrik Nordwall
4606612bd1 Reliable remote supervision and death watch, see #2993
* RemoteWatcher that monitors node failures, with heartbeats
  and failure detector
* Move RemoteDeploymentWatcher from CARP to RARP
* ClusterRemoteWatcher that handles cluster nodes
* Update documentation
* UID in Heartbeat msg to be able to quarantine;
  the actual quarantining will be implemented
  in ticket 2594
2013-04-17 19:42:51 +02:00
Patrik Nordwall
9e56ab6fe5 Disallow re-joining, see #2873
* Disallow join requests when already part of a cluster
* Remove wipe state when joining, since join can only be
  performed from empty state
* When trying to join, only accept gossip from that member
* Ignore gossips from unknown (and unreachable) members
* Make sure received gossip contains selfAddress
* Test join of fresh node with same host:port
* Remove JoinTwoClustersSpec
* Welcome message as reply to Join
* Retry unsuccessful join request
* AddressUidExtension
* Uid in cluster Member identifier
  To be able to distinguish nodes with same host:port
  after restart.
* Ignore gossip with wrong uid
* Renamed Remove command to Shutdown
* Use uid in vclock identifier
* Update sample, Member apply is private
* Disabled config duration syntax and cleanup of io settings
* Update documentation
2013-04-17 16:48:18 +02:00
drewhk
ac9376d8dd Merge pull request #1320 from drewhk/wip-3156-remotingspec-test-interference-drewhk
Fixed wrongly configured retry-gate to avoid test-interference
2013-04-10 04:43:29 -07:00
Endre Sándor Varga
d97e6eb003 Fixed all occurrences of "retry-latch" 2013-04-09 16:59:30 +02:00
Viktor Klang
e59394628c #3206 - Adding expectTerminated to TestKit 2013-04-09 14:48:17 +02:00
Patrik Nordwall
9270bfa70b Merge pull request #1313 from akka/wip-2797-cluster-router-stop-routees-patriknw
ClusterRouterConfig should stop remote deployed routees, see #2797
2013-04-08 12:38:26 -07:00
Patrik Nordwall
887af975ae Deprecate actorFor in favor of ActorSelection, see #3074
* Deprecate all actorFor methods
* resolveActorRef in provider
* Identify auto receive message
* Support ActorPath in actorSelection
* Support remote actor selections
* Additional tests of actor selection
* Update tests (keep most actorFor tests)
* Update samples to use actorSelection
* Updates to documentation
* Migration guide, including motivation
2013-04-08 18:11:52 +02:00
Patrik Nordwall
7ff77471a1 ClusterRouterConfig should stop remote deployed routees, see #2797 2013-04-08 16:14:21 +02:00
Viktor Klang
c883705242 #3018 - Enabling -Xlint and dealing with the situation that occurs 2013-03-29 01:43:17 +01:00
Patrik Nordwall
806fc0c525 Use awaitAssert in cluster tests, see #3168 2013-03-25 13:08:06 +01:00
Björn Antonsson
6ec5fc6006 Merge pull request #1264 from akka/wip-2930-unreachablenoderejoinsclusterspec-failed-ban
Fixes to Throttler and UnreachableNodeRejoinsCluster #2930
2013-03-25 03:17:18 -07:00
Patrik Nordwall
b738487dc8 Add UID to RemoteActorRef, see #3072
* Sending to a previous incarnation of an actor shall fail,
  to make remote actors work the same way as local ones (in
  the sense that after Terminated() the ref is not working anymore)
* Changed equality of ActorRef to take the uid into account
* Parse uid fragment in RelativeActorPath and ActorPathExtractor
* Handle uid in getChild and in RemoteSystemDaemon
* Use toSerializationFormat and toSerializationFormatWithAddress
  in serialization
* Replaced var uid in ActorCell and ChildRestartStats with
  constructor parameters (path)
* Create the uid in one single place, in makeChild in parent
* Handle ActorRef with and without uid in DeathWatch
* Optimize ActorPath.toString and friends
* Update documentation and migration guide
2013-03-21 20:25:16 +01:00
Björn Antonsson
b6ae5df920 Wait for the second member to become Up. See #3140 2013-03-20 13:29:19 +01:00
Björn Antonsson
5827a27b94 Make joining to the same node multiple times work, and reenable blackhole test. See #2930 2013-03-20 12:22:12 +01:00
Patrik Nordwall
7eac88f372 Cluster node roles, see #3049
* Config of node roles cluster.role
* Cluster router configurable with use-role
* RoleLeaderChanged event
* Cluster singleton per role
* Cluster only starts once all required per-role node
  counts are reached,
  role.<role-name>.min-nr-of-members config
* Update documentation and make use of the roles in the examples
2013-03-18 11:56:11 +01:00
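
The per-role start condition in the entry above (role.<role-name>.min-nr-of-members) can be pictured as a simple count check. The function below is a hypothetical sketch, not Akka code: the `(address, roles)` tuples, the overall `min_nr_of_members` argument, and the `role_min` dict are illustrative assumptions.

```python
from collections import Counter


def can_start(members, min_nr_of_members, role_min):
    """Return True once overall and per-role member counts are satisfied.

    members: list of (address, roles) tuples
    role_min: {role: required number of members with that role}
    """
    if len(members) < min_nr_of_members:
        return False
    # Count each member at most once per role, even if a role is listed twice.
    per_role = Counter(role for _, roles in members for role in set(roles))
    return all(per_role[role] >= needed for role, needed in role_min.items())
```

For example, with one frontend node and two backend nodes, `can_start(members, 3, {"frontend": 1, "backend": 2})` holds, while requiring two frontend nodes would keep the cluster waiting.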
Viktor Klang (√)
05593f5dd8 Merge pull request #1230 from akka/wip-3076-gossip-merge-changes-ban
Don't increment vector-clock on merge and merge locally. See #3076
2013-03-12 08:49:30 -07:00