Merge paradox/scala/cluster-usage.md and java/cluster-usage.md (#23154)
* Identical contents in scala/cluster-usage.md and java/cluster-usage.md for easier review * Replace java/cluster-usage.md with a symlink to scala/cluster-usage.md
This commit is contained in:
parent
17e07c0ae2
commit
48c8c1ab91
4 changed files with 160 additions and 876 deletions
|
|
@ -1,823 +0,0 @@
|
|||
# Cluster Usage
|
||||
|
||||
For introduction to the Akka Cluster concepts please see @ref:[Cluster Specification](common/cluster.md).
|
||||
|
||||
## Preparing Your Project for Clustering
|
||||
|
||||
The Akka cluster is a separate jar file. Make sure that you have the following dependency in your project:
|
||||
|
||||
@@@vars
|
||||
```
|
||||
<dependency>
|
||||
<groupId>com.typesafe.akka</groupId>
|
||||
<artifactId>akka-cluster_$scala.binary_version$</artifactId>
|
||||
<version>$akka.version$</version>
|
||||
</dependency>
|
||||
```
|
||||
@@@
|
||||
|
||||
<a id="cluster-simple-example"></a>
|
||||
## A Simple Cluster Example
|
||||
|
||||
The following configuration enables the `Cluster` extension to be used.
|
||||
It joins the cluster and an actor subscribes to cluster membership events and logs them.
|
||||
|
||||
The `application.conf` configuration looks like this:
|
||||
|
||||
```
|
||||
akka {
|
||||
actor {
|
||||
provider = "cluster"
|
||||
}
|
||||
remote {
|
||||
log-remote-lifecycle-events = off
|
||||
netty.tcp {
|
||||
hostname = "127.0.0.1"
|
||||
port = 0
|
||||
}
|
||||
}
|
||||
|
||||
cluster {
|
||||
seed-nodes = [
|
||||
"akka.tcp://ClusterSystem@127.0.0.1:2551",
|
||||
"akka.tcp://ClusterSystem@127.0.0.1:2552"]
|
||||
|
||||
# auto downing is NOT safe for production deployments.
|
||||
# you may want to use it during development, read more about it in the docs.
|
||||
#
|
||||
# auto-down-unreachable-after = 10s
|
||||
}
|
||||
}
|
||||
|
||||
# Disable legacy metrics in akka-cluster.
|
||||
akka.cluster.metrics.enabled=off
|
||||
|
||||
# Enable metrics extension in akka-cluster-metrics.
|
||||
akka.extensions=["akka.cluster.metrics.ClusterMetricsExtension"]
|
||||
|
||||
# Sigar native library extract location during tests.
|
||||
# Note: use per-jvm-instance folder when running multiple jvm on one host.
|
||||
akka.cluster.metrics.native-library-extract-folder=${user.dir}/target/native
|
||||
```
|
||||
|
||||
To enable cluster capabilities in your Akka project you should, at a minimum, add the @ref:[Remoting](remoting.md)
|
||||
settings, but with `cluster`.
|
||||
The `akka.cluster.seed-nodes` should normally also be added to your `application.conf` file.
|
||||
|
||||
@@@ note
|
||||
|
||||
If you are running Akka in a Docker container or the nodes for some other reason have separate internal and
|
||||
external ip addresses you must configure remoting according to @ref:[Akka behind NAT or in a Docker container](remoting.md#remote-configuration-nat)
|
||||
|
||||
@@@
|
||||
|
||||
The seed nodes are configured contact points for initial, automatic, join of the cluster.
|
||||
|
||||
Note that if you are going to start the nodes on different machines you need to specify the
|
||||
ip-addresses or host names of the machines in `application.conf` instead of `127.0.0.1`
|
||||
|
||||
An actor that uses the cluster extension may look like this:
|
||||
|
||||
@@snip [SimpleClusterListener.java]($code$/java/jdocs/cluster/SimpleClusterListener.java) { type=java }
|
||||
|
||||
The actor registers itself as subscriber of certain cluster events. It receives events corresponding to the current state
|
||||
of the cluster when the subscription starts and then it receives events for changes that happen in the cluster.
|
||||
|
||||
The easiest way to run this example yourself is to download the ready to run
|
||||
@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-java)
|
||||
together with the tutorial. It contains instructions on how to run the `SimpleClusterApp`.
|
||||
The source code of this sample can be found in the @extref[Akka Samples Repository](samples:akka-sample-cluster-java).
|
||||
|
||||
## Joining to Seed Nodes
|
||||
|
||||
You may decide if joining to the cluster should be done manually or automatically
|
||||
to configured initial contact points, so-called seed nodes. When a new node is started
|
||||
it sends a message to all seed nodes and then sends join command to the one that
|
||||
answers first. If no one of the seed nodes replied (might not be started yet)
|
||||
it retries this procedure until successful or shutdown.
|
||||
|
||||
You define the seed nodes in the [configuration](#cluster-configuration) file (application.conf):
|
||||
|
||||
```
|
||||
akka.cluster.seed-nodes = [
|
||||
"akka.tcp://ClusterSystem@host1:2552",
|
||||
"akka.tcp://ClusterSystem@host2:2552"]
|
||||
```
|
||||
|
||||
This can also be defined as Java system properties when starting the JVM using the following syntax:
|
||||
|
||||
```
|
||||
-Dakka.cluster.seed-nodes.0=akka.tcp://ClusterSystem@host1:2552
|
||||
-Dakka.cluster.seed-nodes.1=akka.tcp://ClusterSystem@host2:2552
|
||||
```
|
||||
|
||||
The seed nodes can be started in any order and it is not necessary to have all
|
||||
seed nodes running, but the node configured as the first element in the `seed-nodes`
|
||||
configuration list must be started when initially starting a cluster, otherwise the
|
||||
other seed-nodes will not become initialized and no other node can join the cluster.
|
||||
The reason for the special first seed node is to avoid forming separated islands when
|
||||
starting from an empty cluster.
|
||||
It is quickest to start all configured seed nodes at the same time (order doesn't matter),
|
||||
otherwise it can take up to the configured `seed-node-timeout` until the nodes
|
||||
can join.
|
||||
|
||||
Once more than two seed nodes have been started it is no problem to shut down the first
|
||||
seed node. If the first seed node is restarted, it will first try to join the other
|
||||
seed nodes in the existing cluster.
|
||||
|
||||
If you don't configure seed nodes you need to join the cluster programmatically or manually.
|
||||
|
||||
Manual joining can be performed by using [JMX](#cluster-jmx) or [HTTP](#cluster-http).
|
||||
Joining programmatically can be performed with `Cluster.get(system).join`. Unsuccessful join attempts are
|
||||
automatically retried after the time period defined in configuration property `retry-unsuccessful-join-after`.
|
||||
Retries can be disabled by setting the property to `off`.
|
||||
|
||||
You can join to any node in the cluster. It does not have to be configured as a seed node.
|
||||
Note that you can only join to an existing cluster member, which means that for bootstrapping some
|
||||
node must join itself,and then the following nodes could join them to make up a cluster.
|
||||
|
||||
You may also use `Cluster.get(system).joinSeedNodes` to join programmatically,
|
||||
which is attractive when dynamically discovering other nodes at startup by using some external tool or API.
|
||||
When using `joinSeedNodes` you should not include the node itself except for the node that is
|
||||
supposed to be the first seed node, and that should be placed first in parameter to `joinSeedNodes`.
|
||||
|
||||
Unsuccessful attempts to contact seed nodes are automatically retried after the time period defined in
|
||||
configuration property `seed-node-timeout`. Unsuccessful attempt to join a specific seed node is
|
||||
automatically retried after the configured `retry-unsuccessful-join-after`. Retrying means that it
|
||||
tries to contact all seed nodes and then joins the node that answers first. The first node in the list
|
||||
of seed nodes will join itself if it cannot contact any of the other seed nodes within the
|
||||
configured `seed-node-timeout`.
|
||||
|
||||
An actor system can only join a cluster once. Additional attempts will be ignored.
|
||||
When it has successfully joined it must be restarted to be able to join another
|
||||
cluster or to join the same cluster again. It can use the same host name and port
|
||||
after the restart, when it come up as new incarnation of existing member in the cluster,
|
||||
trying to join in, then the existing one will be removed from the cluster and then it will
|
||||
be allowed to join.
|
||||
|
||||
@@@ note
|
||||
|
||||
The name of the `ActorSystem` must be the same for all members of a cluster. The name is given
|
||||
when you start the `ActorSystem`.
|
||||
|
||||
@@@
|
||||
|
||||
<a id="automatic-vs-manual-downing"></a>
|
||||
## Downing
|
||||
|
||||
When a member is considered by the failure detector to be unreachable the
|
||||
leader is not allowed to perform its duties, such as changing status of
|
||||
new joining members to 'Up'. The node must first become reachable again, or the
|
||||
status of the unreachable member must be changed to 'Down'. Changing status to 'Down'
|
||||
can be performed automatically or manually. By default it must be done manually, using
|
||||
[JMX](#cluster-jmx) or [HTTP](#cluster-http).
|
||||
|
||||
It can also be performed programmatically with `Cluster.get(system).down(address)`.
|
||||
|
||||
A pre-packaged solution for the downing problem is provided by
|
||||
[Split Brain Resolver](http://developer.lightbend.com/docs/akka-commercial-addons/current/split-brain-resolver.html),
|
||||
which is part of the [Lightbend Reactive Platform](http://www.lightbend.com/platform).
|
||||
If you don’t use RP, you should anyway carefully read the [documentation](http://developer.lightbend.com/docs/akka-commercial-addons/current/split-brain-resolver.html)
|
||||
of the Split Brain Resolver and make sure that the solution you are using handles the concerns
|
||||
described there.
|
||||
|
||||
### Auto-downing (DO NOT USE)
|
||||
|
||||
There is an atomatic downing feature that you should not use in production. For testing purpose you can enable it with configuration:
|
||||
|
||||
```
|
||||
akka.cluster.auto-down-unreachable-after = 120s
|
||||
```
|
||||
|
||||
This means that the cluster leader member will change the `unreachable` node
|
||||
status to `down` automatically after the configured time of unreachability.
|
||||
|
||||
This is a naïve approach to remove unreachable nodes from the cluster membership. It
|
||||
works great for crashes and short transient network partitions, but not for long network
|
||||
partitions. Both sides of the network partition will see the other side as unreachable
|
||||
and after a while remove it from its cluster membership. Since this happens on both
|
||||
sides the result is that two separate disconnected clusters have been created. This
|
||||
can also happen because of long GC pauses or system overload.
|
||||
|
||||
@@@ warning
|
||||
|
||||
We recommend against using the auto-down feature of Akka Cluster in production.
|
||||
This is crucial for correct behavior if you use @ref:[Cluster Singleton](cluster-singleton.md) or
|
||||
@ref:[Cluster Sharding](cluster-sharding.md), especially together with Akka @ref:[Persistence](persistence.md).
|
||||
For Akka Persistence with Cluster Sharding it can result in corrupt data in case
|
||||
of network partitions.
|
||||
|
||||
@@@
|
||||
|
||||
## Leaving
|
||||
|
||||
There are two ways to remove a member from the cluster.
|
||||
|
||||
You can just stop the actor system (or the JVM process). It will be detected
|
||||
as unreachable and removed after the automatic or manual downing as described
|
||||
above.
|
||||
|
||||
A more graceful exit can be performed if you tell the cluster that a node shall leave.
|
||||
This can be performed using [JMX](#cluster-jmx) or [HTTP](#cluster-http).
|
||||
It can also be performed programmatically with:
|
||||
|
||||
@@snip [ClusterDocTest.java]($code$/java/jdocs/cluster/ClusterDocTest.java) { #leave }
|
||||
|
||||
Note that this command can be issued to any member in the cluster, not necessarily the
|
||||
one that is leaving.
|
||||
|
||||
The @ref:[Coordinated Shutdown](actors.md#coordinated-shutdown) will automatically run when the cluster node sees itself as
|
||||
`Exiting`, i.e. leaving from another node will trigger the shutdown process on the leaving node.
|
||||
Tasks for graceful leaving of cluster including graceful shutdown of Cluster Singletons and
|
||||
Cluster Sharding are added automatically when Akka Cluster is used, i.e. running the shutdown
|
||||
process will also trigger the graceful leaving if it's not already in progress.
|
||||
|
||||
Normally this is handled automatically, but in case of network failures during this process it might still
|
||||
be necessary to set the node’s status to `Down` in order to complete the removal.
|
||||
|
||||
<a id="weakly-up"></a>
|
||||
## WeaklyUp Members
|
||||
|
||||
If a node is `unreachable` then gossip convergence is not possible and therefore any
|
||||
`leader` actions are also not possible. However, we still might want new nodes to join
|
||||
the cluster in this scenario.
|
||||
|
||||
`Joining` members will be promoted to `WeaklyUp` and become part of the cluster if
|
||||
convergence can't be reached. Once gossip convergence is reached, the leader will move `WeaklyUp`
|
||||
members to `Up`.
|
||||
|
||||
This feature is enabled by default, but it can be disabled with configuration option:
|
||||
|
||||
```
|
||||
akka.cluster.allow-weakly-up-members = off
|
||||
```
|
||||
|
||||
You can subscribe to the `WeaklyUp` membership event to make use of the members that are
|
||||
in this state, but you should be aware of that members on the other side of a network partition
|
||||
have no knowledge about the existence of the new members. You should for example not count
|
||||
`WeaklyUp` members in quorum decisions.
|
||||
|
||||
<a id="cluster-subscriber"></a>
|
||||
## Subscribe to Cluster Events
|
||||
|
||||
You can subscribe to change notifications of the cluster membership by using
|
||||
`Cluster.get(system).subscribe`.
|
||||
|
||||
@@snip [SimpleClusterListener2.java]($code$/java/jdocs/cluster/SimpleClusterListener2.java) { #subscribe }
|
||||
|
||||
A snapshot of the full state, `akka.cluster.ClusterEvent.CurrentClusterState`, is sent to the subscriber
|
||||
as the first message, followed by events for incremental updates.
|
||||
|
||||
Note that you may receive an empty `CurrentClusterState`, containing no members,
|
||||
if you start the subscription before the initial join procedure has completed.
|
||||
This is expected behavior. When the node has been accepted in the cluster you will
|
||||
receive `MemberUp` for that node, and other nodes.
|
||||
|
||||
If you find it inconvenient to handle the `CurrentClusterState` you can use
|
||||
`ClusterEvent.initialStateAsEvents()` as parameter to `subscribe`.
|
||||
That means that instead of receiving `CurrentClusterState` as the first message you will receive
|
||||
the events corresponding to the current state to mimic what you would have seen if you were
|
||||
listening to the events when they occurred in the past. Note that those initial events only correspond
|
||||
to the current state and it is not the full history of all changes that actually has occurred in the cluster.
|
||||
|
||||
@@snip [SimpleClusterListener.java]($code$/java/jdocs/cluster/SimpleClusterListener.java) { #subscribe }
|
||||
|
||||
The events to track the life-cycle of members are:
|
||||
|
||||
* `ClusterEvent.MemberJoined` - A new member has joined the cluster and its status has been changed to `Joining`.
|
||||
* `ClusterEvent.MemberUp` - A new member has joined the cluster and its status has been changed to `Up`.
|
||||
* `ClusterEvent.MemberExited` - A member is leaving the cluster and its status has been changed to `Exiting`
|
||||
Note that the node might already have been shutdown when this event is published on another node.
|
||||
* `ClusterEvent.MemberRemoved` - Member completely removed from the cluster.
|
||||
* `ClusterEvent.UnreachableMember` - A member is considered as unreachable, detected by the failure detector
|
||||
of at least one other node.
|
||||
* `ClusterEvent.ReachableMember` - A member is considered as reachable again, after having been unreachable.
|
||||
All nodes that previously detected it as unreachable has detected it as reachable again.
|
||||
|
||||
There are more types of change events, consult the API documentation
|
||||
of classes that extends `akka.cluster.ClusterEvent.ClusterDomainEvent`
|
||||
for details about the events.
|
||||
|
||||
Instead of subscribing to cluster events it can sometimes be convenient to only get the full membership state with
|
||||
`Cluster.get(system).state()`. Note that this state is not necessarily in sync with the events published to a
|
||||
cluster subscription.
|
||||
|
||||
### Worker Dial-in Example
|
||||
|
||||
Let's take a look at an example that illustrates how workers, here named *backend*,
|
||||
can detect and register to new master nodes, here named *frontend*.
|
||||
|
||||
The example application provides a service to transform text. When some text
|
||||
is sent to one of the frontend services, it will be delegated to one of the
|
||||
backend workers, which performs the transformation job, and sends the result back to
|
||||
the original client. New backend nodes, as well as new frontend nodes, can be
|
||||
added or removed to the cluster dynamically.
|
||||
|
||||
Messages:
|
||||
|
||||
@@snip [TransformationMessages.java]($code$/java/jdocs/cluster/TransformationMessages.java) { #messages }
|
||||
|
||||
The backend worker that performs the transformation job:
|
||||
|
||||
@@snip [TransformationBackend.java]($code$/java/jdocs/cluster/TransformationBackend.java) { #backend }
|
||||
|
||||
Note that the `TransformationBackend` actor subscribes to cluster events to detect new,
|
||||
potential, frontend nodes, and send them a registration message so that they know
|
||||
that they can use the backend worker.
|
||||
|
||||
The frontend that receives user jobs and delegates to one of the registered backend workers:
|
||||
|
||||
@@snip [TransformationFrontend.java]($code$/java/jdocs/cluster/TransformationFrontend.java) { #frontend }
|
||||
|
||||
Note that the `TransformationFrontend` actor watch the registered backend
|
||||
to be able to remove it from its list of available backend workers.
|
||||
Death watch uses the cluster failure detector for nodes in the cluster, i.e. it detects
|
||||
network failures and JVM crashes, in addition to graceful termination of watched
|
||||
actor. Death watch generates the `Terminated` message to the watching actor when the
|
||||
unreachable cluster node has been downed and removed.
|
||||
|
||||
The Akka sample named
|
||||
[Akka Cluster Sample with Java](https://github.com/akka/akka-samples/tree/master/akka-sample-cluster-java).
|
||||
contains the full source code and instructions of how to run the **Worker Dial-in Example**.
|
||||
|
||||
## Node Roles
|
||||
|
||||
Not all nodes of a cluster need to perform the same function: there might be one sub-set which runs the web front-end,
|
||||
one which runs the data access layer and one for the number-crunching. Deployment of actors—for example by cluster-aware
|
||||
routers—can take node roles into account to achieve this distribution of responsibilities.
|
||||
|
||||
The roles of a node is defined in the configuration property named `akka.cluster.roles`
|
||||
and it is typically defined in the start script as a system property or environment variable.
|
||||
|
||||
The roles of the nodes is part of the membership information in `MemberEvent` that you can subscribe to.
|
||||
|
||||
<a id="min-members"></a>
|
||||
## How To Startup when Cluster Size Reached
|
||||
|
||||
A common use case is to start actors after the cluster has been initialized,
|
||||
members have joined, and the cluster has reached a certain size.
|
||||
|
||||
With a configuration option you can define required number of members
|
||||
before the leader changes member status of 'Joining' members to 'Up'.:
|
||||
|
||||
```
|
||||
akka.cluster.min-nr-of-members = 3
|
||||
```
|
||||
|
||||
In a similar way you can define required number of members of a certain role
|
||||
before the leader changes member status of 'Joining' members to 'Up'.:
|
||||
|
||||
```
|
||||
akka.cluster.role {
|
||||
frontend.min-nr-of-members = 1
|
||||
backend.min-nr-of-members = 2
|
||||
}
|
||||
```
|
||||
|
||||
You can start the actors in a `registerOnMemberUp` callback, which will
|
||||
be invoked when the current member status is changed to 'Up', i.e. the cluster
|
||||
has at least the defined number of members.
|
||||
|
||||
@@snip [FactorialFrontendMain.java]($code$/java/jdocs/cluster/FactorialFrontendMain.java) { #registerOnUp }
|
||||
|
||||
This callback can be used for other things than starting actors.
|
||||
|
||||
## How To Cleanup when Member is Removed
|
||||
|
||||
You can do some clean up in a `registerOnMemberRemoved` callback, which will
|
||||
be invoked when the current member status is changed to 'Removed' or the cluster have been shutdown.
|
||||
|
||||
An alternative is to register tasks to the @ref:[Coordinated Shutdown](actors.md#coordinated-shutdown).
|
||||
|
||||
@@@ note
|
||||
|
||||
Register a OnMemberRemoved callback on a cluster that have been shutdown, the callback will be invoked immediately on
|
||||
the caller thread, otherwise it will be invoked later when the current member status changed to 'Removed'. You may
|
||||
want to install some cleanup handling after the cluster was started up, but the cluster might already be shutting
|
||||
down when you installing, and depending on the race is not healthy.
|
||||
|
||||
@@@
|
||||
|
||||
## Cluster Singleton
|
||||
|
||||
For some use cases it is convenient and sometimes also mandatory to ensure that
|
||||
you have exactly one actor of a certain type running somewhere in the cluster.
|
||||
|
||||
This can be implemented by subscribing to member events, but there are several corner
|
||||
cases to consider. Therefore, this specific use case is made easily accessible by the
|
||||
@ref:[Cluster Singleton](cluster-singleton.md).
|
||||
|
||||
## Cluster Sharding
|
||||
|
||||
Distributes actors across several nodes in the cluster and supports interaction
|
||||
with the actors using their logical identifier, but without having to care about
|
||||
their physical location in the cluster.
|
||||
|
||||
See @ref:[Cluster Sharding](cluster-sharding.md).
|
||||
|
||||
## Distributed Publish Subscribe
|
||||
|
||||
Publish-subscribe messaging between actors in the cluster, and point-to-point messaging
|
||||
using the logical path of the actors, i.e. the sender does not have to know on which
|
||||
node the destination actor is running.
|
||||
|
||||
See @ref:[Distributed Publish Subscribe in Cluster](distributed-pub-sub.md).
|
||||
|
||||
## Cluster Client
|
||||
|
||||
Communication from an actor system that is not part of the cluster to actors running
|
||||
somewhere in the cluster. The client does not have to know on which node the destination
|
||||
actor is running.
|
||||
|
||||
See @ref:[Cluster Client](cluster-client.md).
|
||||
|
||||
## Distributed Data
|
||||
|
||||
*Akka Distributed Data* is useful when you need to share data between nodes in an
|
||||
Akka Cluster. The data is accessed with an actor providing a key-value store like API.
|
||||
|
||||
See @ref:[Distributed Data](distributed-data.md).
|
||||
|
||||
## Failure Detector
|
||||
|
||||
In a cluster each node is monitored by a few (default maximum 5) other nodes, and when
|
||||
any of these detects the node as `unreachable` that information will spread to
|
||||
the rest of the cluster through the gossip. In other words, only one node needs to
|
||||
mark a node `unreachable` to have the rest of the cluster mark that node `unreachable`.
|
||||
|
||||
The failure detector will also detect if the node becomes `reachable` again. When
|
||||
all nodes that monitored the `unreachable` node detects it as `reachable` again
|
||||
the cluster, after gossip dissemination, will consider it as `reachable`.
|
||||
|
||||
If system messages cannot be delivered to a node it will be quarantined and then it
|
||||
cannot come back from `unreachable`. This can happen if the there are too many
|
||||
unacknowledged system messages (e.g. watch, Terminated, remote actor deployment,
|
||||
failures of actors supervised by remote parent). Then the node needs to be moved
|
||||
to the `down` or `removed` states and the actor system of the quarantined node
|
||||
must be restarted before it can join the cluster again.
|
||||
|
||||
The nodes in the cluster monitor each other by sending heartbeats to detect if a node is
|
||||
unreachable from the rest of the cluster. The heartbeat arrival times is interpreted
|
||||
by an implementation of
|
||||
[The Phi Accrual Failure Detector](http://www.jaist.ac.jp/~defago/files/pdf/IS_RR_2004_010.pdf).
|
||||
|
||||
The suspicion level of failure is given by a value called *phi*.
|
||||
The basic idea of the phi failure detector is to express the value of *phi* on a scale that
|
||||
is dynamically adjusted to reflect current network conditions.
|
||||
|
||||
The value of *phi* is calculated as:
|
||||
|
||||
```
|
||||
phi = -log10(1 - F(timeSinceLastHeartbeat))
|
||||
```
|
||||
|
||||
where F is the cumulative distribution function of a normal distribution with mean
|
||||
and standard deviation estimated from historical heartbeat inter-arrival times.
|
||||
|
||||
In the [configuration](#cluster-configuration) you can adjust the `akka.cluster.failure-detector.threshold`
|
||||
to define when a *phi* value is considered to be a failure.
|
||||
|
||||
A low `threshold` is prone to generate many false positives but ensures
|
||||
a quick detection in the event of a real crash. Conversely, a high `threshold`
|
||||
generates fewer mistakes but needs more time to detect actual crashes. The
|
||||
default `threshold` is 8 and is appropriate for most situations. However in
|
||||
cloud environments, such as Amazon EC2, the value could be increased to 12 in
|
||||
order to account for network issues that sometimes occur on such platforms.
|
||||
|
||||
The following chart illustrates how *phi* increase with increasing time since the
|
||||
previous heartbeat.
|
||||
|
||||

|
||||
|
||||
Phi is calculated from the mean and standard deviation of historical
|
||||
inter arrival times. The previous chart is an example for standard deviation
|
||||
of 200 ms. If the heartbeats arrive with less deviation the curve becomes steeper,
|
||||
i.e. it is possible to determine failure more quickly. The curve looks like this for
|
||||
a standard deviation of 100 ms.
|
||||
|
||||

|
||||
|
||||
To be able to survive sudden abnormalities, such as garbage collection pauses and
|
||||
transient network failures the failure detector is configured with a margin,
|
||||
`akka.cluster.failure-detector.acceptable-heartbeat-pause`. You may want to
|
||||
adjust the [configuration](#cluster-configuration) of this depending on you environment.
|
||||
This is how the curve looks like for `acceptable-heartbeat-pause` configured to
|
||||
3 seconds.
|
||||
|
||||

|
||||
|
||||
Death watch uses the cluster failure detector for nodes in the cluster, i.e. it detects
|
||||
network failures and JVM crashes, in addition to graceful termination of watched
|
||||
actor. Death watch generates the `Terminated` message to the watching actor when the
|
||||
unreachable cluster node has been downed and removed.
|
||||
|
||||
If you encounter suspicious false positives when the system is under load you should
|
||||
define a separate dispatcher for the cluster actors as described in [Cluster Dispatcher](#cluster-dispatcher).
|
||||
|
||||
## Cluster Aware Routers
|
||||
|
||||
All @ref:[routers](routing.md) can be made aware of member nodes in the cluster, i.e.
|
||||
deploying new routees or looking up routees on nodes in the cluster.
|
||||
When a node becomes unreachable or leaves the cluster the routees of that node are
|
||||
automatically unregistered from the router. When new nodes join the cluster additional
|
||||
routees are added to the router, according to the configuration. Routees are also added
|
||||
when a node becomes reachable again, after having been unreachable.
|
||||
|
||||
Cluster aware routers make use of members with status [WeaklyUp](#weakly-up).
|
||||
|
||||
There are two distinct types of routers.
|
||||
|
||||
* **Group - router that sends messages to the specified path using actor selection**
|
||||
The routees can be shared between routers running on different nodes in the cluster.
|
||||
One example of a use case for this type of router is a service running on some backend
|
||||
nodes in the cluster and used by routers running on front-end nodes in the cluster.
|
||||
* **Pool - router that creates routees as child actors and deploys them on remote nodes.**
|
||||
Each router will have its own routee instances. For example, if you start a router
|
||||
on 3 nodes in a 10 nodes cluster you will have 30 routee actors in total if the router is
|
||||
configured to use one instance per node. The routees created by the different routers
|
||||
will not be shared between the routers. One example of a use case for this type of router
|
||||
is a single master that coordinate jobs and delegates the actual work to routees running
|
||||
on other nodes in the cluster.
|
||||
|
||||
### Router with Group of Routees
|
||||
|
||||
When using a `Group` you must start the routee actors on the cluster member nodes.
|
||||
That is not done by the router. The configuration for a group looks like this::
|
||||
|
||||
```
|
||||
akka.actor.deployment {
|
||||
/statsService/workerRouter {
|
||||
router = consistent-hashing-group
|
||||
routees.paths = ["/user/statsWorker"]
|
||||
cluster {
|
||||
enabled = on
|
||||
allow-local-routees = on
|
||||
use-role = compute
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
@@@ note
|
||||
|
||||
The routee actors should be started as early as possible when starting the actor system, because
|
||||
the router will try to use them as soon as the member status is changed to 'Up'.
|
||||
|
||||
@@@
|
||||
|
||||
The actor paths without address information that are defined in `routees.paths` are used for selecting the
|
||||
actors to which the messages will be forwarded to by the router.
|
||||
Messages will be forwarded to the routees using @ref:[ActorSelection](actors.md#actorselection), so the same delivery semantics should be expected.
|
||||
It is possible to limit the lookup of routees to member nodes tagged with a certain role by specifying `use-role`.
|
||||
|
||||
`max-total-nr-of-instances` defines total number of routees in the cluster. By default `max-total-nr-of-instances`
|
||||
is set to a high value (10000) that will result in new routees added to the router when nodes join the cluster.
|
||||
Set it to a lower value if you want to limit total number of routees.
|
||||
|
||||
The same type of router could also have been defined in code:
|
||||
|
||||
@@snip [StatsService.java]($code$/java/jdocs/cluster/StatsService.java) { #router-lookup-in-code }
|
||||
|
||||
See [configuration](#cluster-configuration) section for further descriptions of the settings.
|
||||
|
||||
### Router Example with Group of Routees
|
||||
|
||||
Let's take a look at how to use a cluster aware router with a group of routees,
|
||||
i.e. router sending to the paths of the routees.
|
||||
|
||||
The example application provides a service to calculate statistics for a text.
|
||||
When some text is sent to the service it splits it into words, and delegates the task
|
||||
to count number of characters in each word to a separate worker, a routee of a router.
|
||||
The character count for each word is sent back to an aggregator that calculates
|
||||
the average number of characters per word when all results have been collected.
|
||||
|
||||
Messages:
|
||||
|
||||
@@snip [StatsMessages.java]($code$/java/jdocs/cluster/StatsMessages.java) { #messages }
|
||||
|
||||
The worker that counts number of characters in each word:
|
||||
|
||||
@@snip [StatsWorker.java]($code$/java/jdocs/cluster/StatsWorker.java) { #worker }
|
||||
|
||||
The service that receives text from users and splits it up into words, delegates to workers and aggregates:
|
||||
|
||||
@@snip [StatsService.java]($code$/java/jdocs/cluster/StatsService.java) { #service }
|
||||
|
||||
@@snip [StatsAggregator.java]($code$/java/jdocs/cluster/StatsAggregator.java) { #aggregator }
|
||||
|
||||
Note, nothing cluster specific so far, just plain actors.
|
||||
|
||||
All nodes start `StatsService` and `StatsWorker` actors. Remember, routees are the workers in this case.
|
||||
The router is configured with `routees.paths`::
|
||||
|
||||
```
|
||||
akka.actor.deployment {
|
||||
/statsService/workerRouter {
|
||||
router = consistent-hashing-group
|
||||
routees.paths = ["/user/statsWorker"]
|
||||
cluster {
|
||||
enabled = on
|
||||
allow-local-routees = on
|
||||
use-role = compute
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
This means that user requests can be sent to `StatsService` on any node and it will use
|
||||
`StatsWorker` on all nodes.
|
||||
|
||||
The Akka sample named
|
||||
[Akka Cluster Sample with Java](https://github.com/akka/akka-samples/tree/master/akka-sample-cluster-java).
|
||||
contains the full source code and instructions of how to run the **Router Example with Group of Routees**.
|
||||
|
||||
### Router with Pool of Remote Deployed Routees
|
||||
|
||||
When using a `Pool` with routees created and deployed on the cluster member nodes
|
||||
the configuration for a router looks like this::
|
||||
|
||||
```
|
||||
akka.actor.deployment {
|
||||
/statsService/singleton/workerRouter {
|
||||
router = consistent-hashing-pool
|
||||
cluster {
|
||||
enabled = on
|
||||
max-nr-of-instances-per-node = 3
|
||||
allow-local-routees = on
|
||||
use-role = compute
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
It is possible to limit the deployment of routees to member nodes tagged with a certain role by
|
||||
specifying `use-role`.
|
||||
|
||||
`max-total-nr-of-instances` defines total number of routees in the cluster, but the number of routees
|
||||
per node, `max-nr-of-instances-per-node`, will not be exceeded. By default `max-total-nr-of-instances`
|
||||
is set to a high value (10000) that will result in new routees added to the router when nodes join the cluster.
|
||||
Set it to a lower value if you want to limit total number of routees.
|
||||
|
||||
The same type of router could also have been defined in code:
|
||||
|
||||
@@snip [StatsService.java]($code$/java/jdocs/cluster/StatsService.java) { #router-deploy-in-code }
|
||||
|
||||
See [configuration](#cluster-configuration) section for further descriptions of the settings.
|
||||
|
||||
### Router Example with Pool of Remote Deployed Routees
|
||||
|
||||
Let's take a look at how to use a cluster aware router on single master node that creates
|
||||
and deploys workers. To keep track of a single master we use the @ref:[Cluster Singleton](cluster-singleton.md)
|
||||
in the cluster-tools module. The `ClusterSingletonManager` is started on each node.
|
||||
|
||||
@@snip [StatsSampleOneMasterMain.java]($code$/java/jdocs/cluster/StatsSampleOneMasterMain.java) { #create-singleton-manager }
|
||||
|
||||
We also need an actor on each node that keeps track of where current single master exists and
|
||||
delegates jobs to the `StatsService`. That is provided by the `ClusterSingletonProxy`.
|
||||
|
||||
@@snip [StatsSampleOneMasterMain.java]($code$/java/jdocs/cluster/StatsSampleOneMasterMain.java) { #singleton-proxy }
|
||||
|
||||
The `ClusterSingletonProxy` receives text from users and delegates to the current `StatsService`, the single
|
||||
master. It listens to cluster events to lookup the `StatsService` on the oldest node.
|
||||
|
||||
All nodes start `ClusterSingletonProxy` and the `ClusterSingletonManager`. The router is now configured like this::
|
||||
|
||||
```
|
||||
akka.actor.deployment {
|
||||
/statsService/singleton/workerRouter {
|
||||
router = consistent-hashing-pool
|
||||
cluster {
|
||||
enabled = on
|
||||
max-nr-of-instances-per-node = 3
|
||||
allow-local-routees = on
|
||||
use-role = compute
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
The Akka sample named
|
||||
[Akka Cluster Sample with Java](https://github.com/akka/akka-samples/tree/master/akka-sample-cluster-java).
|
||||
contains the full source code and instructions of how to run the **Router Example with Pool of Remote Deployed Routees**.
|
||||
|
||||
## Cluster Metrics
|
||||
|
||||
The member nodes of the cluster can collect system health metrics and publish that to other cluster nodes
|
||||
and to the registered subscribers on the system event bus with the help of `cluster-metrics`.
|
||||
|
||||
## Management
|
||||
|
||||
<a id="cluster-http"></a>
|
||||
### HTTP
|
||||
|
||||
Information and management of the cluster is available with a HTTP API.
|
||||
See documentation of [Akka Management](http://developer.lightbend.com/docs/akka-management/current/).
|
||||
|
||||
<a id="cluster-jmx"></a>
|
||||
### JMX
|
||||
|
||||
Information and management of the cluster is available as JMX MBeans with the root name `akka.Cluster`.
|
||||
The JMX information can be displayed with an ordinary JMX console such as JConsole or JVisualVM.
|
||||
|
||||
From JMX you can:
|
||||
|
||||
* see what members that are part of the cluster
|
||||
* see status of this node
|
||||
* see roles of each member
|
||||
* join this node to another node in cluster
|
||||
* mark any node in the cluster as down
|
||||
* tell any node in the cluster to leave
|
||||
|
||||
Member nodes are identified by their address, in format *akka.<protocol>://<actor-system-name>@<hostname>:<port>*.
|
||||
|
||||
<a id="cluster-command-line"></a>
|
||||
### Command Line
|
||||
|
||||
@@@ warning
|
||||
|
||||
**Deprecation warning** - The command line script has been deprecated and is scheduled for removal
|
||||
in the next major version. Use the [HTTP management](#cluster-http) API with [curl](https://curl.haxx.se/)
|
||||
or similar instead.
|
||||
|
||||
@@@
|
||||
|
||||
The cluster can be managed with the script `akka-cluster` provided in the Akka GitHub repository @extref[here](github:akka-cluster/jmx-client). Place the script and the `jmxsh-R5.jar` library in the same directory.
|
||||
|
||||
Run it without parameters to see instructions about how to use the script:
|
||||
|
||||
```
|
||||
Usage: ./akka-cluster <node-hostname> <jmx-port> <command> ...
|
||||
|
||||
Supported commands are:
|
||||
join <node-url> - Sends request a JOIN node with the specified URL
|
||||
leave <node-url> - Sends a request for node with URL to LEAVE the cluster
|
||||
down <node-url> - Sends a request for marking node with URL as DOWN
|
||||
member-status - Asks the member node for its current status
|
||||
members - Asks the cluster for addresses of current members
|
||||
unreachable - Asks the cluster for addresses of unreachable members
|
||||
cluster-status - Asks the cluster for its current status (member ring,
|
||||
unavailable nodes, meta data etc.)
|
||||
leader - Asks the cluster who the current leader is
|
||||
is-singleton - Checks if the cluster is a singleton cluster (single
|
||||
node cluster)
|
||||
is-available - Checks if the member node is available
|
||||
Where the <node-url> should be on the format of
|
||||
'akka.<protocol>://<actor-system-name>@<hostname>:<port>'
|
||||
|
||||
Examples: ./akka-cluster localhost 9999 is-available
|
||||
./akka-cluster localhost 9999 join akka.tcp://MySystem@darkstar:2552
|
||||
./akka-cluster localhost 9999 cluster-status
|
||||
```
|
||||
|
||||
To be able to use the script you must enable remote monitoring and management when starting the JVMs of the cluster nodes,
|
||||
as described in [Monitoring and Management Using JMX Technology](http://docs.oracle.com/javase/8/jdocs/technotes/guides/management/agent.html).
|
||||
Make sure you understand the security implications of enabling remote monitoring and management.
|
||||
|
||||
<a id="cluster-configuration"></a>
|
||||
## Configuration
|
||||
|
||||
There are several configuration properties for the cluster. We refer to the
|
||||
@ref:[reference configuration](general/configuration.md#config-akka-cluster) for more information.
|
||||
|
||||
### Cluster Info Logging
|
||||
|
||||
You can silence the logging of cluster events at info level with configuration property:
|
||||
|
||||
```
|
||||
akka.cluster.log-info = off
|
||||
```
|
||||
|
||||
<a id="cluster-dispatcher"></a>
|
||||
### Cluster Dispatcher
|
||||
|
||||
Under the hood the cluster extension is implemented with actors and it can be necessary
|
||||
to create a bulkhead for those actors to avoid disturbance from other actors. Especially
|
||||
the heartbeating actors that is used for failure detection can generate false positives
|
||||
if they are not given a chance to run at regular intervals.
|
||||
For this purpose you can define a separate dispatcher to be used for the cluster actors:
|
||||
|
||||
```
|
||||
akka.cluster.use-dispatcher = cluster-dispatcher
|
||||
|
||||
cluster-dispatcher {
|
||||
type = "Dispatcher"
|
||||
executor = "fork-join-executor"
|
||||
fork-join-executor {
|
||||
parallelism-min = 2
|
||||
parallelism-max = 4
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
@@@ note
|
||||
|
||||
Normally it should not be necessary to configure a separate dispatcher for the Cluster.
|
||||
The default-dispatcher should be sufficient for performing the Cluster tasks, i.e. `akka.cluster.use-dispatcher`
|
||||
should not be changed. If you have Cluster related problems when using the default-dispatcher that is typically an
|
||||
indication that you are running blocking or CPU intensive actors/tasks on the default-dispatcher.
|
||||
Use dedicated dispatchers for such actors/tasks instead of running them on the default-dispatcher,
|
||||
because that may starve system internal tasks.
|
||||
Related config properties: `akka.cluster.use-dispatcher = akka.cluster.cluster-dispatcher`.
|
||||
Corresponding default values: `akka.cluster.use-dispatcher =`.
|
||||
|
||||
@@@
|
||||
1
akka-docs/src/main/paradox/java/cluster-usage.md
Symbolic link
1
akka-docs/src/main/paradox/java/cluster-usage.md
Symbolic link
|
|
@ -0,0 +1 @@
|
|||
../scala/cluster-usage.md
|
||||
1
akka-docs/src/main/paradox/java/multi-jvm-testing.md
Symbolic link
1
akka-docs/src/main/paradox/java/multi-jvm-testing.md
Symbolic link
|
|
@ -0,0 +1 @@
|
|||
../scala/multi-jvm-testing.md
|
||||
1
akka-docs/src/main/paradox/java/multi-node-testing.md
Symbolic link
1
akka-docs/src/main/paradox/java/multi-node-testing.md
Symbolic link
|
|
@ -0,0 +1 @@
|
|||
../scala/multi-node-testing.md
|
||||
|
|
@ -6,11 +6,23 @@ For introduction to the Akka Cluster concepts please see @ref:[Cluster Specifica
|
|||
|
||||
The Akka cluster is a separate jar file. Make sure that you have the following dependency in your project:
|
||||
|
||||
@@@vars
|
||||
```
|
||||
"com.typesafe.akka" %% "akka-cluster" % "$akka.version$"
|
||||
```
|
||||
@@@
|
||||
Scala
|
||||
: @@@vars
|
||||
```
|
||||
"com.typesafe.akka" %% "akka-cluster" % "$akka.version$"
|
||||
```
|
||||
@@@
|
||||
|
||||
Java
|
||||
: @@@vars
|
||||
```
|
||||
<dependency>
|
||||
<groupId>com.typesafe.akka</groupId>
|
||||
<artifactId>akka-cluster_$scala.binary_version$</artifactId>
|
||||
<version>$akka.version$</version>
|
||||
</dependency>
|
||||
```
|
||||
@@@
|
||||
|
||||
## A Simple Cluster Example
|
||||
|
||||
|
|
@ -73,15 +85,21 @@ ip-addresses or host names of the machines in `application.conf` instead of `127
|
|||
|
||||
An actor that uses the cluster extension may look like this:
|
||||
|
||||
@@snip [SimpleClusterListener.scala]($code$/scala/docs/cluster/SimpleClusterListener.scala) { type=scala }
|
||||
Scala
|
||||
: @@snip [SimpleClusterListener.scala]($code$/scala/docs/cluster/SimpleClusterListener.scala) { type=scala }
|
||||
|
||||
Java
|
||||
: @@snip [SimpleClusterListener.java]($code$/java/jdocs/cluster/SimpleClusterListener.java) { type=java }
|
||||
|
||||
The actor registers itself as subscriber of certain cluster events. It receives events corresponding to the current state
|
||||
of the cluster when the subscription starts and then it receives events for changes that happen in the cluster.
|
||||
|
||||
The easiest way to run this example yourself is to download the ready to run
|
||||
@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-scala)
|
||||
@scala[@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-scala)]
|
||||
@java[@extref[Akka Cluster Sample with Java](ecs:akka-samples-cluster-java)]
|
||||
together with the tutorial. It contains instructions on how to run the `SimpleClusterApp`.
|
||||
The source code of this sample can be found in the @extref[Akka Samples Repository](samples:akka-sample-cluster-scala).
|
||||
The source code of this sample can be found in the
|
||||
@scala[@extref[Akka Samples Repository](samples:akka-sample-cluster-scala)]@java[@extref[Akka Samples Repository](samples:akka-sample-cluster-java)].
|
||||
|
||||
## Joining to Seed Nodes
|
||||
|
||||
|
|
@ -123,7 +141,7 @@ seed nodes in the existing cluster.
|
|||
If you don't configure seed nodes you need to join the cluster programmatically or manually.
|
||||
|
||||
Manual joining can be performed by using [JMX](#cluster-jmx) or [HTTP](#cluster-http).
|
||||
Joining programmatically can be performed with `Cluster(system).join`. Unsuccessful join attempts are
|
||||
Joining programmatically can be performed with @scala[`Cluster(system).join`]@java[`Cluster.get(system).join`]. Unsuccessful join attempts are
|
||||
automatically retried after the time period defined in configuration property `retry-unsuccessful-join-after`.
|
||||
Retries can be disabled by setting the property to `off`.
|
||||
|
||||
|
|
@ -131,7 +149,7 @@ You can join to any node in the cluster. It does not have to be configured as a
|
|||
Note that you can only join to an existing cluster member, which means that for bootstrapping some
|
||||
node must join itself,and then the following nodes could join them to make up a cluster.
|
||||
|
||||
You may also use `Cluster(system).joinSeedNodes` to join programmatically,
|
||||
You may also use @scala[`Cluster(system).joinSeedNodes`]@java[`Cluster.get(system).joinSeedNodes`] to join programmatically,
|
||||
which is attractive when dynamically discovering other nodes at startup by using some external tool or API.
|
||||
When using `joinSeedNodes` you should not include the node itself except for the node that is
|
||||
supposed to be the first seed node, and that should be placed first in parameter to
|
||||
|
|
@ -146,7 +164,7 @@ configured `seed-node-timeout`.
|
|||
|
||||
An actor system can only join a cluster once. Additional attempts will be ignored.
|
||||
When it has successfully joined it must be restarted to be able to join another
|
||||
cluster or to join the same cluster again.It can use the same host name and port
|
||||
cluster or to join the same cluster again. It can use the same host name and port
|
||||
after the restart, when it come up as new incarnation of existing member in the cluster,
|
||||
trying to join in, then the existing one will be removed from the cluster and then it will
|
||||
be allowed to join.
|
||||
|
|
@ -168,7 +186,7 @@ status of the unreachable member must be changed to 'Down'. Changing status to '
|
|||
can be performed automatically or manually. By default it must be done manually, using
|
||||
[JMX](#cluster-jmx) or [HTTP](#cluster-http).
|
||||
|
||||
It can also be performed programmatically with `Cluster(system).down(address)`.
|
||||
It can also be performed programmatically with @scala[`Cluster(system).down(address)`]@java[`Cluster.get(system).down(address)`].
|
||||
|
||||
A pre-packaged solution for the downing problem is provided by
|
||||
[Split Brain Resolver](http://developer.lightbend.com/docs/akka-commercial-addons/current/split-brain-resolver.html),
|
||||
|
|
@ -217,7 +235,11 @@ A more graceful exit can be performed if you tell the cluster that a node shall
|
|||
This can be performed using [JMX](#cluster-jmx) or [HTTP](#cluster-http).
|
||||
It can also be performed programmatically with:
|
||||
|
||||
@@snip [ClusterDocSpec.scala]($code$/scala/docs/cluster/ClusterDocSpec.scala) { #leave }
|
||||
Scala
|
||||
: @@snip [ClusterDocSpec.scala]($code$/scala/docs/cluster/ClusterDocSpec.scala) { #leave }
|
||||
|
||||
Java
|
||||
: @@snip [ClusterDocTest.java]($code$/java/jdocs/cluster/ClusterDocTest.java) { #leave }
|
||||
|
||||
Note that this command can be issued to any member in the cluster, not necessarily the
|
||||
one that is leaving.
|
||||
|
|
@ -257,9 +279,13 @@ have no knowledge about the existence of the new members. You should for example
|
|||
## Subscribe to Cluster Events
|
||||
|
||||
You can subscribe to change notifications of the cluster membership by using
|
||||
`Cluster(system).subscribe`.
|
||||
@scala[`Cluster(system).subscribe`]@java[`Cluster.get(system).subscribe`].
|
||||
|
||||
@@snip [SimpleClusterListener2.scala]($code$/scala/docs/cluster/SimpleClusterListener2.scala) { #subscribe }
|
||||
Scala
|
||||
: @@snip [SimpleClusterListener2.scala]($code$/scala/docs/cluster/SimpleClusterListener2.scala) { #subscribe }
|
||||
|
||||
Java
|
||||
: @@snip [SimpleClusterListener2.java]($code$/java/jdocs/cluster/SimpleClusterListener2.java) { #subscribe }
|
||||
|
||||
A snapshot of the full state, `akka.cluster.ClusterEvent.CurrentClusterState`, is sent to the subscriber
|
||||
as the first message, followed by events for incremental updates.
|
||||
|
|
@ -270,19 +296,23 @@ This is expected behavior. When the node has been accepted in the cluster you wi
|
|||
receive `MemberUp` for that node, and other nodes.
|
||||
|
||||
If you find it inconvenient to handle the `CurrentClusterState` you can use
|
||||
`ClusterEvent.InitialStateAsEvents` as parameter to `subscribe`.
|
||||
@scala[`ClusterEvent.InitialStateAsEvents`] @java[`ClusterEvent.initialStateAsEvents()`] as parameter to `subscribe`.
|
||||
That means that instead of receiving `CurrentClusterState` as the first message you will receive
|
||||
the events corresponding to the current state to mimic what you would have seen if you were
|
||||
listening to the events when they occurred in the past. Note that those initial events only correspond
|
||||
to the current state and it is not the full history of all changes that actually has occurred in the cluster.
|
||||
|
||||
@@snip [SimpleClusterListener.scala]($code$/scala/docs/cluster/SimpleClusterListener.scala) { #subscribe }
|
||||
Scala
|
||||
: @@snip [SimpleClusterListener.scala]($code$/scala/docs/cluster/SimpleClusterListener.scala) { #subscribe }
|
||||
|
||||
Java
|
||||
: @@snip [SimpleClusterListener.java]($code$/java/jdocs/cluster/SimpleClusterListener.java) { #subscribe }
|
||||
|
||||
The events to track the life-cycle of members are:
|
||||
|
||||
* `ClusterEvent.MemberJoined` - A new member has joined the cluster and its status has been changed to `Joining`.
|
||||
* `ClusterEvent.MemberUp` - A new member has joined the cluster and its status has been changed to `Up`.
|
||||
* `ClusterEvent.MemberExited` - A member is leaving the cluster and its status has been changed to `Exiting`.
|
||||
* `ClusterEvent.MemberJoined` - A new member has joined the cluster and its status has been changed to `Joining`
|
||||
* `ClusterEvent.MemberUp` - A new member has joined the cluster and its status has been changed to `Up`
|
||||
* `ClusterEvent.MemberExited` - A member is leaving the cluster and its status has been changed to `Exiting`
|
||||
Note that the node might already have been shutdown when this event is published on another node.
|
||||
* `ClusterEvent.MemberRemoved` - Member completely removed from the cluster.
|
||||
* `ClusterEvent.UnreachableMember` - A member is considered as unreachable, detected by the failure detector
|
||||
|
|
@ -295,7 +325,7 @@ of classes that extends `akka.cluster.ClusterEvent.ClusterDomainEvent`
|
|||
for details about the events.
|
||||
|
||||
Instead of subscribing to cluster events it can sometimes be convenient to only get the full membership state with
|
||||
`Cluster(system).state`. Note that this state is not necessarily in sync with the events published to a
|
||||
@scala[`Cluster(system).state`]@java[`Cluster.get(system).state()`]. Note that this state is not necessarily in sync with the events published to a
|
||||
cluster subscription.
|
||||
|
||||
### Worker Dial-in Example
|
||||
|
|
@ -311,11 +341,19 @@ added or removed to the cluster dynamically.
|
|||
|
||||
Messages:
|
||||
|
||||
@@snip [TransformationMessages.scala]($code$/scala/docs/cluster/TransformationMessages.scala) { #messages }
|
||||
Scala
|
||||
: @@snip [TransformationMessages.scala]($code$/scala/docs/cluster/TransformationMessages.scala) { #messages }
|
||||
|
||||
Java
|
||||
: @@snip [TransformationMessages.java]($code$/java/jdocs/cluster/TransformationMessages.java) { #messages }
|
||||
|
||||
The backend worker that performs the transformation job:
|
||||
|
||||
@@snip [TransformationBackend.scala]($code$/scala/docs/cluster/TransformationBackend.scala) { #backend }
|
||||
Scala
|
||||
: @@snip [TransformationBackend.scala]($code$/scala/docs/cluster/TransformationBackend.scala) { #backend }
|
||||
|
||||
Java
|
||||
: @@snip [TransformationBackend.java]($code$/java/jdocs/cluster/TransformationBackend.java) { #backend }
|
||||
|
||||
Note that the `TransformationBackend` actor subscribes to cluster events to detect new,
|
||||
potential, frontend nodes, and send them a registration message so that they know
|
||||
|
|
@ -323,7 +361,11 @@ that they can use the backend worker.
|
|||
|
||||
The frontend that receives user jobs and delegates to one of the registered backend workers:
|
||||
|
||||
@@snip [TransformationFrontend.scala]($code$/scala/docs/cluster/TransformationFrontend.scala) { #frontend }
|
||||
Scala
|
||||
: @@snip [TransformationFrontend.scala]($code$/scala/docs/cluster/TransformationFrontend.scala) { #frontend }
|
||||
|
||||
Java
|
||||
: @@snip [TransformationFrontend.java]($code$/java/jdocs/cluster/TransformationFrontend.java) { #frontend }
|
||||
|
||||
Note that the `TransformationFrontend` actor watch the registered backend
|
||||
to be able to remove it from its list of available backend workers.
|
||||
|
|
@ -333,9 +375,11 @@ actor. Death watch generates the `Terminated` message to the watching actor when
|
|||
unreachable cluster node has been downed and removed.
|
||||
|
||||
The easiest way to run **Worker Dial-in Example** example yourself is to download the ready to run
|
||||
@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-scala)
|
||||
@scala[@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-scala)]
|
||||
@java[@extref[Akka Cluster Sample with Java](ecs:akka-samples-cluster-java)]
|
||||
together with the tutorial. It contains instructions on how to run the **Worker Dial-in Example** sample.
|
||||
The source code of this sample can be found in the @extref[Akka Samples Repository](samples:akka-sample-cluster-scala).
|
||||
The source code of this sample can be found in the
|
||||
@scala[@extref[Akka Samples Repository](samples:akka-sample-cluster-scala)]@java[@extref[Akka Samples Repository](samples:akka-sample-cluster-java)].
|
||||
|
||||
## Node Roles
|
||||
|
||||
|
|
@ -375,7 +419,11 @@ You can start the actors in a `registerOnMemberUp` callback, which will
|
|||
be invoked when the current member status is changed to 'Up', i.e. the cluster
|
||||
has at least the defined number of members.
|
||||
|
||||
@@snip [FactorialFrontend.scala]($code$/scala/docs/cluster/FactorialFrontend.scala) { #registerOnUp }
|
||||
Scala
|
||||
: @@snip [FactorialFrontend.scala]($code$/scala/docs/cluster/FactorialFrontend.scala) { #registerOnUp }
|
||||
|
||||
Java
|
||||
: @@snip [FactorialFrontendMain.java]($code$/java/jdocs/cluster/FactorialFrontendMain.java) { #registerOnUp }
|
||||
|
||||
This callback can be used for other things than starting actors.
|
||||
|
||||
|
|
@ -410,7 +458,7 @@ Distributes actors across several nodes in the cluster and supports interaction
|
|||
with the actors using their logical identifier, but without having to care about
|
||||
their physical location in the cluster.
|
||||
|
||||
See @ref:[Cluster Sharding](cluster-sharding.md)
|
||||
See @ref:[Cluster Sharding](cluster-sharding.md).
|
||||
|
||||
## Distributed Publish Subscribe
|
||||
|
||||
|
|
@ -575,7 +623,11 @@ Set it to a lower value if you want to limit total number of routees.
|
|||
|
||||
The same type of router could also have been defined in code:
|
||||
|
||||
@@snip [StatsService.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsService.scala) { #router-lookup-in-code }
|
||||
Scala
|
||||
: @@snip [StatsService.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsService.scala) { #router-lookup-in-code }
|
||||
|
||||
Java
|
||||
: @@snip [StatsService.java]($code$/java/jdocs/cluster/StatsService.java) { #router-lookup-in-code }
|
||||
|
||||
See [configuration](#cluster-configuration) section for further descriptions of the settings.
|
||||
|
||||
|
|
@ -592,16 +644,35 @@ the average number of characters per word when all results have been collected.
|
|||
|
||||
Messages:
|
||||
|
||||
@@snip [StatsMessages.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsMessages.scala) { #messages }
|
||||
Scala
|
||||
: @@snip [StatsMessages.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsMessages.scala) { #messages }
|
||||
|
||||
Java
|
||||
: @@snip [StatsMessages.java]($code$/java/jdocs/cluster/StatsMessages.java) { #messages }
|
||||
|
||||
The worker that counts number of characters in each word:
|
||||
|
||||
@@snip [StatsWorker.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsWorker.scala) { #worker }
|
||||
Scala
|
||||
: @@snip [StatsWorker.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsWorker.scala) { #worker }
|
||||
|
||||
Java
|
||||
: @@snip [StatsWorker.java]($code$/java/jdocs/cluster/StatsWorker.java) { #worker }
|
||||
|
||||
The service that receives text from users and splits it up into words, delegates to workers and aggregates:
|
||||
|
||||
@@@ div { .group-scala }
|
||||
|
||||
@@snip [StatsService.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsService.scala) { #service }
|
||||
|
||||
@@@
|
||||
|
||||
@@@ div { .group-java }
|
||||
|
||||
@@snip [StatsService.java]($code$/java/jdocs/cluster/StatsService.java) { #service }
|
||||
@@snip [StatsAggregator.java]($code$/java/jdocs/cluster/StatsAggregator.java) { #aggregator }
|
||||
|
||||
@@@
|
||||
|
||||
Note, nothing cluster specific so far, just plain actors.
|
||||
|
||||
All nodes start `StatsService` and `StatsWorker` actors. Remember, routees are the workers in this case.
|
||||
|
|
@ -625,9 +696,11 @@ This means that user requests can be sent to `StatsService` on any node and it w
|
|||
`StatsWorker` on all nodes.
|
||||
|
||||
The easiest way to run **Router Example with Group of Routees** example yourself is to download the ready to run
|
||||
@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-scala)
|
||||
@scala[@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-scala)]
|
||||
@java[@extref[Akka Cluster Sample with Java](ecs:akka-samples-cluster-java)]
|
||||
together with the tutorial. It contains instructions on how to run the **Router Example with Group of Routees** sample.
|
||||
The source code of this sample can be found in the @extref[Akka Samples Repository](samples:akka-sample-cluster-scala).
|
||||
The source code of this sample can be found in the
|
||||
@scala[@extref[Akka Samples Repository](samples:akka-sample-cluster-scala)]@java[@extref[Akka Samples Repository](samples:akka-sample-cluster-java)].
|
||||
|
||||
### Router with Pool of Remote Deployed Routees
|
||||
|
||||
|
|
@ -658,7 +731,11 @@ Set it to a lower value if you want to limit total number of routees.
|
|||
|
||||
The same type of router could also have been defined in code:
|
||||
|
||||
@@snip [StatsService.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsService.scala) { #router-deploy-in-code }
|
||||
Scala
|
||||
: @@snip [StatsService.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsService.scala) { #router-deploy-in-code }
|
||||
|
||||
Java
|
||||
: @@snip [StatsService.java]($code$/java/jdocs/cluster/StatsService.java) { #router-deploy-in-code }
|
||||
|
||||
See [configuration](#cluster-configuration) section for further descriptions of the settings.
|
||||
|
||||
|
|
@ -666,27 +743,39 @@ See [configuration](#cluster-configuration) section for further descriptions of
|
|||
|
||||
Let's take a look at how to use a cluster aware router on single master node that creates
|
||||
and deploys workers. To keep track of a single master we use the @ref:[Cluster Singleton](cluster-singleton.md)
|
||||
in the cluster-tools module. The `ClusterSingletonManager` is started on each node.:
|
||||
in the cluster-tools module. The `ClusterSingletonManager` is started on each node:
|
||||
|
||||
```
|
||||
system.actorOf(
|
||||
Scala
|
||||
: @@@vars
|
||||
```
|
||||
system.actorOf(
|
||||
ClusterSingletonManager.props(
|
||||
singletonProps = Props[StatsService],
|
||||
terminationMessage = PoisonPill,
|
||||
settings = ClusterSingletonManagerSettings(system).withRole("compute")),
|
||||
name = "statsService")
|
||||
```
|
||||
```
|
||||
@@@
|
||||
|
||||
Java
|
||||
: @@snip [StatsSampleOneMasterMain.java]($code$/java/jdocs/cluster/StatsSampleOneMasterMain.java) { #create-singleton-manager }
|
||||
|
||||
We also need an actor on each node that keeps track of where current single master exists and
|
||||
delegates jobs to the `StatsService`. That is provided by the `ClusterSingletonProxy`.:
|
||||
delegates jobs to the `StatsService`. That is provided by the `ClusterSingletonProxy`:
|
||||
|
||||
```
|
||||
system.actorOf(
|
||||
Scala
|
||||
: @@@vars
|
||||
```
|
||||
system.actorOf(
|
||||
ClusterSingletonProxy.props(
|
||||
singletonManagerPath = "/user/statsService",
|
||||
settings = ClusterSingletonProxySettings(system).withRole("compute")),
|
||||
name = "statsServiceProxy")
|
||||
```
|
||||
```
|
||||
@@@
|
||||
|
||||
Java
|
||||
: @@snip [StatsSampleOneMasterMain.java]($code$/java/jdocs/cluster/StatsSampleOneMasterMain.java) { #singleton-proxy }
|
||||
|
||||
The `ClusterSingletonProxy` receives text from users and delegates to the current `StatsService`, the single
|
||||
master. It listens to cluster events to lookup the `StatsService` on the oldest node.
|
||||
|
|
@ -708,15 +797,19 @@ akka.actor.deployment {
|
|||
```
|
||||
|
||||
The easiest way to run **Router Example with Pool of Remote Deployed Routees** example yourself is to download the ready to run
|
||||
@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-scala)
|
||||
@scala[@extref[Akka Cluster Sample with Scala](ecs:akka-samples-cluster-scala)]
|
||||
@java[@extref[Akka Cluster Sample with Java](ecs:akka-samples-cluster-java)]
|
||||
together with the tutorial. It contains instructions on how to run the **Router Example with Pool of Remote Deployed Routees** sample.
|
||||
The source code of this sample can be found in the @extref[Akka Samples Repository](samples:akka-sample-cluster-scala).
|
||||
The source code of this sample can be found in the
|
||||
@scala[@extref[Akka Samples Repository](samples:akka-sample-cluster-scala)]@java[@extref[Akka Samples Repository](samples:akka-sample-cluster-java)].
|
||||
|
||||
## Cluster Metrics
|
||||
|
||||
The member nodes of the cluster can collect system health metrics and publish that to other cluster nodes
|
||||
and to the registered subscribers on the system event bus with the help of `cluster-metrics`.
|
||||
|
||||
@@@ div { .group-scala }
|
||||
|
||||
## How to Test
|
||||
|
||||
@ref:[Multi Node Testing](multi-node-testing.md) is useful for testing cluster applications.
|
||||
|
|
@ -769,6 +862,17 @@ the actor system for a specific role. This can also be used to grab the `akka.ac
|
|||
|
||||
@@snip [StatsSampleSpec.scala]($akka$/akka-cluster-metrics/src/multi-jvm/scala/akka/cluster/metrics/sample/StatsSampleSpec.scala) { #addresses }
|
||||
|
||||
@@@
|
||||
|
||||
@@@ div { .group-java }
|
||||
|
||||
## How to Test
|
||||
|
||||
Currently testing with the `sbt-multi-jvm` plugin is only documented for Scala.
|
||||
Go to the corresponding @ref[Scala page](../scala/cluster-usage.md#how-to-test) for details.
|
||||
|
||||
@@@
|
||||
|
||||
## Management
|
||||
|
||||
<a id="cluster-http"></a>
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue