.. _persistence:

###########
Persistence
###########

This section describes an early access version of the Akka persistence module. Akka persistence is heavily inspired
by the `eventsourced`_ library. It follows the same concepts and architecture as `eventsourced`_ but differs
significantly at the API and implementation level.

.. warning::

  This module is marked as **“experimental”** as of its introduction in Akka 2.3.0. We will continue to
  improve this API based on our users’ feedback, which implies that, while we try to keep incompatible
  changes to a minimum, the binary compatibility guarantee for maintenance releases does not apply to the
  contents of the ``akka.persistence`` package.

.. _eventsourced: https://github.com/eligosource/eventsourced

Dependencies
============

Akka persistence is a separate jar file. Make sure that you have the following dependency in your project::

  "com.typesafe.akka" %% "akka-persistence-experimental" % "@version@" @crossString@

Architecture
============

* *Processor*: A processor is a persistent actor. Messages sent to a processor are written to a journal before
  its ``receive`` method is called. When a processor is started or restarted, journaled messages are replayed
  to that processor, so that it can recover internal state from these messages.

* *Channel*: Channels are used by processors to communicate with other actors. They prevent replayed messages
  from being redundantly delivered to these actors.
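
The journal-then-receive and replay behaviour of a processor can be pictured with a small, self-contained model.
This is only a sketch under simplifying assumptions: ``ToyJournal`` and ``ToyProcessor`` are hypothetical names
that are not part of the ``akka.persistence`` API, the journal is a plain in-memory vector, and everything runs
synchronously.

.. code-block:: scala

  object ProcessorModel {
    // Hypothetical in-memory journal; a real journal writes to a storage backend.
    final class ToyJournal {
      private var entries = Vector.empty[String]
      def write(msg: String): Unit = { entries = entries :+ msg }
      def replay(handler: String => Unit): Unit = entries.foreach(handler)
    }

    // Hypothetical processor whose internal state is the sequence of received messages.
    final class ToyProcessor(journal: ToyJournal) {
      private var received = Vector.empty[String]

      private def receive(msg: String): Unit = { received = received :+ msg }

      // On (re)start, recover internal state by replaying journaled messages.
      journal.replay(msg => receive(msg))

      def send(msg: String): Unit = {
        journal.write(msg) // the message is journaled first ...
        receive(msg)       // ... and only then handed to the receive logic
      }

      def state: Vector[String] = received
    }
  }

Creating a second ``ToyProcessor`` on the same ``ToyJournal`` stands in for a restart: the new instance ends up
with the same state as the first one without any message being sent to it again.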

Use cases
=========

* TODO: describe command sourcing
* TODO: describe event sourcing

Configuration
=============

By default, journaled messages are written to a directory named ``journal`` in the current working directory. This
location can be changed by configuration; the specified path can be relative or absolute:

.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#journal-config

The default storage location of :ref:`snapshots` is a directory named ``snapshots`` in the current working directory.
This location can be changed by configuration; the specified path can be relative or absolute:

.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#snapshot-config

.. _processors:

Processors
==========

A processor can be implemented by extending the ``Processor`` trait and implementing the ``receive`` method.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#definition

Processors only write messages of type ``Persistent`` to the journal; others are received without being persisted.
When a processor's ``receive`` method is called with a ``Persistent`` message it can safely assume that this message
has been successfully written to the journal. If a journal fails to write a ``Persistent`` message then the processor
receives a ``PersistenceFailure`` message instead of a ``Persistent`` message. In this case, a processor may want to
inform the sender about the failure, so that the sender can re-send the message, if needed, under the assumption that
the journal recovered from a temporary failure.

A ``Processor`` itself is an ``Actor`` and can therefore be instantiated with ``actorOf``.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#usage

Recovery
--------

By default, a processor is automatically recovered on start and on restart by replaying journaled messages.
New messages sent to a processor during recovery do not interfere with replayed messages. New messages will
only be received by that processor after recovery completes.

Recovery customization
^^^^^^^^^^^^^^^^^^^^^^

Automated recovery on start can be disabled by overriding ``preStart`` with an empty implementation.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recover-on-start-disabled

In this case, a processor must be recovered explicitly by sending it a ``Recover()`` message.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recover-explicit

If not overridden, ``preStart`` sends a ``Recover()`` message to ``self``. Applications may also override
``preStart`` to define further ``Recover()`` parameters such as an upper sequence number bound, for example.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recover-on-start-custom

Automated recovery on restart can be disabled by overriding ``preRestart`` with an empty implementation.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recover-on-restart-disabled

Recovery status
^^^^^^^^^^^^^^^

A processor can query its own recovery status via the methods

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recovery-status

.. _failure-handling:

Failure handling
^^^^^^^^^^^^^^^^

A persistent message that caused an exception will be received again by a processor after restart. To prevent
a replay of that message during recovery, it can be marked as deleted.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#deletion

Identifiers
-----------

A processor must have an identifier that doesn't change across different actor incarnations. It defaults to the
``String`` representation of the processor's path and can be obtained via the ``processorId`` method.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#processor-id

Applications can customize a processor's id by specifying an actor name during processor creation as shown in
section :ref:`processors`. This works well when using local actor references but may cause problems with remote
actor references because their paths also contain deployment information such as host and port (and actor deployments
are likely to change during the lifetime of an application). In this case, ``Processor`` implementation classes
should override ``processorId``.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#processor-id-override

Later versions of the Akka persistence module will likely offer a possibility to migrate processor ids.

Channels
========

Channels are special actors that are used by processors to communicate with other actors (channel destinations).
Channels prevent redundant delivery of replayed messages to destinations during processor recovery. A replayed
message is retained by a channel (that is, not delivered again) if its previous delivery has been confirmed by a
destination.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#channel-example

A channel is ready to use once it has been created; no recovery or further activation is needed. A ``Deliver``
request instructs a channel to send a ``Persistent`` message to a destination, and the sender of the ``Deliver``
request is forwarded to that destination. A processor may also reply to a message sender directly by using ``sender``
as channel destination.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#channel-example-reply

Channel destinations confirm the delivery of a ``Persistent`` message by calling its ``confirm()`` method. This
(asynchronously) writes a confirmation entry to the journal. Replayed messages internally contain these confirmation
entries, which allows a channel to decide whether a message should be retained or not.
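
The decision a channel makes for each replayed message can be modelled in a few lines. This is a simplified sketch,
not the channel implementation: ``Replayed``, ``ToyChannel`` and the ``confirmedBy`` field are hypothetical names,
and the assumption that a confirmation entry is represented by a channel id is made purely for illustration.

.. code-block:: scala

  object ChannelModel {
    // A replayed message together with the ids of the channels whose delivery was confirmed.
    final case class Replayed(sequenceNr: Long, payload: String, confirmedBy: Set[String])

    // Hypothetical channel that records what it would deliver instead of sending to an actor.
    final class ToyChannel(val id: String) {
      private var delivered = Vector.empty[String]

      def deliver(msg: Replayed): Unit = {
        // Retain (do not deliver) messages that this channel has already delivered and
        // for which the confirmation reached the journal.
        if (!msg.confirmedBy.contains(id)) delivered = delivered :+ msg.payload
      }

      def deliveries: Vector[String] = delivered
    }
  }

In this model, a message whose ``confirmedBy`` set contains the channel's id is dropped during replay, while an
unconfirmed message is delivered again.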

If an application crashes after a destination has called ``confirm()`` but before the confirmation entry has been
written to the journal, the unconfirmed message will be delivered again during the next recovery. It is then
the destination's responsibility to detect the duplicate, or to simply process the message again if it is an
idempotent receiver. Duplicates can be detected, for example, by tracking sequence numbers.
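
One possible way for a destination to track sequence numbers is sketched below. It is a minimal illustration under
stated assumptions: ``SeenTracker`` and the ``source`` key are hypothetical and not part of the persistence API, and
the tracker keeps its state in memory only, so a real destination would have to make that state durable for the
check to survive a crash of the destination itself.

.. code-block:: scala

  object DuplicateDetection {
    // Remembers which (source, sequenceNr) pairs have already been processed.
    // `source` is an application-chosen key identifying the sending processor.
    final class SeenTracker {
      private var seen = Set.empty[(String, Long)]

      /** Returns true the first time a pair is offered and false for every duplicate. */
      def firstDelivery(source: String, sequenceNr: Long): Boolean = {
        val key = (source, sequenceNr)
        if (seen.contains(key)) false
        else {
          seen = seen + key
          true
        }
      }
    }
  }

A destination would call ``firstDelivery`` before processing a message and skip processing (while still confirming)
when it returns ``false``. Sequence numbers are assigned per processor, which is why the sketch keys on the source
as well as on the number.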

Currently, channels do not store ``Deliver`` requests or retry delivery on network or destination failures. This
feature (*reliable channels*) will be available soon.

Sender resolution
-----------------

``ActorRef``\ s of ``Persistent`` message senders are also stored in the journal. Consequently, they may become invalid if
an application is restarted and messages are replayed. For example, the stored ``ActorRef`` may then reference
a previous incarnation of a sender and a new incarnation of that sender cannot receive a reply from a processor.
This may be acceptable for many applications but others may require that a new sender incarnation receives the
reply (to reliably resume a conversation between actors after a JVM crash, for example). Here, a channel may
assist in resolving new sender incarnations by specifying a third ``Deliver`` argument:

* ``Resolve.Destination`` if the sender of a persistent message is used as channel destination.

  .. includecode:: code/docs/persistence/PersistenceDocSpec.scala#resolve-destination

* ``Resolve.Sender`` if the sender of a persistent message is forwarded to a destination.

  .. includecode:: code/docs/persistence/PersistenceDocSpec.scala#resolve-sender

The default is ``Resolve.Off``, which means no resolution. Find out more in the ``Deliver`` API docs.

Identifiers
-----------

In the same way as :ref:`processors`, channels also have an identifier that defaults to a channel's path. A channel
identifier can therefore be customized by using a custom actor name at channel creation. As already mentioned, this
works well when using local actor references but may cause problems with remote actor references. In this case, an
application-defined channel id should be provided as argument to ``Channel.props(String)``:

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#channel-id-override

Persistent messages
===================

Payload
-------

The payload of a ``Persistent`` message can be obtained via its

.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/Persistent.scala#payload

method or by pattern matching

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#payload-pattern-matching

Inside processors, new persistent messages are derived from the current persistent message before sending them via a
channel, either by calling ``p.withPayload(...)`` or ``Persistent.create(...)`` where the latter uses the
implicit ``currentPersistentMessage`` made available by ``Processor``.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#current-message

This is necessary for delivery confirmations to work properly. Both ways are equivalent but we recommend
using ``p.withPayload(...)`` for clarity.

Sequence number
---------------

The sequence number of a ``Persistent`` message can be obtained via its

.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/Persistent.scala#sequence-nr

method or by pattern matching

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#sequence-nr-pattern-matching

Persistent messages are assigned sequence numbers on a per-processor basis. A sequence starts at ``1L`` and
doesn't contain gaps unless a processor marks a message as deleted.

.. _snapshots:

Snapshots
=========

Snapshots can dramatically reduce recovery times. Processors can save snapshots of internal state by calling the
``saveSnapshot`` method on ``Processor``. If saving of a snapshot succeeds, the processor will receive a
``SaveSnapshotSuccess`` message, otherwise a ``SaveSnapshotFailure`` message

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#save-snapshot

where ``metadata`` is of type ``SnapshotMetadata``:

.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/Snapshot.scala#snapshot-metadata

During recovery, the processor is offered a previously saved snapshot via a ``SnapshotOffer`` message from
which it can initialize internal state.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#snapshot-offer

The replayed messages that follow the ``SnapshotOffer`` message, if any, are younger than the offered snapshot.
They finally recover the processor to its current (i.e. latest) state.

In general, a processor is only offered a snapshot if that processor has previously saved one or more snapshots
and at least one of these snapshots matches the ``SnapshotSelectionCriteria`` that can be specified for recovery.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#snapshot-criteria

If not specified, the criteria default to ``SnapshotSelectionCriteria.Latest``, which selects the latest (= youngest) snapshot.
To disable snapshot-based recovery, applications should use ``SnapshotSelectionCriteria.None``. A recovery where no
saved snapshot matches the specified ``SnapshotSelectionCriteria`` will replay all journaled messages.
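
How snapshot selection and message replay combine during recovery can be illustrated with a small model. This is a
sketch, not the library's recovery code: the ``Snapshot`` and ``Criteria`` case classes, their field names and the
integer "state" are assumptions made for the example.

.. code-block:: scala

  object SnapshotRecoveryModel {
    // Hypothetical saved snapshot: the state as of a given sequence number and time.
    final case class Snapshot(sequenceNr: Long, timestamp: Long, state: Int)

    // Hypothetical selection criteria; the defaults match every snapshot.
    final case class Criteria(maxSequenceNr: Long = Long.MaxValue,
                              maxTimestamp: Long = Long.MaxValue)

    // Pick the youngest snapshot that satisfies the criteria, if any.
    def select(snapshots: Seq[Snapshot], c: Criteria): Option[Snapshot] = {
      val matching = snapshots.filter { s =>
        s.sequenceNr <= c.maxSequenceNr && s.timestamp <= c.maxTimestamp
      }
      if (matching.isEmpty) None else Some(matching.maxBy(_.sequenceNr))
    }

    // Journal entries are (sequenceNr, increment) pairs; the state is their running sum.
    def recover(journal: Seq[(Long, Int)], snapshots: Seq[Snapshot], c: Criteria): Int = {
      val offered = select(snapshots, c)
      val startState = offered.map(_.state).getOrElse(0)
      val startNr = offered.map(_.sequenceNr).getOrElse(0L)
      // Only messages younger than the offered snapshot are replayed.
      journal.collect { case (nr, inc) if nr > startNr => inc }.foldLeft(startState)(_ + _)
    }
  }

With a matching snapshot, only the messages after the snapshot's sequence number are replayed. With criteria that
match no snapshot, all messages are replayed from the beginning. Both paths arrive at the same final state; the
snapshot only shortens the replay.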

Storage plugins
===============

Storage backends for journals and snapshot stores are plugins in akka-persistence. The default journal plugin writes
messages to LevelDB. The default snapshot store plugin writes snapshots as individual files to the local filesystem.
Applications can provide their own plugins by implementing a plugin API and activate them by configuration. Plugin
development requires the following imports:

.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#plugin-imports

Journal plugin API
------------------

A journal plugin extends either ``SyncWriteJournal`` or ``AsyncWriteJournal``. ``SyncWriteJournal`` is an
actor that should be extended when the storage backend API only supports synchronous, blocking writes. The
methods to be implemented in this case are:

.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/journal/SyncWriteJournal.scala#journal-plugin-api

``AsyncWriteJournal`` is an actor that should be extended if the storage backend API supports asynchronous,
non-blocking writes. The methods to be implemented in that case are:

.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/journal/AsyncWriteJournal.scala#journal-plugin-api

Message replays are always asynchronous. Therefore, any journal plugin must implement:

.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/journal/AsyncReplay.scala#journal-plugin-api

A journal plugin can be activated with the following minimal configuration:

.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#journal-plugin-config

The specified plugin ``class`` must have a no-arg constructor. The ``plugin-dispatcher`` is the dispatcher
used for the plugin actor. If not specified, it defaults to ``akka.persistence.dispatchers.default-plugin-dispatcher``
for ``SyncWriteJournal`` plugins and ``akka.actor.default-dispatcher`` for ``AsyncWriteJournal`` plugins.

Snapshot store plugin API
-------------------------

A snapshot store plugin must extend the ``SnapshotStore`` actor and implement the following methods:

.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/snapshot/SnapshotStore.scala#snapshot-store-plugin-api

A snapshot store plugin can be activated with the following minimal configuration:

.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#snapshot-store-plugin-config

The specified plugin ``class`` must have a no-arg constructor. The ``plugin-dispatcher`` is the dispatcher
used for the plugin actor. If not specified, it defaults to ``akka.persistence.dispatchers.default-plugin-dispatcher``.

Miscellaneous
=============

State machines
--------------

State machines can be persisted by mixing the ``FSM`` trait into processors.

.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#fsm-example

Upcoming features
=================

* Reliable channels
* Custom serialization of messages and snapshots
* Extended deletion of messages and snapshots
* ...
-												akka-persistence prototype

The most prominent changes compared to eventsourced are:

- No central processor and channel registry any more
- Auto-recovery of processors on start and restart (can be disabled)
- Recovery of processor networks doesn't require coordination
- Explicit channel activation not needed any more
- Message sequence numbers generated per processor (no gaps)
- Sender references are journaled along with messages
- Processors can determine their recovery status
- No custom API on extension object, only messages
- Journal created by extension from config, not by application
- Applications only interact with processors and channels via messages
- Internal design prepared for having processor-specific journal actors (for later optimization possibilities)

Further additions and changes during review:

- Allow processor implementation classes to use inherited stash
- Channel support to resolve (potentially invalid) sender references
- Logical intead of physical deletion of messages
- Pinned dispatcher for LevelDB journal
- Processor can handle failures during recovery
- Message renamed to Persistent

This prototype has the following limitations:

- Serialization of persistent messages and their payload via JavaSerializer only (will be configurable later)
- The LevelDB journal implementation based on a LevelDB Java port, not the native LevelDB (will be configurable later)

The following features will be added later using separate tickets:

- Snapshot-based recovery
- Reliable channels
- Journal plugin API
- Optimizations
- ...

											
										
										
											2013-09-14 14:19:18 +02:00
+								.. _persistence:
 								###########
 								Persistence
 								###########
 								This section describes an early access version of the Akka persistence module. Akka persistence is heavily inspired
 								by the `eventsourced`_ library. It follows the same concepts and architecture of `eventsourced`_ but significantly
 								differs on API and implementation level.
 								.. warning::
 								  This module is marked as **“experimental”** as of its introduction in Akka 2.3.0. We will continue to
 								  improve this API based on our users’ feedback, which implies that while we try to keep incompatible
 								  changes to a minimum the binary compatibility guarantee for maintenance releases does not apply to the
 								  contents of the ``akka.persistence`` package.
 								.. _eventsourced: https://github.com/eligosource/eventsourced
 								Dependencies
 								============
 								Akka persistence is a separate jar file. Make sure that you have the following dependency in your project::
-												=per #3615 Fix InvalidActorNameException in persistence tests

- use overridden processor and channel ids
- no need anymore to wait for processor instances to stop
- unrelated: fix wrong artifact names in documentation

											
										
										
											2013-09-18 11:55:29 +02:00
+								  "com.typesafe.akka" %% "akka-persistence-experimental" % "@version@" @crossString@
-												akka-persistence prototype

The most prominent changes compared to eventsourced are:

- No central processor and channel registry any more
- Auto-recovery of processors on start and restart (can be disabled)
- Recovery of processor networks doesn't require coordination
- Explicit channel activation not needed any more
- Message sequence numbers generated per processor (no gaps)
- Sender references are journaled along with messages
- Processors can determine their recovery status
- No custom API on extension object, only messages
- Journal created by extension from config, not by application
- Applications only interact with processors and channels via messages
- Internal design prepared for having processor-specific journal actors (for later optimization possibilities)

Further additions and changes during review:

- Allow processor implementation classes to use inherited stash
- Channel support to resolve (potentially invalid) sender references
- Logical intead of physical deletion of messages
- Pinned dispatcher for LevelDB journal
- Processor can handle failures during recovery
- Message renamed to Persistent

This prototype has the following limitations:

- Serialization of persistent messages and their payload via JavaSerializer only (will be configurable later)
- The LevelDB journal implementation based on a LevelDB Java port, not the native LevelDB (will be configurable later)

The following features will be added later using separate tickets:

- Snapshot-based recovery
- Reliable channels
- Journal plugin API
- Optimizations
- ...

											
										
										
											2013-09-14 14:19:18 +02:00
 								Architecture
 								============
 								* *Processor*: A processor is a persistent actor. Messages sent to a processor are written to a journal before
 								  its ``receive`` method is called. When a processor is started or restarted, journaled messages are replayed
 								  to that processor, so that it can recover internal state from these messages.
 								* *Channel*: Channels are used by processors to communicate with other actors. They prevent that replayed messages
 								  are redundantly delivered to these actors.
 								Use cases
 								=========
 								* TODO: describe command sourcing
 								* TODO: describe event sourcing
 								Configuration
 								=============
 								By default, journaled messages are written to a directory named ``journal`` in the current working directory. This
 								can be changed by configuration where the specified path can be relative or absolute:
-												+per #3641 Storage plugin API

- Journal plugin API for storage backends with asynchronous client API (default impl: in-memory journal)
- Journal plugin API for storage backends with synchronous client API (default impl: LevelDB journal)
- Snapshot store plugin API (default impl: local filesystem snapshot store)

											
										
										
											2013-10-08 11:46:02 +02:00
+								.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#journal-config
-												!per #3631 Snapshotting

- capture and save snapshots of processor state
- start processor recovery from saved snapshots
- snapshot storage on local filesystem
- snapshot store completely isolated from journal
- LevelDB journal modularized (and completely re-rwritten)
- In-memory journal removed

											
										
										
											2013-09-26 09:14:43 +02:00
 								The default storage location of :ref:`snapshots` is a directory named ``snapshots`` in the current working directory.
 								This can be changed by configuration where the specified path can be relative or absolute:
-												+per #3641 Storage plugin API

- Journal plugin API for storage backends with asynchronous client API (default impl: in-memory journal)
- Journal plugin API for storage backends with synchronous client API (default impl: LevelDB journal)
- Snapshot store plugin API (default impl: local filesystem snapshot store)

											
										
										
											2013-10-08 11:46:02 +02:00
+								.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#snapshot-config
-												akka-persistence prototype

The most prominent changes compared to eventsourced are:

- No central processor and channel registry any more
- Auto-recovery of processors on start and restart (can be disabled)
- Recovery of processor networks doesn't require coordination
- Explicit channel activation not needed any more
- Message sequence numbers generated per processor (no gaps)
- Sender references are journaled along with messages
- Processors can determine their recovery status
- No custom API on extension object, only messages
- Journal created by extension from config, not by application
- Applications only interact with processors and channels via messages
- Internal design prepared for having processor-specific journal actors (for later optimization possibilities)

Further additions and changes during review:

- Allow processor implementation classes to use inherited stash
- Channel support to resolve (potentially invalid) sender references
- Logical intead of physical deletion of messages
- Pinned dispatcher for LevelDB journal
- Processor can handle failures during recovery
- Message renamed to Persistent

This prototype has the following limitations:

- Serialization of persistent messages and their payload via JavaSerializer only (will be configurable later)
- The LevelDB journal implementation based on a LevelDB Java port, not the native LevelDB (will be configurable later)

The following features will be added later using separate tickets:

- Snapshot-based recovery
- Reliable channels
- Journal plugin API
- Optimizations
- ...

											
										
										
											2013-09-14 14:19:18 +02:00
 								.. _processors:
 								Processors
 								==========
 								A processor can be implemented by extending the ``Processor`` trait and implementing the ``receive`` method.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#definition
 								Processors only write messages of type ``Persistent`` to the journal, others are received without being persisted.
 								When a processor's ``receive`` method is called with a ``Persistent`` message it can safely assume that this message
-												+per #3641 Storage plugin API

- Journal plugin API for storage backends with asynchronous client API (default impl: in-memory journal)
- Journal plugin API for storage backends with synchronous client API (default impl: LevelDB journal)
- Snapshot store plugin API (default impl: local filesystem snapshot store)

											
										
										
											2013-10-08 11:46:02 +02:00
+								has been successfully written to the journal. If a journal fails to write a ``Persistent`` message then the processor
 								receives a ``PersistenceFailure`` message instead of a ``Persistent`` message. In this case, a processor may want to
 								inform the sender about the failure, so that the sender can re-send the message, if needed, under the assumption that
 								the journal recovered from a temporary failure.
 								A ``Processor`` itself is an ``Actor`` and can therefore be instantiated with ``actorOf``.
-												akka-persistence prototype

The most prominent changes compared to eventsourced are:

- No central processor and channel registry any more
- Auto-recovery of processors on start and restart (can be disabled)
- Recovery of processor networks doesn't require coordination
- Explicit channel activation not needed any more
- Message sequence numbers generated per processor (no gaps)
- Sender references are journaled along with messages
- Processors can determine their recovery status
- No custom API on extension object, only messages
- Journal created by extension from config, not by application
- Applications only interact with processors and channels via messages
- Internal design prepared for having processor-specific journal actors (for later optimization possibilities)

Further additions and changes during review:

- Allow processor implementation classes to use inherited stash
- Channel support to resolve (potentially invalid) sender references
- Logical intead of physical deletion of messages
- Pinned dispatcher for LevelDB journal
- Processor can handle failures during recovery
- Message renamed to Persistent

This prototype has the following limitations:

- Serialization of persistent messages and their payload via JavaSerializer only (will be configurable later)
- The LevelDB journal implementation based on a LevelDB Java port, not the native LevelDB (will be configurable later)

The following features will be added later using separate tickets:

- Snapshot-based recovery
- Reliable channels
- Journal plugin API
- Optimizations
- ...

											
										
										
											2013-09-14 14:19:18 +02:00
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#usage
 								Recovery
 								--------
 								By default, a processor is automatically recovered on start and on restart by replaying journaled messages.
 								New messages sent to a processor during recovery do not interfere with replayed messages. New messages will
 								only be received by that processor after recovery completes.
 								Recovery customization
 								^^^^^^^^^^^^^^^^^^^^^^
-												!per #3618 Cleanup duplication of life cycle hooks in Processor

 - introduce around life cycle hooks for symmetry with aroundReceive
 - no custom processor-specific life cycle hooks needed any more
 - preStart and preRestart can be overridden with empty implementation
    (interceptors ensure that super.preXxx calls are still executed)
 - all around life cycle hooks can be final
 - standard life cycle hooks are non-final to preserve composability with existing traits (FSM, ...)

											
										
										
											2013-09-15 09:04:05 +02:00
+								Automated recovery on start can be disabled by overriding ``preStart`` with an empty implementation.
-												akka-persistence prototype

The most prominent changes compared to eventsourced are:

- No central processor and channel registry any more
- Auto-recovery of processors on start and restart (can be disabled)
- Recovery of processor networks doesn't require coordination
- Explicit channel activation not needed any more
- Message sequence numbers generated per processor (no gaps)
- Sender references are journaled along with messages
- Processors can determine their recovery status
- No custom API on extension object, only messages
- Journal created by extension from config, not by application
- Applications only interact with processors and channels via messages
- Internal design prepared for having processor-specific journal actors (for later optimization possibilities)

Further additions and changes during review:

- Allow processor implementation classes to use inherited stash
- Channel support to resolve (potentially invalid) sender references
- Logical intead of physical deletion of messages
- Pinned dispatcher for LevelDB journal
- Processor can handle failures during recovery
- Message renamed to Persistent

This prototype has the following limitations:

- Serialization of persistent messages and their payload via JavaSerializer only (will be configurable later)
- The LevelDB journal implementation based on a LevelDB Java port, not the native LevelDB (will be configurable later)

The following features will be added later using separate tickets:

- Snapshot-based recovery
- Reliable channels
- Journal plugin API
- Optimizations
- ...

											
										
										
											2013-09-14 14:19:18 +02:00
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recover-on-start-disabled
 								In this case, a processor must be recovered explicitly by sending it a ``Recover()`` message.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recover-explicit
-												!per #3618 Cleanup duplication of life cycle hooks in Processor

 - introduce around life cycle hooks for symmetry with aroundReceive
 - no custom processor-specific life cycle hooks needed any more
 - preStart and preRestart can be overridden with empty implementation
    (interceptors ensure that super.preXxx calls are still executed)
 - all around life cycle hooks can be final
 - standard life cycle hooks are non-final to preserve composability with existing traits (FSM, ...)

											
										
										
											2013-09-15 09:04:05 +02:00
+								If not overridden, ``preStart`` sends a ``Recover()`` message to ``self``. Applications may also override
 								``preStart`` to define further ``Recover()`` parameters such as an upper sequence number bound, for example.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recover-on-start-custom
 								Automated recovery on restart can be disabled by overriding ``preRestart`` with an empty implementation.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recover-on-restart-disabled
 								Recovery status
 								^^^^^^^^^^^^^^^
 								A processor can query its own recovery status via the methods
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#recovery-status
 								.. _failure-handling:
 								Failure handling
 								^^^^^^^^^^^^^^^^
 								A persistent message that caused an exception will be received again by a processor after restart. To prevent
 								a replay of that message during recovery it can be marked as deleted.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#deletion
 								Identifiers
 								-----------
 								A processor must have an identifier that doesn't change across different actor incarnations. It defaults to the
 								``String`` representation of the processor's path and can be obtained via the ``processorId`` method.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#processor-id
 								Applications can customize a processor's id by specifying an actor name during processor creation as shown in
 								section :ref:`processors`. This works well when using local actor references but may cause problems with remote
 								actor references because their paths also contain deployment information such as host and port (and actor deployments
 								are likely to change during the lifetime of an application). In this case, ``Processor`` implementation classes
 								should override ``processorId``.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#processor-id-override
 								Later versions of the Akka persistence module will likely offer a way to migrate processor ids.
 								Channels
 								========
 								Channels are special actors that are used by processors to communicate with other actors (channel destinations).
 								Channels prevent redundant delivery of replayed messages to destinations during processor recovery. A replayed
 								message is retained by a channel if its previous delivery has been confirmed by a destination.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#channel-example
 								A channel is ready to use once it has been created; no recovery or further activation is needed. A ``Deliver``
 								request instructs a channel to send a ``Persistent`` message to a destination, where the sender of the ``Deliver``
 								request is forwarded to the destination. A processor may also reply to a message sender directly by using ``sender``
 								as channel destination.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#channel-example-reply
 								Channel destinations confirm the delivery of a ``Persistent`` message by calling its ``confirm()`` method. This
 								(asynchronously) writes a confirmation entry to the journal. Replayed messages internally contain these confirmation
 								entries, which allows a channel to decide whether a message should be retained.
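
The confirmation mechanism described above can be pictured with a small, self-contained model. This is only a sketch in plain Scala, not the akka-persistence API: ``Replayed``, ``confirms`` and ``toDeliver`` are hypothetical names, and it assumes each replayed message carries the ids of the channels that already confirmed it.

```scala
// Hypothetical model of a channel's delivery decision (not the akka-persistence API).
object ChannelSketch {
  // A replayed message together with the ids of channels whose delivery was confirmed.
  final case class Replayed(sequenceNr: Long, payload: String, confirms: Set[String])

  // A channel holds back replayed messages it has a confirmation entry for
  // and delivers all others.
  def toDeliver(channelId: String, replayed: Seq[Replayed]): Seq[Replayed] =
    replayed.filterNot(_.confirms.contains(channelId))
}
```

In this model, a message confirmed via one channel is still delivered by a different channel that has no confirmation entry for it.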
 								If an application crashes after a destination has called ``confirm()`` but before the confirmation entry could be
 								written to the journal, the unconfirmed message will be delivered again during the next recovery. It is then
 								the destination's responsibility to detect the duplicate, or to simply process the message again if it is an
 								idempotent receiver. Duplicates can be detected, for example, by tracking sequence numbers.
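
Duplicate detection by tracking sequence numbers could look roughly like the following. This is a minimal sketch in plain Scala with hypothetical names (``Destination``, ``receive``); it assumes that all messages come from a single processor and arrive in sequence number order.

```scala
// Hypothetical destination that drops redelivered messages (not the akka-persistence API).
object DedupSketch {
  final class Destination {
    // Highest sequence number processed so far; 0 means nothing was processed yet.
    private var lastProcessed = 0L
    private var processed = Vector.empty[String]

    // Returns true if the message was processed, false if it was dropped as a duplicate.
    def receive(sequenceNr: Long, payload: String): Boolean =
      if (sequenceNr <= lastProcessed) false
      else {
        processed :+= payload
        lastProcessed = sequenceNr
        true
      }

    def result: Vector[String] = processed
  }
}
```

A real destination would also have to keep ``lastProcessed`` across its own restarts, which this sketch does not attempt.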
 								Currently, channels do not store ``Deliver`` requests or retry delivery on network or destination failures. This
 								feature (*reliable channels*) will be available soon.
 								Sender resolution
 								-----------------
 								``ActorRef``\ s of ``Persistent`` message senders are also stored in the journal. Consequently, they may become invalid if
 								an application is restarted and messages are replayed. For example, the stored ``ActorRef`` may then reference
 								a previous incarnation of a sender and a new incarnation of that sender cannot receive a reply from a processor.
 								This may be acceptable for many applications but others may require that a new sender incarnation receives the
 								reply (to reliably resume a conversation between actors after a JVM crash, for example). Here, a channel may
 								assist in resolving new sender incarnations by specifying a third ``Deliver`` argument:
 								* ``Resolve.Destination`` if the sender of a persistent message is used as channel destination
 								  .. includecode:: code/docs/persistence/PersistenceDocSpec.scala#resolve-destination
 								* ``Resolve.Sender`` if the sender of a persistent message is forwarded to a destination.
 								  .. includecode:: code/docs/persistence/PersistenceDocSpec.scala#resolve-sender
 								The default is ``Resolve.Off``, which means no resolution. Find out more in the ``Deliver`` API docs.
 								Identifiers
 								-----------
 								In the same way as :ref:`processors`, channels also have an identifier that defaults to a channel's path. A channel
 								identifier can therefore be customized by using a custom actor name at channel creation. As already mentioned, this
 								works well when using local actor references but may cause problems with remote actor references. In this case, an
 								application-defined channel id should be provided as argument to ``Channel.props(String)``.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#channel-id-override
 								Persistent messages
 								===================
 								Payload
 								-------
 								The payload of a ``Persistent`` message can be obtained via its
 								.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/Persistent.scala#payload
 								method or by pattern matching
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#payload-pattern-matching
 								Inside processors, new persistent messages are derived from the current persistent message before sending them via a
 								channel, either by calling ``p.withPayload(...)`` or ``Persistent.create(...)`` where the latter uses the
 								implicit ``currentPersistentMessage`` made available by ``Processor``.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#current-message
 								This is necessary for delivery confirmations to work properly. Both ways are equivalent but we recommend
 								using ``p.withPayload(...)`` for clarity.
 								Sequence number
 								---------------
 								The sequence number of a ``Persistent`` message can be obtained via its
 								.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/Persistent.scala#sequence-nr
 								method or by pattern matching
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#sequence-nr-pattern-matching
 								Persistent messages are assigned sequence numbers on a per-processor basis. A sequence starts at ``1L`` and
 								doesn't contain gaps unless a processor marks a message as deleted.
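
The numbering rule can be illustrated with a toy in-memory journal. This is a sketch in plain Scala under simplifying assumptions (one processor, no persistence); ``Journal``, ``write``, ``markDeleted`` and ``replay`` are hypothetical names and not part of akka-persistence.

```scala
// Hypothetical single-processor journal (not the akka-persistence API).
object JournalSketch {
  final case class Entry(sequenceNr: Long, payload: String, deleted: Boolean = false)

  final class Journal {
    private var entries = Vector.empty[Entry]

    // Appends a payload and returns its sequence number (1, 2, 3, ...).
    def write(payload: String): Long = {
      val nr = entries.size + 1L
      entries :+= Entry(nr, payload)
      nr
    }

    // Marks an entry as deleted; it stays stored but is skipped on replay.
    def markDeleted(sequenceNr: Long): Unit =
      entries = entries.map(e => if (e.sequenceNr == sequenceNr) e.copy(deleted = true) else e)

    // Replays all entries that are not marked as deleted, in sequence number order.
    def replay: Vector[Entry] = entries.filterNot(_.deleted)
  }
}
```

After writing three messages and marking the second one as deleted, a replay in this model yields sequence numbers 1 and 3, which is the only kind of gap a processor should observe.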
 								.. _snapshots:
 								Snapshots
 								=========
 								Snapshots can dramatically reduce recovery times. Processors can save snapshots of internal state by calling the
 								``saveSnapshot`` method on ``Processor``. If saving of a snapshot succeeds, the processor will receive a
 								``SaveSnapshotSuccess`` message, otherwise a ``SaveSnapshotFailure`` message
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#save-snapshot
 								where ``metadata`` is of type ``SnapshotMetadata``:
 								.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/Snapshot.scala#snapshot-metadata
 								During recovery, the processor is offered a previously saved snapshot via a ``SnapshotOffer`` message from
 								which it can initialize internal state.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#snapshot-offer
 								The replayed messages that follow the ``SnapshotOffer`` message, if any, are younger than the offered snapshot.
 								They finally recover the processor to its current (i.e. latest) state.
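
The interplay of a snapshot and the messages replayed after it can be sketched as follows. This is plain Scala with hypothetical types (``Snapshot``, ``Message``, ``recover``), assuming the processor state is a running sum and each message adds a delta to it; it is not the akka-persistence API.

```scala
// Hypothetical model of snapshot-based recovery (not the akka-persistence API).
object RecoverySketch {
  // State captured after the message with the given sequence number was processed.
  final case class Snapshot(sequenceNr: Long, state: Int)
  final case class Message(sequenceNr: Long, delta: Int)

  // Start from the offered snapshot (if any), then apply only younger messages.
  def recover(snapshot: Option[Snapshot], journal: Seq[Message]): Int = {
    val start = snapshot.getOrElse(Snapshot(0L, 0))
    journal
      .filter(_.sequenceNr > start.sequenceNr)
      .foldLeft(start.state)((state, m) => state + m.delta)
  }
}
```

Recovering with or without the snapshot leads to the same final state; the snapshot only reduces the number of messages that have to be replayed.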
 								In general, a processor is only offered a snapshot if that processor has previously saved one or more snapshots
 								and at least one of these snapshots matches the ``SnapshotSelectionCriteria`` that can be specified for recovery.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#snapshot-criteria
 								If not specified, they default to ``SnapshotSelectionCriteria.Latest`` which selects the latest (= youngest) snapshot.
 								To disable snapshot-based recovery, applications should use ``SnapshotSelectionCriteria.None``. A recovery where no
 								saved snapshot matches the specified ``SnapshotSelectionCriteria`` will replay all journaled messages.
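
How selection criteria narrow down the offered snapshot can be modelled in a few lines. The following is a sketch in plain Scala, not the actual ``SnapshotSelectionCriteria`` class: ``Meta``, ``Criteria``, ``NoSnapshot`` and ``select`` are hypothetical names, and the criteria are assumed to be upper bounds on sequence number and timestamp.

```scala
// Hypothetical model of snapshot selection (not the akka-persistence API).
object SnapshotSelectionSketch {
  final case class Meta(sequenceNr: Long, timestamp: Long)

  // Assumed criteria: upper bounds on sequence number and timestamp.
  final case class Criteria(maxSequenceNr: Long = Long.MaxValue, maxTimestamp: Long = Long.MaxValue) {
    def matches(m: Meta): Boolean =
      m.sequenceNr <= maxSequenceNr && m.timestamp <= maxTimestamp
  }

  // No bounds: the youngest saved snapshot is selected.
  val Latest: Criteria = Criteria()
  // Bounds of zero: no snapshot can match, because sequence numbers start at 1.
  val NoSnapshot: Criteria = Criteria(0L, 0L)

  // Selects the youngest snapshot that matches the criteria, if there is one.
  def select(saved: Seq[Meta], criteria: Criteria): Option[Meta] = {
    val candidates = saved.filter(criteria.matches)
    if (candidates.isEmpty) None else Some(candidates.maxBy(_.sequenceNr))
  }
}
```

When ``select`` returns ``None`` in this model, recovery falls back to replaying all journaled messages.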
 								Storage plugins
 								===============
 								Storage backends for journals and snapshot stores are plugins in akka-persistence. The default journal plugin writes
 								messages to LevelDB. The default snapshot store plugin writes snapshots as individual files to the local filesystem.
 								Applications can provide their own plugins by implementing a plugin API and activate them by configuration. Plugin
 								development requires the following imports:
 								.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#plugin-imports
 								Journal plugin API
 								------------------
 								A journal plugin extends either ``SyncWriteJournal`` or ``AsyncWriteJournal``. ``SyncWriteJournal`` is an
 								actor that should be extended when the storage backend API only supports synchronous, blocking writes. The
 								methods to be implemented in this case are:
 								.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/journal/SyncWriteJournal.scala#journal-plugin-api
 								``AsyncWriteJournal`` is an actor that should be extended if the storage backend API supports asynchronous,
 								non-blocking writes. The methods to be implemented in that case are:
 								.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/journal/AsyncWriteJournal.scala#journal-plugin-api
 								Message replays are always asynchronous; therefore, any journal plugin must implement:
 								.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/journal/AsyncReplay.scala#journal-plugin-api
 								A journal plugin can be activated with the following minimal configuration:
 								.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#journal-plugin-config
 								The specified plugin ``class`` must have a no-arg constructor. The ``plugin-dispatcher`` is the dispatcher
 								used for the plugin actor. If not specified, it defaults to ``akka.persistence.dispatchers.default-plugin-dispatcher``
 								for ``SyncWriteJournal`` plugins and ``akka.actor.default-dispatcher`` for ``AsyncWriteJournal`` plugins.
 								Snapshot store plugin API
 								-------------------------
 								A snapshot store plugin must extend the ``SnapshotStore`` actor and implement the following methods:
 								.. includecode:: ../../../akka-persistence/src/main/scala/akka/persistence/snapshot/SnapshotStore.scala#snapshot-store-plugin-api
 								A snapshot store plugin can be activated with the following minimal configuration:
 								.. includecode:: code/docs/persistence/PersistencePluginDocSpec.scala#snapshot-store-plugin-config
 								The specified plugin ``class`` must have a no-arg constructor. The ``plugin-dispatcher`` is the dispatcher
 								used for the plugin actor. If not specified, it defaults to ``akka.persistence.dispatchers.default-plugin-dispatcher``.
 								Miscellaneous
 								=============
 								State machines
 								--------------
 								State machines can be persisted by mixing the ``FSM`` trait into processors.
 								.. includecode:: code/docs/persistence/PersistenceDocSpec.scala#fsm-example
 								Upcoming features
 								=================
 								* Reliable channels
 								* Custom serialization of messages and snapshots
 								* Extended deletion of messages and snapshots
 								* ...