Merge paradox/scala/io.md and java/io.md (#23168)
This commit is contained in:
parent
34fda05e20
commit
8528deb306
2 changed files with 12 additions and 111 deletions
|
|
@ -1,104 +0,0 @@
|
|||
# I/O
|
||||
|
||||
## Introduction
|
||||
|
||||
The `akka.io` package has been developed in collaboration between the Akka
|
||||
and [spray.io](http://spray.io) teams. Its design combines experiences from the
|
||||
`spray-io` module with improvements that were jointly developed for
|
||||
more general consumption as an actor-based service.
|
||||
|
||||
The guiding design goal for this I/O implementation was to reach extreme
|
||||
scalability, make no compromises in providing an API correctly matching the
|
||||
underlying transport mechanism and to be fully event-driven, non-blocking and
|
||||
asynchronous. The API is meant to be a solid foundation for the implementation
|
||||
of network protocols and building higher abstractions; it is not meant to be a
|
||||
full-service high-level NIO wrapper for end users.
|
||||
|
||||
## Terminology, Concepts
|
||||
|
||||
The I/O API is completely actor based, meaning that all operations are implemented with message passing instead of
|
||||
direct method calls. Every I/O driver (TCP, UDP) has a special actor, called a *manager* that serves
|
||||
as an entry point for the API. I/O is broken into several drivers. The manager for a particular driver
|
||||
is accessible by querying an `ActorSystem`. For example the following code
|
||||
looks up the TCP manager and returns its `ActorRef`:
|
||||
|
||||
@@snip [EchoManager.java]($code$/java/jdocs/io/japi/EchoManager.java) { #manager }
|
||||
|
||||
The manager receives I/O command messages and instantiates worker actors in response. The worker actors present
|
||||
themselves to the API user in the reply to the command that was sent. For example after a `Connect` command sent to
|
||||
the TCP manager the manager creates an actor representing the TCP connection. All operations related to the given TCP
|
||||
connections can be invoked by sending messages to the connection actor which announces itself by sending a `Connected`
|
||||
message.
|
||||
|
||||
### DeathWatch and Resource Management
|
||||
|
||||
I/O worker actors receive commands and also send out events. They usually need a user-side counterpart actor listening
|
||||
for these events (such events could be inbound connections, incoming bytes or acknowledgements for writes). These worker
|
||||
actors *watch* their listener counterparts. If the listener stops then the worker will automatically release any
|
||||
resources that it holds. This design makes the API more robust against resource leaks.
|
||||
|
||||
Thanks to the completely actor based approach of the I/O API the opposite direction works as well: a user actor
|
||||
responsible for handling a connection can watch the connection actor to be notified if it unexpectedly terminates.
|
||||
|
||||
### Write models (Ack, Nack)
|
||||
|
||||
I/O devices have a maximum throughput which limits the frequency and size of writes. When an
|
||||
application tries to push more data than a device can handle, the driver has to buffer bytes until the device
|
||||
is able to write them. With buffering it is possible to handle short bursts of intensive writes --- but no buffer is infinite.
|
||||
"Flow control" is needed to avoid overwhelming device buffers.
|
||||
|
||||
Akka supports two types of flow control:
|
||||
|
||||
* *Ack-based*, where the driver notifies the writer when writes have succeeded.
|
||||
* *Nack-based*, where the driver notifies the writer when writes have failed.
|
||||
|
||||
Each of these models is available in both the TCP and the UDP implementations of Akka I/O.
|
||||
|
||||
Individual writes can be acknowledged by providing an ack object in the write message (`Write` in the case of TCP and
|
||||
`Send` for UDP). When the write is complete the worker will send the ack object to the writing actor. This can be
|
||||
used to implement *ack-based* flow control; sending new data only when old data has been acknowledged.
|
||||
|
||||
If a write (or any other command) fails, the driver notifies the actor that sent the command with a special message
|
||||
(`CommandFailed` in the case of UDP and TCP). This message will also notify the writer of a failed write, serving as a
|
||||
nack for that write. Please note, that in a nack-based flow-control setting the writer has to be prepared for the fact
|
||||
that the failed write might not be the most recent write it sent. For example, the failure notification for a write
|
||||
`W1` might arrive after additional write commands `W2` and `W3` have been sent. If the writer wants to resend any
|
||||
nacked messages it may need to keep a buffer of pending messages.
|
||||
|
||||
@@@ warning
|
||||
|
||||
An acknowledged write does not mean acknowledged delivery or storage; receiving an ack for a write simply signals that
|
||||
the I/O driver has successfully processed the write. The Ack/Nack protocol described here is a means of flow control
|
||||
not error handling. In other words, data may still be lost, even if every write is acknowledged.
|
||||
|
||||
@@@
|
||||
|
||||
<a id="bytestring"></a>
|
||||
### ByteString
|
||||
|
||||
To maintain isolation, actors should communicate with immutable objects only. `ByteString` is an
|
||||
immutable container for bytes. It is used by Akka's I/O system as an efficient, immutable alternative
|
||||
the traditional byte containers used for I/O on the JVM, such as `byte[]` and `ByteBuffer`.
|
||||
|
||||
`ByteString` is a [rope-like](http://en.wikipedia.org/wiki/Rope_\(computer_science\)) data structure that is immutable
|
||||
and provides fast concatenation and slicing operations (perfect for I/O). When two `ByteString`s are concatenated
|
||||
together they are both stored within the resulting `ByteString` instead of copying both to a new array. Operations
|
||||
such as `drop` and `take` return `ByteString`s that still reference the original array, but just change the
|
||||
offset and length that is visible. Great care has also been taken to make sure that the internal array cannot be
|
||||
modified. Whenever a potentially unsafe array is used to create a new `ByteString` a defensive copy is created. If
|
||||
you require a `ByteString` that only blocks a much memory as necessary for it's content, use the `compact` method to
|
||||
get a `CompactByteString` instance. If the `ByteString` represented only a slice of the original array, this will
|
||||
result in copying all bytes in that slice.
|
||||
|
||||
`ByteString` inherits all methods from `IndexedSeq`, and it also has some new ones. For more information, look up the `akka.util.ByteString` class and it's companion object in the ScalaDoc.
|
||||
|
||||
`ByteString` also comes with its own optimized builder and iterator classes `ByteStringBuilder` and
|
||||
`ByteIterator` which provide extra features in addition to those of normal builders and iterators.
|
||||
|
||||
#### Compatibility with java.io
|
||||
|
||||
A `ByteStringBuilder` can be wrapped in a `java.io.OutputStream` via the `asOutputStream` method. Likewise, `ByteIterator` can we wrapped in a `java.io.InputStream` via `asInputStream`. Using these, `akka.io` applications can integrate legacy code based on `java.io` streams.
|
||||
|
||||
## Architecture in-depth
|
||||
|
||||
For further details on the design and internal architecture see @ref:[I/O Layer Design](common/io-layer.md).
|
||||
1
akka-docs/src/main/paradox/java/io.md
Symbolic link
1
akka-docs/src/main/paradox/java/io.md
Symbolic link
|
|
@ -0,0 +1 @@
|
|||
../scala/io.md
|
||||
|
|
@ -19,10 +19,14 @@ full-service high-level NIO wrapper for end users.
|
|||
The I/O API is completely actor based, meaning that all operations are implemented with message passing instead of
|
||||
direct method calls. Every I/O driver (TCP, UDP) has a special actor, called a *manager* that serves
|
||||
as an entry point for the API. I/O is broken into several drivers. The manager for a particular driver
|
||||
is accessible through the `IO` entry point. For example the following code
|
||||
is accessible @scala[through the `IO` entry point]@java[by querying an `ActorSystem`]. For example the following code
|
||||
looks up the TCP manager and returns its `ActorRef`:
|
||||
|
||||
@@snip [IODocSpec.scala]($code$/scala/docs/io/IODocSpec.scala) { #manager }
|
||||
Scala
|
||||
: @@snip [IODocSpec.scala]($code$/scala/docs/io/IODocSpec.scala) { #manager }
|
||||
|
||||
Java
|
||||
: @@snip [EchoManager.java]($code$/java/jdocs/io/japi/EchoManager.java) { #manager }
|
||||
|
||||
The manager receives I/O command messages and instantiates worker actors in response. The worker actors present
|
||||
themselves to the API user in the reply to the command that was sent. For example after a `Connect` command sent to
|
||||
|
|
@ -78,14 +82,14 @@ not error handling. In other words, data may still be lost, even if every write
|
|||
|
||||
To maintain isolation, actors should communicate with immutable objects only. `ByteString` is an
|
||||
immutable container for bytes. It is used by Akka's I/O system as an efficient, immutable alternative
|
||||
the traditional byte containers used for I/O on the JVM, such as `Array[Byte]` and `ByteBuffer`.
|
||||
the traditional byte containers used for I/O on the JVM, such as @scala[`Array[Byte]`]@java[`byte[]`] and `ByteBuffer`.
|
||||
|
||||
`ByteString` is a [rope-like](http://en.wikipedia.org/wiki/Rope_\(computer_science\)) data structure that is immutable
|
||||
and provides fast concatenation and slicing operations (perfect for I/O). When two `ByteString`s are concatenated
|
||||
together they are both stored within the resulting `ByteString` instead of copying both to a new `Array`. Operations
|
||||
such as `drop` and `take` return `ByteString`s that still reference the original `Array`, but just change the
|
||||
offset and length that is visible. Great care has also been taken to make sure that the internal `Array` cannot be
|
||||
modified. Whenever a potentially unsafe `Array` is used to create a new `ByteString` a defensive copy is created. If
|
||||
together they are both stored within the resulting `ByteString` instead of copying both to a new @scala[`Array`]@java[array]. Operations
|
||||
such as `drop` and `take` return `ByteString`s that still reference the original @scala[`Array`]@java[array], but just change the
|
||||
offset and length that is visible. Great care has also been taken to make sure that the internal @scala[`Array`]@java[array] cannot be
|
||||
modified. Whenever a potentially unsafe @scala[`Array`]@java[array] is used to create a new `ByteString` a defensive copy is created. If
|
||||
you require a `ByteString` that only blocks as much memory as necessary for it's content, use the `compact` method to
|
||||
get a `CompactByteString` instance. If the `ByteString` represented only a slice of the original array, this will
|
||||
result in copying all bytes in that slice.
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue