added spawn chapter to main example

This commit is contained in:
Carl Fredrik Samson
2020-02-04 01:31:02 +01:00
parent c1b548bdf5
commit 4af2fc03e6
16 changed files with 506 additions and 130 deletions


@@ -3,7 +3,7 @@
<head>
<!-- Book generated using mdBook -->
<meta charset="UTF-8">
<title>The main example - Futures Explained in 200 Lines of Rust</title>
<title>Futures - our main example - Futures Explained in 200 Lines of Rust</title>
<meta content="text/html; charset=utf-8" http-equiv="Content-Type">
@@ -78,7 +78,7 @@
<nav id="sidebar" class="sidebar" aria-label="Table of contents">
<div class="sidebar-scrollbox">
<ol class="chapter"><li class="affix"><a href="introduction.html">Introduction</a></li><li><a href="1_background_information.html"><strong aria-hidden="true">1.</strong> Some background information</a></li><li><a href="2_trait_objects.html"><strong aria-hidden="true">2.</strong> Trait objects and fat pointers</a></li><li><a href="3_generators_pin.html"><strong aria-hidden="true">3.</strong> Generators</a></li><li><a href="4_pin.html"><strong aria-hidden="true">4.</strong> Pin</a></li><li><a href="6_future_example.html" class="active"><strong aria-hidden="true">5.</strong> The main example</a></li><li><a href="8_finished_example.html"><strong aria-hidden="true">6.</strong> Finished example (editable)</a></li><li class="affix"><a href="conclusion.html">Conclusion and exercises</a></li></ol>
<ol class="chapter"><li class="affix"><a href="introduction.html">Introduction</a></li><li><a href="1_background_information.html"><strong aria-hidden="true">1.</strong> Some background information</a></li><li><a href="2_trait_objects.html"><strong aria-hidden="true">2.</strong> Trait objects and fat pointers</a></li><li><a href="3_generators_pin.html"><strong aria-hidden="true">3.</strong> Generators</a></li><li><a href="4_pin.html"><strong aria-hidden="true">4.</strong> Pin</a></li><li><a href="6_future_example.html" class="active"><strong aria-hidden="true">5.</strong> Futures - our main example</a></li><li><a href="8_finished_example.html"><strong aria-hidden="true">6.</strong> Finished example (editable)</a></li><li class="affix"><a href="conclusion.html">Conclusion and exercises</a></li></ol>
</div>
<div id="sidebar-resize-handle" class="sidebar-resize-handle"></div>
</nav>
@@ -154,9 +154,11 @@
executor which allows you to edit, run and play around with the code right here
in your browser.</p>
<p>I'll walk you through the example, but if you want to check it out closer, you
can always <a href="https://github.com/cfsamson/examples-futures">clone the repository</a> and play around with the code yourself. There
are two branches. The <code>basic_example</code> is this code, and the <code>basic_example_commented</code>
is this example with extensive comments.</p>
can always <a href="https://github.com/cfsamson/examples-futures">clone the repository</a> and play around with the code yourself.</p>
<p>There are several branches explained in the readme, but two are
relevant for this chapter. The <code>main</code> branch is the example we go through here,
and the <code>basic_example_commented</code> branch is this example with extensive
comments.</p>
<blockquote>
<p>If you want to follow along as we go through, initialize a new cargo project
by creating a new folder and running <code>cargo init</code> inside it. Everything we write
@@ -171,7 +173,7 @@ here will be in <code>main.rs</code></p>
};
</code></pre>
<h2><a class="header" href="#the-executor" id="the-executor">The Executor</a></h2>
<p>The executors task is to take one or more futures and run them to completion.</p>
<p>The executor's responsibility is to take one or more futures and run them to completion.</p>
<p>The first thing an <code>executor</code> does when it gets a <code>Future</code> is polling it.</p>
<p><strong>When polled one of three things can happen:</strong></p>
<ul>
@@ -214,7 +216,7 @@ fn block_on&lt;F: Future&gt;(mut future: F) -&gt; F::Output {
<p>In all the examples here I've chosen to comment the code extensively. I find it
easier to follow that way than dividing it up into many paragraphs.</p>
<p>We'll see more about the <code>Waker</code> in the next paragraph, but just look at it like
a <em>trait object</em> like the one we constructed in the first chapter.</p>
a <em>trait object</em> similar to the one we constructed in the first chapter.</p>
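<p>If you're curious what that looks like in practice, here is a minimal sketch
(not the waker we use in this example) of a <code>Waker</code> that does nothing, built from
a data pointer and a hand-written vtable:</p>
<pre><code class="language-rust noplaypen ignore">use std::task::{RawWaker, RawWakerVTable, Waker};

// A do-nothing `Waker`, only meant to show that a `Waker` is just a data
// pointer plus a manually constructed vtable, much like the trait object
// we built ourselves earlier.
fn dummy_raw_waker() -&gt; RawWaker {
    fn no_op(_: *const ()) {}
    fn clone(_: *const ()) -&gt; RawWaker { dummy_raw_waker() }

    let vtable = &amp;RawWakerVTable::new(clone, no_op, no_op, no_op);
    RawWaker::new(std::ptr::null::&lt;()&gt;(), vtable)
}

fn dummy_waker() -&gt; Waker {
    // Safety: the functions in our vtable never touch the data pointer,
    // so the `RawWaker` contract holds trivially.
    unsafe { Waker::from_raw(dummy_raw_waker()) }
}
</code></pre>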
<blockquote>
<p><code>Context</code> is just a wrapper around the <code>Waker</code>. At the time of writing this
book it's nothing more. In the future it might be possible that the <code>Context</code>
@@ -226,13 +228,14 @@ abstraction gives some flexibility.</p>
be rather easy to understand. <code>Future</code> is a state machine, every <code>await</code> point
is a <code>yield</code> point. If we borrow data across <code>await</code> points we meet the
exact same challenges as we do when borrowing across <code>yield</code> points.</p>
<p>As we explained in that chapter, we use <code>Pin</code> and the guarantees that give us to
allow <code>Futures</code> to have self references.</p>
<p>As we explained in the <a href="./3_generators_pin.html">chapter about generators</a>, we use
<code>Pin</code> and the guarantees it gives us to allow <code>Futures</code> to have
self-references.</p>
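<p>This also shows up in the signature of <code>poll</code>. For reference, the <code>Future</code> trait
is defined in the standard library roughly like this (attributes omitted):</p>
<pre><code class="language-rust noplaypen ignore">pub trait Future {
    // The type of value this future resolves to once it's done
    type Output;

    // `poll` either returns `Poll::Ready(Self::Output)` or `Poll::Pending`
    fn poll(self: Pin&lt;&amp;mut Self&gt;, cx: &amp;mut Context&lt;'_&gt;) -&gt; Poll&lt;Self::Output&gt;;
}
</code></pre>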
<h2><a class="header" href="#the-future-implementation" id="the-future-implementation">The <code>Future</code> implementation</a></h2>
<p>In Rust we call an interruptible task a <code>Future</code>. Futures have a well-defined interface, which means they can be used across the entire ecosystem. We can chain
these <code>Futures</code> so that once a &quot;leaf future&quot; is ready we'll perform a set of
operations.</p>
<p>These operations can spawn new leaf futures themselves.</p>
<p>These chained operations can spawn new leaf futures themselves.</p>
<p><strong>Our Future implementation looks like this:</strong></p>
<pre><code class="language-rust noplaypen ignore">// This is the definition of our `Waker`. We use a regular thread-handle here.
// It works but it's not a good solution. It's easy to fix though, I'll explain
@@ -256,7 +259,7 @@ pub struct Task {
}
// These are function definitions we'll use for our waker. Remember the
// &quot;Trait Objects&quot; chapter from the book.
// &quot;Trait Objects&quot; chapter earlier.
fn mywaker_wake(s: &amp;MyWaker) {
let waker_ptr: *const MyWaker = s;
let waker_arc = unsafe {Arc::from_raw(waker_ptr)};
@@ -304,8 +307,8 @@ impl Task {
// This is our `Future` implementation
impl Future for Task {
// The output for this kind of `leaf future` is just an `usize`. For other
// futures this could be something more interesting like a bytearray.
// The output for our kind of `leaf future` is just a `usize`. For other
// futures this could be something more interesting like a byte array.
type Output = usize;
fn poll(mut self: Pin&lt;&amp;mut Self&gt;, cx: &amp;mut Context&lt;'_&gt;) -&gt; Poll&lt;Self::Output&gt; {
let mut r = self.reactor.lock().unwrap();
@@ -313,7 +316,7 @@ impl Future for Task {
// i.e. if it's `Ready`
if r.is_ready(self.id) {
// if it is, we return the data. In this case it's just the ID of
// the task.
// the task since this is just a very simple example.
Poll::Ready(self.id)
} else if self.is_registered {
// If the future is registered already, we just return `Pending`
@@ -345,7 +348,7 @@ to allow such cases. </p>
trouble of creating our own <code>vtable</code> and a <code>RawWaker</code>. We could just implement
a normal trait.</p>
<p>Fortunately, in the future this will probably be possible in the standard
library as well. For now, <a href="https://rust-lang-nursery.github.io/futures-api-docs/0.3.0-alpha.13/futures/task/trait.ArcWake.html">this trait lives in the nursery</a>, but mye
library as well. For now, <a href="https://rust-lang-nursery.github.io/futures-api-docs/0.3.0-alpha.13/futures/task/trait.ArcWake.html">this trait lives in the nursery</a>, but my
guess is that this will be part of the standard library after some maturing.</p>
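<p>Just to illustrate the idea, such a trait could look roughly like the sketch
below. This is a simplified, hypothetical version and not the exact trait from
the nursery:</p>
<pre><code class="language-rust noplaypen ignore">use std::sync::Arc;

// A simplified sketch of an `ArcWake`-style trait. Implementing a normal
// trait like this on our waker type would let a library build the vtable
// for us, instead of us writing the `RawWaker` plumbing by hand.
trait ArcWake {
    // The only method we'd have to provide ourselves
    fn wake_by_ref(arc_self: &amp;Arc&lt;Self&gt;);

    // A convenience method with a default implementation
    fn wake(self: Arc&lt;Self&gt;) {
        Self::wake_by_ref(&amp;self)
    }
}
</code></pre>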
<p>We choose to pass in a reference to the whole <code>Reactor</code> here. This isn't normal.
The reactor will often be a global resource which lets us register interests
@@ -353,7 +356,8 @@ without passing around a reference.</p>
<h3><a class="header" href="#why-using-thread-parkunpark-is-a-bad-idea-for-a-library" id="why-using-thread-parkunpark-is-a-bad-idea-for-a-library">Why using thread park/unpark is a bad idea for a library</a></h3>
<p>It could deadlock easily since anyone could get a handle to the <code>executor thread</code>
and call park/unpark on it.</p>
<p>If one of our <code>Futures</code> holds a handle to our thread and takes it with it to a different thread the following could happen:</p>
<p>If one of our <code>Futures</code> holds a handle to our thread, or any unrelated code
calls <code>unpark</code> on our thread, the following could happen:</p>
<ol>
<li>A future could call <code>unpark</code> on the executor thread from a different thread</li>
<li>Our <code>executor</code> thinks that data is ready and wakes up and polls the future</li>
@@ -365,9 +369,14 @@ run in parallel.</li>
awake already at that point.</li>
<li>We're deadlocked and our program stops working</li>
</ol>
<blockquote>
<p>There is also the chance that our thread gets what's called a
<code>spurious wakeup</code> (<a href="https://cfsamson.github.io/book-exploring-async-basics/9_3_http_module.html#bonus-section">which can happen unexpectedly</a>), which
could cause the same deadlock if we're unlucky.</p>
</blockquote>
<p>There are many better solutions; here are a few (a small sketch of the first one follows the list):</p>
<ul>
<li>Use <code>std::sync::CondVar</code></li>
<li>Use <a href="https://doc.rust-lang.org/stable/std/sync/struct.Condvar.html">std::sync::Condvar</a></li>
<li>Use <a href="https://docs.rs/crossbeam/0.7.3/crossbeam/sync/struct.Parker.html">crossbeam::sync::Parker</a></li>
</ul>
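<p>Here is a minimal sketch (my own, not part of the example code) of what the
first option could look like: a tiny &quot;parker&quot; built on <code>Condvar</code>. Only code
holding a reference to this exact parker can wake the blocked thread, and the
flag guards against spurious wakeups:</p>
<pre><code class="language-rust noplaypen ignore">use std::sync::{Condvar, Mutex};

// A tiny parker built from a Mutex&lt;bool&gt; and a Condvar. Unlike
// `thread::park/unpark`, nothing outside our executor can reach this.
#[derive(Default)]
struct Parker(Mutex&lt;bool&gt;, Condvar);

impl Parker {
    fn park(&amp;self) {
        let mut ready = self.0.lock().unwrap();
        // Keep sleeping until `unpark` has actually set the flag,
        // which protects us against spurious wakeups.
        while !*ready {
            ready = self.1.wait(ready).unwrap();
        }
        *ready = false;
    }

    fn unpark(&amp;self) {
        *self.0.lock().unwrap() = true;
        self.1.notify_one();
    }
}
</code></pre>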
<h2><a class="header" href="#the-reactor" id="the-reactor">The Reactor</a></h2>
@@ -376,7 +385,7 @@ to have an example to run.</p>
<p>Since concurrency mostly makes sense when interacting with the outside world (or
at least some peripheral), we need something to actually abstract over this
interaction in an asynchronous way. </p>
<p>This is the <code>Reactors</code> job. Most often you'll see reactors in rust use a library called <a href="https://github.com/tokio-rs/mio">Mio</a>, which provides non
<p>This is the <code>Reactor</code>'s job. Most often you'll see reactors in Rust use a library called <a href="https://github.com/tokio-rs/mio">Mio</a>, which provides non
blocking APIs and event notification for several platforms.</p>
<p>The reactor will typically give you something like a <code>TcpStream</code> (or any other resource) which you'll use to create an I/O request. What you get in return
is a <code>Future</code>. </p>
@@ -385,7 +394,7 @@ is a <code>Future</code>. </p>
is pretty normal), our <code>Task</code> would instead be a special <code>TcpStream</code> which
registers interest with the global <code>Reactor</code> and no reference is needed.</p>
</blockquote>
<p>We can call this kind of <code>Future</code> a &quot;leaf Future`, since it's some operation
<p>We can call this kind of <code>Future</code> a &quot;leaf Future&quot;, since it's an operation
we'll actually wait on, and one we can chain further operations on that are
performed once the leaf future is ready. </p>
<p><strong>Our Reactor will look like this:</strong></p>
@@ -401,8 +410,8 @@ struct Reactor {
readylist: Arc&lt;Mutex&lt;Vec&lt;usize&gt;&gt;&gt;,
}
// We just have two kind of events. A timeout event, a &quot;timeout&quot; event called
// `Timeout` and a `Close` event to close down our reactor.
// We just have two kinds of events: an event called `Timeout`
// and a `Close` event to close down our reactor.
#[derive(Debug)]
enum Event {
Close,
@@ -422,7 +431,6 @@ impl Reactor {
let mut handles = vec![];
// This will be the &quot;Reactor thread&quot;
let handle = thread::spawn(move || {
// This simulates some I/O resource
for event in rx {
let rl_clone = rl_clone.clone();
match event {
@@ -430,9 +438,10 @@ impl Reactor {
Event::Close =&gt; break,
Event::Timeout(waker, duration, id) =&gt; {
// When we get an event we simply spawn a new thread...
// When we get an event we simply spawn a new thread
// which will simulate some I/O resource...
let event_handle = thread::spawn(move || {
//... which will just sleep for the number of seconds
//... by sleeping for the number of seconds
// we provided when creating the `Task`.
thread::sleep(Duration::from_secs(duration));
// When it's done sleeping we put the ID of this task
@@ -450,7 +459,7 @@ impl Reactor {
// When we exit the Reactor we first join all the handles on
// the child threads we've spawned so we catch any panics and
// release all resources.
// release any resources.
for handle in handles {
handle.join().unwrap();
}
@@ -805,6 +814,103 @@ fn main() {
# }
# }
</code></pre></pre>
<p>This is the first time we actually see the <code>async/await</code> syntax, so let's
finish this book by explaining it briefly.</p>
<p>Hopefully, the <code>await</code> syntax looks pretty familiar. It has a lot in common
with <code>yield</code> and indeed, it works in much the same way.</p>
<p>The <code>async</code> keyword can be used on functions as in <code>async fn(...)</code> or on a
block as in <code>async { ... }</code>. Both will turn your function, or block, into a
<code>Future</code>.</p>
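<p>As a quick illustration (these two functions are not part of our example), the
following declarations are roughly equivalent:</p>
<pre><code class="language-rust noplaypen ignore">use std::future::Future;

// An `async` function...
async fn one() -&gt; usize {
    1
}

// ...is roughly the same as a normal function returning a `Future`
// created from an `async` block.
fn one_desugared() -&gt; impl Future&lt;Output = usize&gt; {
    async { 1 }
}
</code></pre>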
<p>These <code>Futures</code> are rather simple. Imagine our generator from a few chapters
back. Every <code>await</code> point is like a <code>yield</code> point.</p>
<p>Instead of <code>yielding</code> a value we pass in, it yields the <code>Future</code> we're awaiting.
In turn this <code>Future</code> is polled. </p>
<p>Now, as in our code, <code>mainfut</code> contains two non-leaf futures which it awaits.
When <code>mainfut</code> is polled, these state machines are polled in turn until some
&quot;leaf future&quot; is finally polled and returns either <code>Ready</code> or <code>Pending</code>.</p>
<p>The way our example is right now, it's not much better than regular synchronous
code. For us to actually await multiple futures at the same time, we need to
<code>spawn</code> them somehow, so they're polled once to get started but don't cause our
thread to sleep and wait for them one after the other.</p>
<p>Our example as it stands now returns this:</p>
<pre><code>Future got 1 at time: 1.00.
Future got 2 at time: 3.00.
</code></pre>
<p>If these <code>Futures</code> were executed asynchronously we would expect to see:</p>
<pre><code>Future got 1 at time: 1.00.
Future got 2 at time: 2.00.
</code></pre>
<p>To accomplish this, here is the simplest possible <code>spawn</code> function I could
come up with:</p>
<pre><code class="language-rust ignore noplaypen">fn spawn&lt;F: Future&gt;(future: F) -&gt; Pin&lt;Box&lt;F&gt;&gt; {
// We start off the same way as we did before
let mywaker = Arc::new(MyWaker{ thread: thread::current() });
let waker = waker_into_waker(Arc::into_raw(mywaker));
let mut cx = Context::from_waker(&amp;waker);
// But we need to Box this Future. We can't pin it to this stack frame
// since we'll return before the `Future` is resolved so it must be pinned
// to the heap.
let mut boxed = Box::pin(future);
// Now we poll and just discard the result. This way, we register a `Waker`
// with our `Reactor` and kick off whatever operation we're expecting.
let _ = Future::poll(boxed.as_mut(), &amp;mut cx);
// We still need this `Future` since we'll await it later so we return it...
boxed
}
</code></pre>
<p>Now, let's change our code in <code>main</code> to look like this instead:</p>
<pre><pre class="playpen"><code class="language-rust">fn main() {
let start = Instant::now();
let reactor = Reactor::new();
let reactor = Arc::new(Mutex::new(reactor));
let future1 = Task::new(reactor.clone(), 1, 1);
let future2 = Task::new(reactor.clone(), 2, 2);
let fut1 = async {
let val = future1.await;
let dur = (Instant::now() - start).as_secs_f32();
println!(&quot;Future got {} at time: {:.2}.&quot;, val, dur);
};
let fut2 = async {
let val = future2.await;
let dur = (Instant::now() - start).as_secs_f32();
println!(&quot;Future got {} at time: {:.2}.&quot;, val, dur);
};
// You'll notice everything stays the same until this point
let mainfut = async {
// Here we &quot;kick off&quot; our first `Future`
let handle1 = spawn(fut1);
// And the second one
let handle2 = spawn(fut2);
// Now they're already started. When they get polled in our executor
// they will just return `Pending`, or, if we somehow used so much time
// that they're already resolved, they will return `Ready`.
handle1.await;
handle2.await;
};
block_on(mainfut);
reactor.lock().map(|mut r| r.close()).unwrap();
}
</code></pre></pre>
<p>If you add this code to our example and run it, you'll see:</p>
<pre><code>Future got 1 at time: 1.00.
Future got 2 at time: 2.00.
</code></pre>
<p>Exactly as we expected.</p>
<p>Now this <code>spawn</code> method is not very sophisticated but it explains the concept.
I've <a href="./conclusion.html#building-a-better-exectuor">challenged you to create a better version</a> and pointed you at a better resource
in the next chapter under <a href="./conclusion.html#reader-exercises">reader exercises</a>.</p>
<p>That's actually it for now. There is probably much more to learn, but I think
further exploration will get a lot easier once these fundamental concepts are in
place. </p>
<p>Don't forget the exercises in the last chapter 😊. Have fun until next time!</p>
</main>