thr3ads.net - Libguestfs - [Libguestfs] [libnbd PATCH 0/2] Better handling of failed block

If this information is useful, please help other people find it:
Share via:

Eric Blake

2019-Jun-04 17:12 UTC

[Libguestfs] [libnbd PATCH 0/2] Better handling of failed block_status callback

Rather than moving the connection to DEAD, we can just ignore further
contexts to the existing command handle, and fail the overall command
with the same errno as the failed callback.

Eric Blake (2):
  states: Track cmd->error as errno, not wire value
  api: Recover from block status callback failure

 generator/generator                 |  5 ++-
 generator/states-reply-simple.c     |  2 +-
 generator/states-reply-structured.c | 61 ++++++++++++++-------------
 interop/dirty-bitmap.c              | 65 +++++++++++++++++++++++------
 lib/aio.c                           |  1 -
 lib/internal.h                      |  2 +-
 6 files changed, 91 insertions(+), 45 deletions(-)

-- 
2.20.1

Eric Blake

2019-Jun-04 17:12 UTC

head link

[Libguestfs] [libnbd PATCH 1/2] states: Track cmd->error as errno, not wire value

We want to allow the block_status callback to be able to report a full
range of errno values, rather than the limited set of NBD_E* defined
in the protocol. To do so, we need to convert from wire error values
back into local errno values at a sooner point in the state machine.

This also changes a structured reply that includes more than one error
designation (theoretically possible at least for structured reads that
uses NBD_REPLY_TYPE_ERROR_OFFSET more than once to NBD_CMD_READ - even
if no existing NBD server actually does that) to preserve the first
error encountered, as that is often more interesting than any
follow-on errors.
---
 generator/states-reply-simple.c     | 2 +-
 generator/states-reply-structured.c | 4 +++-
 lib/aio.c                           | 1 -
 lib/internal.h                      | 2 +-
 4 files changed, 5 insertions(+), 4 deletions(-)

diff --git a/generator/states-reply-simple.c b/generator/states-reply-simple.c
index e170482..7e5340c 100644
--- a/generator/states-reply-simple.c
+++ b/generator/states-reply-simple.c
@@ -45,7 +45,7 @@
     return 0;
   }

-  cmd->error = error;
+  cmd->error = nbd_internal_errno_of_nbd_error (error);
   if (cmd->error == 0 && cmd->type == NBD_CMD_READ) {
     h->rbuf = cmd->data;
     h->rlen = cmd->count;
diff --git a/generator/states-reply-structured.c
b/generator/states-reply-structured.c
index c835713..c7ba892 100644
--- a/generator/states-reply-structured.c
+++ b/generator/states-reply-structured.c
@@ -179,7 +179,9 @@
     }
     assert (cmd); /* guaranteed by CHECK */

-    cmd->error = error;
+    /* Preserve first error encountered */
+    if (cmd->error == 0)
+      cmd->error = nbd_internal_errno_of_nbd_error (error);

     if (flags & NBD_REPLY_FLAG_DONE)
       SET_NEXT_STATE (%^FINISH_COMMAND);
diff --git a/lib/aio.c b/lib/aio.c
index a129af2..8361854 100644
--- a/lib/aio.c
+++ b/lib/aio.c
@@ -165,7 +165,6 @@ nbd_unlocked_aio_command_completed (struct nbd_handle *h,
     return 1;

   /* The command failed, set an error indication and return an error. */
-  error = nbd_internal_errno_of_nbd_error (error);
   set_error (error, "%s: command failed: %s",
              nbd_internal_name_of_nbd_cmd (type), strerror (error));
   return -1;
diff --git a/lib/internal.h b/lib/internal.h
index c1a57ac..45fc6df 100644
--- a/lib/internal.h
+++ b/lib/internal.h
@@ -223,7 +223,7 @@ struct command_in_flight {
   uint32_t count;
   void *data; /* Buffer for read/write, opaque for block status */
   extent_fn extent_fn;
-  uint32_t error;
+  uint32_t error; /* Local errno value */
 };

 /* crypto.c */
-- 
2.20.1

Eric Blake

2019-Jun-04 17:12 UTC

head link

[Libguestfs] [libnbd PATCH 2/2] api: Recover from block status callback failure

If the user's extent_fn callback fails, we want to fail the overall
block status command with the same errno, but not give up on the
server.  Do this by skipping any further context chunks from the
server, and relying on the previous patch's ability to preserve the
first error encountered.

Update the interop test with qemu's dirty bitmap context to provoke
callback failure, as well as prove that the connection is still valid.

I did not go as far as documenting EPROTO as our default errno if the
callback returns -1 with errno unchanged.
---
 generator/generator                 |  5 ++-
 generator/states-reply-structured.c | 57 +++++++++++++------------
 interop/dirty-bitmap.c              | 65 +++++++++++++++++++++++------
 3 files changed, 86 insertions(+), 41 deletions(-)

diff --git a/generator/generator b/generator/generator
index 91545ce..1261d54 100755
--- a/generator/generator
+++ b/generator/generator
@@ -1414,7 +1414,10 @@ supported by the server (see
C<nbd_can_meta_context>) this call
 returns information about extents by calling back to the
 extent function.  The callback cannot call C<nbd_*> APIs on the
 same handle since it holds the handle lock and will
-cause a deadlock.
+cause a deadlock.  If the callback returns C<-1>, any remaining
+contexts will be ignored and the overall block status command
+will fail with the same value of C<errno> left after the
+failing callback.

 The extent function is called once per type of metadata
 available.  The C<metacontext> parameter is a string
diff --git a/generator/states-reply-structured.c
b/generator/states-reply-structured.c
index c7ba892..5791360 100644
--- a/generator/states-reply-structured.c
+++ b/generator/states-reply-structured.c
@@ -357,34 +357,37 @@
     assert (h->bs_entries);
     assert (length >= 12);

-    /* Need to byte-swap the entries returned, but apart from that we
-     * don't validate them.
-     */
-    for (i = 0; i < length/4; ++i)
-      h->bs_entries[i] = be32toh (h->bs_entries[i]);
-
-    /* Look up the context ID. */
-    context_id = h->bs_entries[0];
-    for (meta_context = h->meta_contexts;
-         meta_context;
-         meta_context = meta_context->next)
-      if (context_id == meta_context->context_id)
-        break;
-
-    if (meta_context) {
-      /* Call the caller's extent function. */
-      if (cmd->extent_fn (cmd->data, meta_context->name,
cmd->offset,
-                          &h->bs_entries[1], (length-4) / 4) == -1) {
-        SET_NEXT_STATE (%.DEAD); /* XXX We should be able to recover. */
-        if (errno == 0) errno = EPROTO;
-        set_error (errno, "extent function failed");
-        return -1;
-      }
-    }
-    else
+    if (cmd->error)
       /* Emit a debug message, but ignore it. */
-      debug (h, "server sent unexpected meta context ID %" PRIu32,
-             context_id);
+      debug (h, "ignoring meta context ID %" PRIu32 " after
callback failure",
+             be32toh (h->bs_entries[0]));
+    else {
+      /* Need to byte-swap the entries returned, but apart from that we
+       * don't validate them.
+       */
+      for (i = 0; i < length/4; ++i)
+        h->bs_entries[i] = be32toh (h->bs_entries[i]);
+
+      /* Look up the context ID. */
+      context_id = h->bs_entries[0];
+      for (meta_context = h->meta_contexts;
+           meta_context;
+           meta_context = meta_context->next)
+        if (context_id == meta_context->context_id)
+          break;
+
+      if (meta_context) {
+        /* Call the caller's extent function. */
+        errno = 0;
+        if (cmd->extent_fn (cmd->data, meta_context->name,
cmd->offset,
+                            &h->bs_entries[1], (length-4) / 4) == -1)
+          cmd->error = errno ? errno : EPROTO;
+      }
+      else
+        /* Emit a debug message, but ignore it. */
+        debug (h, "server sent unexpected meta context ID %" PRIu32,
+               context_id);
+    }

     if (flags & NBD_REPLY_FLAG_DONE)
       SET_NEXT_STATE (%^FINISH_COMMAND);
diff --git a/interop/dirty-bitmap.c b/interop/dirty-bitmap.c
index 8d34173..59be0c8 100644
--- a/interop/dirty-bitmap.c
+++ b/interop/dirty-bitmap.c
@@ -23,6 +23,8 @@
 #include <string.h>
 #include <unistd.h>
 #include <assert.h>
+#include <stdbool.h>
+#include <errno.h>

 #include <libnbd.h>

@@ -30,21 +32,31 @@ static const char *unixsocket;
 static const char *bitmap;
 static const char *base_allocation = "base:allocation";

-static int calls; /* Track which contexts passed through callback */
+struct data {
+  bool req_one;    /* input: true if req_one was passed to request */
+  int count;       /* input: count of expected remaining calls */
+  bool fail;       /* input: true to return failure */
+  bool seen_base;  /* output: true if base:allocation encountered */
+  bool seen_dirty; /* output: true if qemu:dirty-bitmap encountered */
+};

 static int
-cb (void *data, const char *metacontext, uint64_t offset,
+cb (void *opaque, const char *metacontext, uint64_t offset,
     uint32_t *entries, size_t len)
 {
-  /* Hack: data is non-NULL if request included REQ_ONE flag */
+  struct data *data = opaque;
+
   assert (offset == 0);
+  assert (data->count-- > 0);
+
   if (strcmp (metacontext, base_allocation) == 0) {
-    calls += 0x1;
-    assert (len == (data ? 2 : 8));
+    assert (!data->seen_base);
+    data->seen_base = true;
+    assert (len == (data->req_one ? 2 : 8));

     /* Data block offset 0 size 128k */
     assert (entries[0] == 131072); assert (entries[1] == 0);
-    if (!data) {
+    if (!data->req_one) {
       /* hole|zero offset 128k size 384k */
       assert (entries[2] == 393216); assert (entries[3] == 3);
       /* allocated zero offset 512k size 64k */
@@ -54,11 +66,12 @@ cb (void *data, const char *metacontext, uint64_t offset,
     }
   }
   else if (strcmp (metacontext, bitmap) == 0) {
-    calls += 0x10;
-    assert (len == (data ? 2 : 10));
+    assert (!data->seen_dirty);
+    data->seen_dirty = true;
+    assert (len == (data->req_one ? 2 : 10));

     assert (entries[0] ==  65536); assert (entries[1] == 0);
-    if (!data) {
+    if (!data->req_one) {
       /* dirty block offset 64K size 64K */
       assert (entries[2] ==  65536); assert (entries[3] == 1);
       assert (entries[4] == 393216); assert (entries[5] == 0);
@@ -72,6 +85,10 @@ cb (void *data, const char *metacontext, uint64_t offset,
     exit (EXIT_FAILURE);
   }

+  if (data->fail) {
+    errno = EPROTO; /* Something NBD servers can't send */
+    return -1;
+  }
   return 0;
 }

@@ -80,6 +97,8 @@ main (int argc, char *argv[])
 {
   struct nbd_handle *nbd;
   int64_t exportsize;
+  struct data data;
+  char c;

   if (argc != 3) {
     fprintf (stderr, "%s unixsocket bitmap\n", argv[0]);
@@ -108,17 +127,37 @@ main (int argc, char *argv[])
     exit (EXIT_FAILURE);
   }

-  if (nbd_block_status (nbd, exportsize, 0, NULL, cb, 0) == -1) {
+  data = (struct data) { .count = 2, };
+  if (nbd_block_status (nbd, exportsize, 0, &data, cb, 0) == -1) {
     fprintf (stderr, "%s\n", nbd_get_error ());
     exit (EXIT_FAILURE);
   }
-  assert (calls == 0x11);
-  if (nbd_block_status (nbd, exportsize, 0, &exportsize, cb,
+  assert (data.seen_base && data.seen_dirty);
+
+  data = (struct data) { .req_one = true, .count = 2, };
+  if (nbd_block_status (nbd, exportsize, 0, &data, cb,
                         LIBNBD_CMD_FLAG_REQ_ONE) == -1) {
     fprintf (stderr, "%s\n", nbd_get_error ());
     exit (EXIT_FAILURE);
   }
-  assert (calls == 0x22);
+  assert (data.seen_base && data.seen_dirty);
+
+  /* Trigger a failed callback, to prove connection stays up. No way
+   * to know which context will respond first, but we can check that
+   * the other one did not get visited.
+   */
+  data = (struct data) { .count = 1, .fail = true, };
+  if (nbd_block_status (nbd, exportsize, 0, &data, cb, 0) != -1) {
+    fprintf (stderr, "unexpected block status success\n");
+    exit (EXIT_FAILURE);
+  }
+  assert (nbd_get_errno () == EPROTO && nbd_aio_is_ready (nbd));
+  assert (data.seen_base ^ data.seen_dirty);
+
+  if (nbd_pread (nbd, &c, 1, 0, 0) == -1) {
+    fprintf (stderr, "%s\n", nbd_get_error ());
+    exit (EXIT_FAILURE);
+  }

   if (nbd_shutdown (nbd) == -1) {
     fprintf (stderr, "%s\n", nbd_get_error ());
-- 
2.20.1

Richard W.M. Jones

2019-Jun-05 12:29 UTC

head link

Re: [Libguestfs] [libnbd PATCH 2/2] api: Recover from block status callback failure

Series looks good, ACK, thanks.

Rich.

-- 
Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
Read my programming and virtualization blog: http://rwmj.wordpress.com
virt-df lists disk usage of guests without needing to install any
software inside the virtual machine.  Supports Linux and Windows.
http://people.redhat.com/~rjones/virt-df/

Reasonably Related Threads

Search for more possibly parallel threads

Libguestfs - Jun 2019 - [libnbd PATCH 0/2] Better handling of failed block_status callback

[Libguestfs] [libnbd PATCH 0/2] Better handling of failed block_status callback

[Libguestfs] [libnbd PATCH 1/2] states: Track cmd->error as errno, not wire value

[Libguestfs] [libnbd PATCH 2/2] api: Recover from block status callback failure

Re: [Libguestfs] [libnbd PATCH 2/2] api: Recover from block status callback failure

Reasonably Related Threads