Nir Soffer
2020-Aug-06  22:54 UTC
[Libguestfs] [PATCH nbdkit] plugins: python: Add imageio plugin example
This is mainly for testing the new parallel python threading model, but
it also an example how to manage multiple connection from a plugin.
I tested this with local imageio server, serving qcow2 image on local
SSD.
Start imageio server from imageio source:
    ./ovirt-imageio -c test
Create test disk:
    qemu-img create -f qcow2 /var/tmp/disk.qcow2 6g
Add ticket to accessing the image, using nbd example ticket:
    curl --unix-socket ../daemon/test/daemon.sock \
        --upload-file examples/nbd.json http://localhost/ticket/nbd
Start qemu-nbd, serving the image for imageio:
    qemu-nbd --socket=/tmp/nbd.sock --persistent --shared=8 --format=qcow2 \
        --aio=native --cache=none --discard=unmap  /var/tmp/disk.qcow2
Start nbdkit with this plugin:
    ./nbdkit -U nbd.sock -t4 -f python ./plugins/python/examples/imageio.py \
        transfer_url=https://localhost:54322/images/nbd connections=4 secure=no
Finally, upload the image using qemu-img:
    time qemu-img convert -n -f raw -O raw -W /var/tmp/fedora-32.raw \
        nbd+unix:///?socket=./nbd.sock
I tested with 1 and 4 threads/connections, creating new empty qcow2
image before each test.
1 connections, 4 threads:
real	0m7.885s
user	0m0.663s
sys	0m0.803s
4 connections, 4 threads:
real	0m3.336s
user	0m0.439s
sys	0m0.651s
This is what we see on imageio side:
1 connection:
[connection 1 ops, 7.866482 s]
[dispatch 2630 ops, 6.488580 s]
[extents 1 ops, 0.002326 s]
[zero 1176 ops, 0.661475 s, 4.73 GiB, 7.15 GiB/s]
[write 1451 ops, 5.475842 s, 1.27 GiB, 237.08 MiB/s]
[flush 2 ops, 0.029208 s]
4 connections:
[connection 1 ops, 3.289038 s]
[dispatch 670 ops, 2.679317 s]
[extents 1 ops, 0.010870 s]
[write 383 ops, 2.172633 s, 333.70 MiB, 153.59 MiB/s]
[zero 286 ops, 0.346506 s, 1.29 GiB, 3.72 GiB/s]
[connection 1 ops, 3.303300 s]
[dispatch 632 ops, 2.711896 s]
[zero 273 ops, 0.380406 s, 1.12 GiB, 2.93 GiB/s]
[extents 1 ops, 0.000485 s]
[write 358 ops, 2.182803 s, 310.67 MiB, 142.33 MiB/s]
[connection 1 ops, 3.318177 s]
[dispatch 669 ops, 2.759531 s]
[extents 1 ops, 0.064217 s]
[write 354 ops, 2.067320 s, 336.70 MiB, 162.87 MiB/s]
[zero 313 ops, 0.470069 s, 1.20 GiB, 2.55 GiB/s]
[flush 1 ops, 0.002421 s]
[connection 1 ops, 3.280020 s]
[dispatch 662 ops, 2.685547 s]
[zero 304 ops, 0.431782 s, 1.13 GiB, 2.62 GiB/s]
[extents 1 ops, 0.000424 s]
[write 356 ops, 2.101127 s, 317.17 MiB, 150.95 MiB/s]
[flush 1 ops, 0.000127 s]
Results are not very stable, but the trend is clear. We can use this
to optimize the virt-v2v.
Signed-off-by: Nir Soffer <nsoffer@redhat.com>
---
 plugins/python/examples/imageio.py | 167 +++++++++++++++++++++++++++++
 1 file changed, 167 insertions(+)
 create mode 100644 plugins/python/examples/imageio.py
diff --git a/plugins/python/examples/imageio.py
b/plugins/python/examples/imageio.py
new file mode 100644
index 00000000..e77fd2f4
--- /dev/null
+++ b/plugins/python/examples/imageio.py
@@ -0,0 +1,167 @@
+# Example Python plugin.
+#
+# This example can be freely used for any purpose.
+#
+# Upload and download images to oVirt with nbdkit and qemu-img.
+#
+# Install ovirt-imageio-client
+#
+#   dnf copr enable nsoffer/ovirt-imageio-preview
+#   dnf install ovirt-imageio-client
+#
+# To upload or download images, you need to start an image transfer. The
+# easiest way is using oVirt image_transfer.py example:
+#
+#  /usr/share/doc/python3-ovirt-enigne-sdk4/eamples/image_transfer.py \
+#      --engine-url https://my.engine \
+#      --username admin@internal \
+#      --password-file password \
+#      --cafile my.engine.pem \
+#      upload disk-uuid
+#
+# This will print the trasnfer URL for this image transfer.
+#
+# Run this example from the build directory:
+#
+#   ./nbdkit -t4 -f -v -U /tmp/nbd.sock -t4 python \
+#       ./plugins/python/examples/imageio.py \
+#       transfer_url=https://server:54322/images/ticket-id \
+#       connections=4 \
+#       secure=no
+#
+# Note that number of nbdkit threads and imageio connections should match.
+#
+# To upload an image run:
+#
+#   qemu-img convert -f qcow2 -O raw disk.img nbd:///?socket=tmp/nbd.sock
+#
+# Downloading image is not efficient with this version, since we don't
report
+# extents yet.
+#
+# The -f -v arguments are optional.  They cause the server to stay in
+# the foreground and print debugging, which is useful when testing.
+
+import queue
+import threading
+from contextlib import contextmanager
+
+from ovirt_imageio.client import ImageioClient
+
+import nbdkit
+
+# Using version 2 supporting the buffer protocol for better performance.
+API_VERSION = 2
+
+# Plugin configuration, can be set using key=value in the command line.
+params = {
+    "secure": True,
+    "ca_file": "",
+    "connections": 1,
+    "transfer_url": None,
+}
+
+
+def config(key, value):
+    """
+    Parse the url parameter which contains the transfer URL that we want to
+    serve.
+    """
+    if key == "transfer_url":
+        params["transfer_url"] = value
+    elif key == "connections":
+        params["connections"] = int(value)
+    elif key == "ca_file":
+        params["ca_file"] = value
+    elif key == "secure":
+        params["secure"] = boolify(key, value)
+    else:
+        raise RuntimeError("unknown parameter: {!r}".format(key))
+
+
+def boolify(key, value):
+    v = value.lower()
+    if v in ("yes", "true", "1"):
+        return True
+    if v in ("no", "false", 0):
+        return False
+    raise RuntimeError("Invalid boolean value for {}:
{!r}".format(key, value))
+
+
+def config_complete():
+    """
+    Called when configuration completed.
+    """
+    if params["transfer_url"] is None:
+        raise RuntimeError("'transfer_url' parameter is
required")
+
+
+def thread_model():
+    """
+    Using parallel model to speed up transfer with multiple connections to
+    imageio server.
+    """
+    return nbdkit.THREAD_MODEL_PARALLEL
+
+
+def open(readonly):
+    """
+    Called once when plugin is loaded. We created a pool of connected clients
+    that will be used for requests later.
+    """
+    pool = queue.Queue()
+    for i in range(params["connections"]):
+        client = ImageioClient(
+            params["transfer_url"],
+            cafile=params["ca_file"],
+            secure=params["secure"])
+        pool.put(client)
+    return { "pool": pool }
+
+
+def close(h):
+    """
+    Called when plugin is closed. Close and remove all clients from the pool.
+    """
+    pool = h["pool"]
+    while not pool.empty():
+        client = pool.get()
+        client.close()
+
+
+@contextmanager
+def client(h):
+    """
+    Context manager fetching an imageio client from the pool. Blocks until a
+    client is available.
+    """
+    pool = h["pool"]
+    client = pool.get()
+    try:
+        yield client
+    finally:
+        pool.put(client)
+
+
+def get_size(h):
+    with client(h) as c:
+        return c.size()
+
+
+def pread(h, buf, offset, flags):
+    with client(h) as c:
+        c.read(offset, buf)
+
+
+def pwrite(h, buf, offset, flags):
+    with client(h) as c:
+        c.write(offset, buf)
+
+
+def zero(h, count, offset, flags):
+    with client(h) as c:
+        c.zero(offset, count)
+
+
+def flush(h, flags):
+    with client(h) as c:
+        c.flush()
-- 
2.25.4
Nir Soffer
2020-Aug-06  23:11 UTC
Re: [Libguestfs] [PATCH nbdkit] plugins: python: Add imageio plugin example
On Fri, Aug 7, 2020 at 1:54 AM Nir Soffer <nirsof@gmail.com> wrote:> > This is mainly for testing the new parallel python threading model, but > it also an example how to manage multiple connection from a plugin. > > I tested this with local imageio server, serving qcow2 image on local > SSD. > > Start imageio server from imageio source: > > ./ovirt-imageio -c test > > Create test disk: > > qemu-img create -f qcow2 /var/tmp/disk.qcow2 6g > > Add ticket to accessing the image, using nbd example ticket: > > curl --unix-socket ../daemon/test/daemon.sock \ > --upload-file examples/nbd.json http://localhost/ticket/nbd > > Start qemu-nbd, serving the image for imageio: > > qemu-nbd --socket=/tmp/nbd.sock --persistent --shared=8 --format=qcow2 \ > --aio=native --cache=none --discard=unmap /var/tmp/disk.qcow2 > > Start nbdkit with this plugin: > > ./nbdkit -U nbd.sock -t4 -f python ./plugins/python/examples/imageio.py \ > transfer_url=https://localhost:54322/images/nbd connections=4 secure=no > > Finally, upload the image using qemu-img: > > time qemu-img convert -n -f raw -O raw -W /var/tmp/fedora-32.raw \ > nbd+unix:///?socket=./nbd.sock > > I tested with 1 and 4 threads/connections, creating new empty qcow2 > image before each test. > > 1 connections, 4 threads: > > real 0m7.885s > user 0m0.663s > sys 0m0.803s > > 4 connections, 4 threads: > > real 0m3.336s > user 0m0.439s > sys 0m0.651s > > This is what we see on imageio side: > > 1 connection: > > [connection 1 ops, 7.866482 s] > [dispatch 2630 ops, 6.488580 s] > [extents 1 ops, 0.002326 s] > [zero 1176 ops, 0.661475 s, 4.73 GiB, 7.15 GiB/s] > [write 1451 ops, 5.475842 s, 1.27 GiB, 237.08 MiB/s] > [flush 2 ops, 0.029208 s] > > 4 connections: > > [connection 1 ops, 3.289038 s] > [dispatch 670 ops, 2.679317 s] > [extents 1 ops, 0.010870 s] > [write 383 ops, 2.172633 s, 333.70 MiB, 153.59 MiB/s] > [zero 286 ops, 0.346506 s, 1.29 GiB, 3.72 GiB/s] > > [connection 1 ops, 3.303300 s] > [dispatch 632 ops, 2.711896 s] > [zero 273 ops, 0.380406 s, 1.12 GiB, 2.93 GiB/s] > [extents 1 ops, 0.000485 s] > [write 358 ops, 2.182803 s, 310.67 MiB, 142.33 MiB/s] > > [connection 1 ops, 3.318177 s] > [dispatch 669 ops, 2.759531 s] > [extents 1 ops, 0.064217 s] > [write 354 ops, 2.067320 s, 336.70 MiB, 162.87 MiB/s] > [zero 313 ops, 0.470069 s, 1.20 GiB, 2.55 GiB/s] > [flush 1 ops, 0.002421 s] > > [connection 1 ops, 3.280020 s] > [dispatch 662 ops, 2.685547 s] > [zero 304 ops, 0.431782 s, 1.13 GiB, 2.62 GiB/s] > [extents 1 ops, 0.000424 s] > [write 356 ops, 2.101127 s, 317.17 MiB, 150.95 MiB/s] > [flush 1 ops, 0.000127 s] > > Results are not very stable, but the trend is clear. We can use this > to optimize the virt-v2v. > > Signed-off-by: Nir Soffer <nsoffer@redhat.com> > --- > plugins/python/examples/imageio.py | 167 +++++++++++++++++++++++++++++ > 1 file changed, 167 insertions(+) > create mode 100644 plugins/python/examples/imageio.py > > diff --git a/plugins/python/examples/imageio.py b/plugins/python/examples/imageio.py > new file mode 100644 > index 00000000..e77fd2f4 > --- /dev/null > +++ b/plugins/python/examples/imageio.py > @@ -0,0 +1,167 @@ > +# Example Python plugin. > +# > +# This example can be freely used for any purpose. > +# > +# Upload and download images to oVirt with nbdkit and qemu-img. > +# > +# Install ovirt-imageio-client > +# > +# dnf copr enable nsoffer/ovirt-imageio-preview > +# dnf install ovirt-imageio-client > +# > +# To upload or download images, you need to start an image transfer. The > +# easiest way is using oVirt image_transfer.py example: > +# > +# /usr/share/doc/python3-ovirt-enigne-sdk4/eamples/image_transfer.py \ > +# --engine-url https://my.engine \ > +# --username admin@internal \ > +# --password-file password \ > +# --cafile my.engine.pem \ > +# upload disk-uuid > +# > +# This will print the trasnfer URL for this image transfer. > +# > +# Run this example from the build directory: > +# > +# ./nbdkit -t4 -f -v -U /tmp/nbd.sock -t4 python \ > +# ./plugins/python/examples/imageio.py \ > +# transfer_url=https://server:54322/images/ticket-id \ > +# connections=4 \ > +# secure=no > +# > +# Note that number of nbdkit threads and imageio connections should match. > +# > +# To upload an image run: > +# > +# qemu-img convert -f qcow2 -O raw disk.img nbd:///?socket=tmp/nbd.sockShould be: qemu-img convert -n -f raw -O raw disk.img nbd+unix:///?socket=/tmp/nbd.sock> +# > +# Downloading image is not efficient with this version, since we don't report > +# extents yet.Do we support reporting extents from python plugin?> +# > +# The -f -v arguments are optional. They cause the server to stay in > +# the foreground and print debugging, which is useful when testing. > + > +import queue > +import threading > +from contextlib import contextmanager > + > +from ovirt_imageio.client import ImageioClient > + > +import nbdkit > + > +# Using version 2 supporting the buffer protocol for better performance. > +API_VERSION = 2 > + > +# Plugin configuration, can be set using key=value in the command line. > +params = { > + "secure": True, > + "ca_file": "", > + "connections": 1, > + "transfer_url": None, > +} > + > + > +def config(key, value): > + """ > + Parse the url parameter which contains the transfer URL that we want to > + serve. > + """ > + if key == "transfer_url": > + params["transfer_url"] = value > + elif key == "connections": > + params["connections"] = int(value) > + elif key == "ca_file": > + params["ca_file"] = value > + elif key == "secure": > + params["secure"] = boolify(key, value) > + else: > + raise RuntimeError("unknown parameter: {!r}".format(key)) > + > + > +def boolify(key, value): > + v = value.lower() > + if v in ("yes", "true", "1"): > + return True > + if v in ("no", "false", 0): > + return False > + raise RuntimeError("Invalid boolean value for {}: {!r}".format(key, value)) > + > + > +def config_complete(): > + """ > + Called when configuration completed. > + """ > + if params["transfer_url"] is None: > + raise RuntimeError("'transfer_url' parameter is required") > + > + > +def thread_model(): > + """ > + Using parallel model to speed up transfer with multiple connections to > + imageio server. > + """ > + return nbdkit.THREAD_MODEL_PARALLEL > + > + > +def open(readonly): > + """ > + Called once when plugin is loaded. We created a pool of connected clients > + that will be used for requests later. > + """ > + pool = queue.Queue() > + for i in range(params["connections"]): > + client = ImageioClient( > + params["transfer_url"], > + cafile=params["ca_file"], > + secure=params["secure"]) > + pool.put(client) > + return { "pool": pool } > + > + > +def close(h): > + """ > + Called when plugin is closed. Close and remove all clients from the pool. > + """ > + pool = h["pool"] > + while not pool.empty(): > + client = pool.get() > + client.close() > + > + > +@contextmanager > +def client(h): > + """ > + Context manager fetching an imageio client from the pool. Blocks until a > + client is available. > + """ > + pool = h["pool"] > + client = pool.get() > + try: > + yield client > + finally: > + pool.put(client) > + > + > +def get_size(h): > + with client(h) as c: > + return c.size() > + > + > +def pread(h, buf, offset, flags): > + with client(h) as c: > + c.read(offset, buf) > + > + > +def pwrite(h, buf, offset, flags): > + with client(h) as c: > + c.write(offset, buf) > + > + > +def zero(h, count, offset, flags): > + with client(h) as c: > + c.zero(offset, count) > + > + > +def flush(h, flags): > + with client(h) as c: > + c.flush() > -- > 2.25.4 >
Richard W.M. Jones
2020-Aug-07  07:53 UTC
Re: [Libguestfs] [PATCH nbdkit] plugins: python: Add imageio plugin example
On Fri, Aug 07, 2020 at 02:11:41AM +0300, Nir Soffer wrote:> Do we support reporting extents from python plugin?No, can_extents and extents aren't implemented for python so far. I will push this patch with your small fix, and we can add extents support later. Thanks, Rich. -- Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones Read my programming and virtualization blog: http://rwmj.wordpress.com virt-p2v converts physical machines to virtual machines. Boot with a live CD or over the network (PXE) and turn machines into KVM guests. http://libguestfs.org/virt-v2v
Eric Blake
2020-Aug-08  17:34 UTC
Re: [Libguestfs] [PATCH nbdkit] plugins: python: Add imageio plugin example
On 8/6/20 5:54 PM, Nir Soffer wrote:> This is mainly for testing the new parallel python threading model, but > it also an example how to manage multiple connection from a plugin. > > I tested this with local imageio server, serving qcow2 image on local > SSD. >> diff --git a/plugins/python/examples/imageio.py b/plugins/python/examples/imageio.py > new file mode 100644 > index 00000000..e77fd2f4 > --- /dev/null > +++ b/plugins/python/examples/imageio.py > @@ -0,0 +1,167 @@ > +# Example Python plugin. > +# > +# This example can be freely used for any purpose. > +# > +# Upload and download images to oVirt with nbdkit and qemu-img. > +# > +# Install ovirt-imageio-client > +# > +# dnf copr enable nsoffer/ovirt-imageio-preview > +# dnf install ovirt-imageio-clientEven after running these steps,...> +# > +# To upload or download images, you need to start an image transfer. The > +# easiest way is using oVirt image_transfer.py example: > +# > +# /usr/share/doc/python3-ovirt-enigne-sdk4/eamples/image_transfer.py \...I do not have a /usr/share/doc/python3-ovirt-* directory at all. Where am I supposed to get it from? Also, I suspect you want s/enigne/engine/; s/eamples/examples/> +# --engine-url https://my.engine \ > +# --username admin@internal \ > +# --password-file password \ > +# --cafile my.engine.pem \ > +# upload disk-uuid > +# > +# This will print the trasnfer URL for this image transfer.s/trasnfer/transfer/> +# > +# Run this example from the build directory: > +# > +# ./nbdkit -t4 -f -v -U /tmp/nbd.sock -t4 python \-t4 does not need to be listed twice.> +# ./plugins/python/examples/imageio.py \ > +# transfer_url=https://server:54322/images/ticket-id \ > +# connections=4 \ > +# secure=no > +# > +# Note that number of nbdkit threads and imageio connections should match. > +# > +# To upload an image run: > +# > +# qemu-img convert -f qcow2 -O raw disk.img nbd:///?socket=tmp/nbd.sockURIs containing ? should be shell-quoted (in the unlikely case that someone has an 'nbd:' subdirectory that could interfere with shell globbing).> +# > +# Downloading image is not efficient with this version, since we don't report > +# extents yet.I'll push the obvious typo fixes.> + > +def boolify(key, value): > + v = value.lower() > + if v in ("yes", "true", "1"): > + return True > + if v in ("no", "false", 0): > + return False > + raise RuntimeError("Invalid boolean value for {}: {!r}".format(key, value))nbdkit_parse_bool also handles 'on/off'. Should we make that function more easily available to python plugins, so you aren't having to reimplement it yourself?> + > + > +def config_complete(): > + """ > + Called when configuration completed.missing 'is' -- Eric Blake, Principal Software Engineer Red Hat, Inc. +1-919-301-3226 Virtualization: qemu.org | libvirt.org
Nir Soffer
2020-Aug-08  19:47 UTC
Re: [Libguestfs] [PATCH nbdkit] plugins: python: Add imageio plugin example
On Sat, Aug 8, 2020 at 8:34 PM Eric Blake <eblake@redhat.com> wrote:> > On 8/6/20 5:54 PM, Nir Soffer wrote: > > This is mainly for testing the new parallel python threading model, but > > it also an example how to manage multiple connection from a plugin. > > > > I tested this with local imageio server, serving qcow2 image on local > > SSD. > > > > > diff --git a/plugins/python/examples/imageio.py b/plugins/python/examples/imageio.py > > new file mode 100644 > > index 00000000..e77fd2f4 > > --- /dev/null > > +++ b/plugins/python/examples/imageio.py > > @@ -0,0 +1,167 @@ > > +# Example Python plugin. > > +# > > +# This example can be freely used for any purpose. > > +# > > +# Upload and download images to oVirt with nbdkit and qemu-img. > > +# > > +# Install ovirt-imageio-client > > +# > > +# dnf copr enable nsoffer/ovirt-imageio-preview > > +# dnf install ovirt-imageio-client > > Even after running these steps,... > > > +# > > +# To upload or download images, you need to start an image transfer. The > > +# easiest way is using oVirt image_transfer.py example: > > +# > > +# /usr/share/doc/python3-ovirt-enigne-sdk4/eamples/image_transfer.py \ > > ...I do not have a /usr/share/doc/python3-ovirt-* directory at all. > Where am I supposed to get it from?I forgot to mention that you need also python3-ovirt-engine-sdk4. But if you want to try this you also need oVirt installation, requiring at least 2 vms, one for engine, one for the host. I think it would be better to replace this with the instructions of using a standalone imageio server, like I used in the commit message.> Also, I suspect you want s/enigne/engine/; s/eamples/examples/ > > > +# --engine-url https://my.engine \ > > +# --username admin@internal \ > > +# --password-file password \ > > +# --cafile my.engine.pem \ > > +# upload disk-uuid > > +# > > +# This will print the trasnfer URL for this image transfer. > > s/trasnfer/transfer/ > > > +# > > +# Run this example from the build directory: > > +# > > +# ./nbdkit -t4 -f -v -U /tmp/nbd.sock -t4 python \ > > -t4 does not need to be listed twice. > > > +# ./plugins/python/examples/imageio.py \ > > +# transfer_url=https://server:54322/images/ticket-id \ > > +# connections=4 \ > > +# secure=no > > +# > > +# Note that number of nbdkit threads and imageio connections should match. > > +# > > +# To upload an image run: > > +# > > +# qemu-img convert -f qcow2 -O raw disk.img nbd:///?socket=tmp/nbd.sock > > URIs containing ? should be shell-quoted (in the unlikely case that > someone has an 'nbd:' subdirectory that could interfere with shell > globbing). > > > +# > > +# Downloading image is not efficient with this version, since we don't report > > +# extents yet. > > I'll push the obvious typo fixes.Thanks!> > + > > +def boolify(key, value): > > + v = value.lower() > > + if v in ("yes", "true", "1"): > > + return True > > + if v in ("no", "false", 0): > > + return False > > + raise RuntimeError("Invalid boolean value for {}: {!r}".format(key, value)) > > nbdkit_parse_bool also handles 'on/off'. Should we make that function > more easily available to python plugins, so you aren't having to > reimplement it yourself?Yes, it makes sense that all plugins can use a similar interface.> > +def config_complete(): > > + """ > > + Called when configuration completed. > > missing 'is' > > -- > Eric Blake, Principal Software Engineer > Red Hat, Inc. +1-919-301-3226 > Virtualization: qemu.org | libvirt.org >
Reasonably Related Threads
- Re: [PATCH nbdkit] plugins: python: Add imageio plugin example
- [PATCH nbdkit] plugins: python: Add imageio plugin example
- [PATCH nbdkit] plugins: python: Fix imageio example instructions
- v2v: -o rhv-upload: Upload via NBD
- Re: [PATCH] RFC: rhv-upload-plugin: Use imageio client