thr3ads.net - Libguestfs - [Libguestfs] v2v: -o rhv-upload: Long time spent zeroing the disk [Apr 2018]

If this information is useful, please help other people find it:
Share via:

Richard W.M. Jones

2018-Apr-10 10:44 UTC

[Libguestfs] v2v: -o rhv-upload: Long time spent zeroing the disk

We now have true zeroing support in oVirt imageio, thanks for that.

However a problem is that ‘qemu-img convert’ issues zero requests for
the whole disk before starting the transfer.  It does this using 32 MB
requests which take approx. 1 second each to execute on the oVirt side.

Two problems therefore:

(1) Zeroing the disk can take a long time (eg. 40 GB is approx.
    20 minutes).  Furthermore there is no progress indication while this
    is happening.

    Nothing bad happens: because it is making frequent requests there
    is no timeout.

(2) I suspect that because we don't have trim support that this is
    actually causing the disk to get fully allocated on the target.

    The NBD requests are sent with may_trim=1 so we could turn these
    into trim requests, but obviously cannot do that while there is no
    trim support.

Note that this is (sort of) a regression over previous versions of the
‘-o rhv-upload’ patch since the old code ignored some zero requests.

I'm not sure what to do about this.  Possibly we could fix the
progress bar issue in qemu.  It's also possible we could do something
in nbdkit such as having is signal back that "stuff is being done" and
turn that into some indication in virt-v2v.

Anyway it's something to be aware of, in case you try out the
forthcoming virt-v2v rhvpreview and it appears to sit at 0/100% "doing
nothing" for a long time.

Rich.

-- 
Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
Read my programming and virtualization blog: http://rwmj.wordpress.com
virt-top is 'top' for virtual machines.  Tiny program with many
powerful monitoring features, net stats, disk stats, logging, etc.
http://people.redhat.com/~rjones/virt-top

Nir Soffer

2018-Apr-10 13:03 UTC

head link

Re: [Libguestfs] v2v: -o rhv-upload: Long time spent zeroing the disk

On Tue, Apr 10, 2018 at 1:44 PM Richard W.M. Jones <rjones@redhat.com>
wrote:
> We now have true zeroing support in oVirt imageio, thanks for that.
>
> However a problem is that ‘qemu-img convert’ issues zero requests for
> the whole disk before starting the transfer.  It does this using 32 MB
> requests which take approx. 1 second each to execute on the oVirt side.
> Two problems therefore:
>
> (1) Zeroing the disk can take a long time (eg. 40 GB is approx.
>     20 minutes).  Furthermore there is no progress indication while this
>     is happening.
>
>     Nothing bad happens: because it is making frequent requests there
>     is no timeout.
>
> (2) I suspect that because we don't have trim support that this is
>     actually causing the disk to get fully allocated on the target.
>
>     The NBD requests are sent with may_trim=1 so we could turn these
>     into trim requests, but obviously cannot do that while there is no
>     trim support.
>
It sounds like nbdkit is emulating trim with zero instead of noop.

I'm not sure why qemu-img is trying to do, I hope the nbd maintainer on
qemu side can explain this.

However, since you suggest that we could use "trim" request for these
requests, it means that these requests are advisory (since trim is), and
we can just ignore them if the server does not support trim.

This will also solve the timeout issue you reported in private mail.

Adding Eric and qemu-block.

Nir

Kevin Wolf

2018-Apr-10 13:48 UTC

head link

Re: [Libguestfs] [Qemu-block] v2v: -o rhv-upload: Long time spent zeroing the disk

Am 10.04.2018 um 15:03 hat Nir Soffer geschrieben:> On Tue, Apr 10, 2018 at 1:44 PM Richard W.M. Jones
<rjones@redhat.com>
> wrote:
> 
> > We now have true zeroing support in oVirt imageio, thanks for that.
> >
> > However a problem is that ‘qemu-img convert’ issues zero requests for
> > the whole disk before starting the transfer.  It does this using 32 MB
> > requests which take approx. 1 second each to execute on the oVirt
side.
> 
> 
> > Two problems therefore:
> >
> > (1) Zeroing the disk can take a long time (eg. 40 GB is approx.
> >     20 minutes).  Furthermore there is no progress indication while
this
> >     is happening.
> >
> 
> >     Nothing bad happens: because it is making frequent requests there
> >     is no timeout.
> >
> > (2) I suspect that because we don't have trim support that this is
> >     actually causing the disk to get fully allocated on the target.
> >
> >     The NBD requests are sent with may_trim=1 so we could turn these
> >     into trim requests, but obviously cannot do that while there is no
> >     trim support.
> >
> 
> It sounds like nbdkit is emulating trim with zero instead of noop.
> 
> I'm not sure why qemu-img is trying to do, I hope the nbd maintainer on
> qemu side can explain this.
qemu-img tries to efficiently zero out the whole device at once so that
it doesn't have to use individual small write requests for unallocated
parts of the image later on.

The problem is that the NBD block driver has max_pwrite_zeroes = 32 MB,
so it's not that efficient after all. I'm not sure if there is a real
reason for this, but Eric should know.
> However, since you suggest that we could use "trim" request for
these
> requests, it means that these requests are advisory (since trim is), and
> we can just ignore them if the server does not support trim.
What qemu-img sends shouldn't be a NBD_CMD_TRIM request (which is indeed
advisory), but a NBD_CMD_WRITE_ZEROES request. qemu-img relies on the
image actually being zeroed after this.

Kevin

Possibly Parallel Threads

Search for more apparently analagous threads

Libguestfs - Apr 2018 - v2v: -o rhv-upload: Long time spent zeroing the disk

[Libguestfs] v2v: -o rhv-upload: Long time spent zeroing the disk

Re: [Libguestfs] v2v: -o rhv-upload: Long time spent zeroing the disk

Re: [Libguestfs] [Qemu-block] v2v: -o rhv-upload: Long time spent zeroing the disk

Possibly Parallel Threads