2013000 – RFE: allow nbdkit to fix the effective URL for mirrored sites

Bug 2013000 - RFE: allow nbdkit to fix the effective URL for mirrored sites

Summary: RFE: allow nbdkit to fix the effective URL for mirrored sites

Keywords:
Status:	CLOSED UPSTREAM
Alias:	None
Product:	Virtualization Tools
Classification:	Community
Component:	nbdkit
Sub Component:
Version:	unspecified
Hardware:	Unspecified
OS:	Unspecified
Priority:	unspecified
Severity:	unspecified
Target Milestone:	---
Assignee:	Richard W.M. Jones
QA Contact:	Fedora Extras Quality Assurance
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2021-10-11 19:16 UTC by Alexander Wels
Modified:	2021-10-19 20:57 UTC (History)
CC List:	2 users (show)
Fixed In Version:
Clone Of:
Environment:
Last Closed:	2021-10-19 20:57:36 UTC
Embargoed:
Dependent Products:

Attachments	(Terms of Use)
test.c (1.24 KB, text/plain) 2021-10-12 09:54 UTC, Richard W.M. Jones	no flags	Details
View All

Description Alexander Wels 2021-10-11 19:16:29 UTC

Description of problem:
When using nbdkit with qemu-img-curl to download and convert an image, it should only resolve a redirect once. For instance if I am downloading from a mirrored source like download.fedoraproject.org it will follow the redirect to a different mirror many times during the download. I am not sure if the problem is in nbdkit or qemu-img-curl plugin.

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1. Start a conversion with nbdkit with the qemu-img-curl plugin and the source URL is one of the mirrored URLs, like download.fedoraproject.org, AND one of the mirrors is flaky.

Actual results:
Each time nbdkit makes a request (each byte-range I think) and the URL is a mirrored URL like above, there is a chance that the returned mirror is down or invalid. This will break the import immediately.

Expected results:
Well there are IMO 2 possible solutions:
1. Resolve the mirrored URL to a real mirror first, and then only use that real mirror for all the requests.
2. If the returned mirror fails, retry the original URL, and hope you get a new mirror that does work.

Additional info:
We use nbdkit with the qemu-img-curl extensively in Containerized Data Importer, and whenever a fedora or centos mirror is flaky, we get bug reports because the import fails. The real problem is the flaky mirror, but because there are soo many chances of getting the bad mirror returned during an import, it is almost guaranteed that the import fails. If one of the two solutions suggested is implemented, it should make the whole import more resilient.

Comment 1 Richard W.M. Jones 2021-10-12 09:54:28 UTC

Created attachment 1832139 [details]
test.c

To be clear about this, are we talking about nbdkit-curl-plugin?
qemu's curl plugin is something different.

Anyway it is indeed true that: (a) nbdkit-curl-plugin will
make many small byte-range requests from different threads
and (b) if the source issues a redirect then we could redirect
to a different mirror on each request and (c) if one request
fails then the whole thing will fail.

There are a few possible ways to work around this:

(1) https://libguestfs.org/nbdkit-retry-filter.1.html can
be inserted in the chain and it will restart the plugin if
a failure happens.  It's likely to be quite a sledgehammer
fix, it might be possible to improve the curl plugin to
do something less aggressive.

(2) Prefetch the URL yourself, which will resolve the mirror,
check the mirror works and retry, then pass the resolved URL
to nbdkit.

I wrote something similar to (2) a while back when this same
question came up previously, see attached program.

Comment 2 Alexander Wels 2021-10-12 12:26:33 UTC

I am sorry, yes I mean the nbdkit-curl-plugin. I have a PR that implements 2, and IMO works great, however in the review I got comments that nbdkit should be doing the prefetch instead of us, which is the reason I opened this bug. Let me see if 1 can work for our use case.

Comment 3 Richard W.M. Jones 2021-10-12 13:48:20 UTC

For (1) I think the easiest thing would be if there's
a curl option to "pin" the redirection to a single
mirror (although you'd still have a problem if the
mirror it happened to choose was broken).  I don't see
much here that looks like it could help:

https://curl.se/libcurl/c/curl_easy_setopt.html

It might be asking the curl developers if they've
considered this case.

Comment 4 Alexander Wels 2021-10-12 14:03:12 UTC

So if the mirror picked is the broken one is not a huge problem for us. The import will fail, and be retried using the mirror URL, and hopefully we will get a different mirror that works. The problem manifests itself because each time we read a small byte range, there is a chance we hit the broken mirror. So if we have 10 mirrors, and 1 is broken, and during an import we read 1000 different byte ranges, we have, it is almost guaranteed that at some point we get the broken mirror.

Does the retry filter retry just one byte range if it fails, or does it retry the entire import. I am hoping just the one byte range. If so I think that will make it sufficiently robust for our purposes.

Comment 5 Richard W.M. Jones 2021-10-12 14:30:11 UTC

The retry filter is going to be a big (too big) hammer here.  It
will actually reload the entire plugin if any request fails.

I think what you actually want is more like some way to pin the
redirect to a fixed URL, ie the first time any range is requested,
we get the resolved URL from curl and use that URL in future requests.
(This would be opt-in through a new command line option).

I'm open to an implementation of this in nbdkit-curl-plugin provided
it's not going to be too invasive.

Comment 6 Alexander Wels 2021-10-12 15:39:22 UTC

Yes that is basically what I am asking for. And if the one it resolves to is the broken one, it will fail, which is fine since that is exactly what would have happened before anyway. A flag to pin it would be a great solution for us.

Comment 7 Richard W.M. Jones 2021-10-13 10:07:31 UTC

Patch posted:
https://listman.redhat.com/archives/libguestfs/2021-October/thread.html#00048

Can you tell us what are the specific objections to doing
the prefetch outside of nbdkit (as you said in comment 2)?

Comment 8 Alexander Wels 2021-10-13 12:57:29 UTC

Personally I think it is fine for us to pre-resolve the URL for a real mirror, and then passing that to nbdkit, I did exactly that in this PR https://github.com/kubevirt/containerized-data-importer/pull/1981 but other members of my team feel like it is going against the way http works for us to pre-resolve the URL, and then passing that to nbdkit, and that that responsibility should reside in nbdkit.

Comment 9 Richard W.M. Jones 2021-10-15 14:19:30 UTC

Alternative proposal of a new nbdkit-retry-request-filter:
https://listman.redhat.com/archives/libguestfs/2021-October/thread.html#00084

Comment 10 Richard W.M. Jones 2021-10-19 20:57:36 UTC

Fixed upstream and in 1.29.1 by:

https://gitlab.com/nbdkit/nbdkit/-/commit/73ff1ad1bf11988949509ba299a5454f4397f952

https://libguestfs.org/nbdkit-retry-request-filter.1.html

Note You need to log in before you can comment on or make changes to this bug.