Re: FYI: Go implementation of the NBD protocol

To: "Richard W.M. Jones" <rjones@redhat.com>, Axel Wagner <axel.wagner.hh@googlemail.com>
Cc: nbd@other.debian.org
Subject: Re: FYI: Go implementation of the NBD protocol
From: Eric Blake <eblake@redhat.com>
Date: Wed, 28 Nov 2018 15:58:31 -0600
Message-id: <[🔎] 7d8220fd-10b0-71b4-a000-3d094e0a5e58@redhat.com>
In-reply-to: <[🔎] 20181128102327.GG27120@redhat.com>
References: <[🔎] CAEkBMfHihH8eBny1eC5zX743kxkR+fjaTjxTefTtfrcC_ULHsA@mail.gmail.com> <[🔎] 20181127074542.GF17471@redhat.com> <[🔎] CAEkBMfEh57R9MYVcTr0KiFqr_g-fQHi5GKEjuP33z51P-PKbow@mail.gmail.com> <[🔎] 20181128102327.GG27120@redhat.com>

On 11/28/18 4:23 AM, Richard W.M. Jones wrote:

On Tue, Nov 27, 2018 at 04:22:02PM +0100, Axel Wagner wrote:

Hi Richard,

no, I have only tested against nbd-{client,server} and the Linux kernel
implementation. Compatibility simply hasn't been a huge priority for me :)

Personally, it seems more efficient to me to have one reference
implementation and testsuite to run against new implementations, than to
require each new implementation to build a new testsuite for each existing
one. For example, I don't know nbdkit at all and know very little about
qemu. The thought of having to figure out how to run a client/server of
each and actually observe the outcomes of a testsuite seems… dreadful.
Whereas if you'd give me a binary that I can just point at my server and it
gives me a list of protocol-violations, I'd be fine to fix them all.


I don't disagree but the chances of us having a reference
implementation which fully tests the protocol any time soon is slim.
In the meantime testing against lots of clients/servers is the best bet.

Agreed. qemu-nbd has a python script that simulates a server that isintentionally broken (early disconnects and/or intentionally wrongbytes) at strategic points during initial handshake and the first clientrequest, in order to test client robustness against flaky servers(qemu.git/tests/qemu-iotests/nbd-fault-injector.py), but it does nothave a client counterpart, and it is sadly out of date (doesn't knowNBD_OPT_GO, for example).

In my experience, the most common server bugs are failure to implementNBD_OPT_ length handling correctly, both for known options (did youcheck for a client sending length when it shouldn't, and after reportingthe error are you still in sync to continue reading the next option fromthe client) and for unknown options (clients will want to probe you forthe support of extensions, and this probing MUST not kill theconnection, whether or not the client sent a payload). I recall fixingbugs in that category in all three of qemu-nbd, nbd-server, and nbdkit;) Most clients that can get into transmission phase tend to bewell-behaved, so testing that a server is robust against an ill-behavedclient is harder.


For reference here are the commands to test against qemu, qemu-nbd and
nbdkit:

Also, I don't know if you've implemented TLS support yet, but that'sanother tricky thing to get right, and we can help you with commandlines for the same three projects with TLS support.

And, in a quick read of your project's README, you mention that it isdesigned to make it easy to implement arbitrary block mode failures.The nbdkit implementation has a similar mode of operation already, andRich even has a recent video he made with that in action (in his video,he is running 5 NBD disks coupled to a tcl visualization, to demonstrategraphically which portions of a disks the kernel is touching, and toshow what happens during the hot-failover of a RAID5 setup when one ofthe devices starts giving errors). It might be interesting to comparedesigns.


--
Eric Blake, Principal Software Engineer
Red Hat, Inc.           +1-919-301-3266
Virtualization:  qemu.org | libvirt.org

Reply to:

Follow-Ups:
- Re: FYI: Go implementation of the NBD protocol
  - From: Wouter Verhelst <w@uter.be>

References:
- FYI: Go implementation of the NBD protocol
  - From: Axel Wagner <axel.wagner.hh@googlemail.com>
- Re: FYI: Go implementation of the NBD protocol
  - From: "Richard W.M. Jones" <rjones@redhat.com>
- Re: FYI: Go implementation of the NBD protocol
  - From: Axel Wagner <axel.wagner.hh@googlemail.com>
- Re: FYI: Go implementation of the NBD protocol
  - From: "Richard W.M. Jones" <rjones@redhat.com>

Prev by Date: Re: FYI: A talk about using NBD as an alternative to loop device / loop mounting
Next by Date: Re: FYI: Go implementation of the NBD protocol
Previous by thread: Re: FYI: Go implementation of the NBD protocol
Next by thread: Re: FYI: Go implementation of the NBD protocol
Index(es):
- Date
- Thread