Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting

To: Paul Clements <paul.clements@...856...>
Cc: "nbd-general@lists.sourceforge.net" <nbd-general@lists.sourceforge.net>, Wouter Verhelst <w@...112...>, Jack Kara <jack@...1290...>, Wouter Verhelst <wouter@...825...>
Subject: Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
From: Goswin von Brederlow <goswin-v-b@...186...>
Date: Mon, 25 Nov 2013 13:42:31 +0100
Message-id: <20131125124231.GB3610@...1266...>
In-reply-to: <CAECXXi7dEzPct6iEWxOFiPf707VOhFKwFFb6Og_G2+qOv-Od-g@...18...>
References: <8bf7c5db475eefcf17976a36f892200d@...1427...> <20131112214632.GB31763@...1426...> <7c1b2ca40c3abfe805e9e944f21c7016@...1427...> <20131114075827.GA13554@...1426...> <5285D258.9040808@...112...> <CAECXXi6Vt5gAjv=qkrGzLG3iRjNmjYiYZd7+gCXK860a2tonKg@...18...> <52889084.2080700@...825...> <C2E9E8AD-752C-4190-BD4F-45A9482FF400@...872...> <CAECXXi7dEzPct6iEWxOFiPf707VOhFKwFFb6Og_G2+qOv-Od-g@...18...>

On Mon, Nov 18, 2013 at 05:17:13PM -0500, Paul Clements wrote:
> On Sun, Nov 17, 2013 at 12:19 PM, Alex Bligh <alex@...872...> wrote:
> 
> >
> > On 17 Nov 2013, at 09:46, Wouter Verhelst wrote:
> >
> > >>
> > >> In order for nbd to seamlessly handle this situation, we'd have to do a
> > >> reconnect in-kernel
> > >
> > > This would be fairly complicated, since all the connection and
> > > negotiation currently happens in userspace. I'm not sure I want to go
> > > down that route.
> > >
> > >> (or have a callout to userland to reconnect)
> > >
> > > That sounds interesting, too. How would you do that?
> > >
> > >> and
> > >> then we'd have to retry any I/Os that may have failed in the meantime
> > >> (or just let them fail, but that probably is not as useful).
> > >
> >
> > Would another option be as follows:
> 
> 
> >
> 1. When persistency is required, a new persist flag is specified to
> >    the kernel by the client.
> yes

Which might as well be alawys set, i.e. unneccessary.

> > 2. On a connection failure, if the persist flag is set, don't
> >    clear up and return with a specific error number. The fd is
> >    still open (as still owned by the process), but (by assumption)
> >    unusable.
> right

By fd do you mean the socket? The socket could be closed. The kernel
got an error on it so it is unusable. What would be the point of
keeping it?

> > 3. In persist mode, The block device only gets torn down when
> >    the fd closes / userland process terminates (whichever is
> >    easier, detection method TBD). Until then all writes block.

In persist mode the client would open a new socket and do another
NBD_DOIT and operations would resume.

When killed or not persistant the client (or kernel on exit) would
close the FD for the NBD device. This should kill the device, clean up
any buffers and return EIO to anything pending.

> Yes, I'm thinking maybe a .release would allow the driver to see process
> exit/fd close and do the tear down in that case (for example, a SIGKILL on
> nbd-client will not allow it to reconnect as the client will die
> immediately upon return to userland).

The FD that first creates the device should be controlling the device.
Closing it should kill the device but only that first one. Not other,
later users of the device. Not sure how that reflect in kernel.

> Blocking the writes will hopefully be straightforward -- however, there is
> a fair amount of cleanup required to change the current behavior of failing
> I/Os to blocking instead. Also, we'll have to retry any requests that are
> left on the nbd->queue_head (awaiting response from the server), followed
> by resuming handling of requests that are on the main request queue.
> 
> 
> 
> > 4. A newer nbd client detects the errno in persist mode, opens another
> >    fd, and calls the NBD_DOIT ioctl passing the old fd as an
> >    additional parameter (or does a new ioctl first to associate
> >    the new fd with the old fd).
> 
> 
> Probably a new ioctl, as the routine will be slightly different on
> reconnect...
> 
> 
> > A new kernel then detects this,
> >    closes the old fd, and 'takes over' the existing block device
> >    with the new fd.
> >
> >
> Right, some rearrangement of the ioctls would be required too...we'd
> probably want alternate versions of SET_SOCK and DO_IT that are re-entrant
> (right now those will error on an already-configured device, and they're
> doing some setup and teardown that is unneeded in the reconnect case).

Wouldn't it be better (as in less new ioctls) to allow SET_SOCK and
DO_IT when the device is in a disconnected state? The setup and
teardown would have to be moved then.

MfG
	Goswin

Reply to:

References:
- Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
  - From: Jan Kara <jack@...1290...>
- Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
  - From: Wouter Verhelst <w@...112...>
- Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
  - From: Paul Clements <paul.clements@...856...>
- Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
  - From: Wouter Verhelst <wouter@...825...>
- Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
  - From: Alex Bligh <alex@...872...>
- Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
  - From: Paul Clements <paul.clements@...856...>

Prev by Date: Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
Next by Date: [Nbd] [PATCH 07/25] block: Convert bio_for_each_segment() to bvec_iter
Previous by thread: Re: [Nbd] 3.12 BUG() on ext4, kernel crash on nbd-client when nbd server rebooting
Next by thread: [Nbd] [PATCH 07/25] block: Convert bio_for_each_segment() to bvec_iter
Index(es):
- Date
- Thread