Bug#1059624: linux-image-6.1.0-16-amd64: aacraid abort request / SCSI hang after upgrade from 11.8 -> 12.4
Control: tags -1 + confirmed pending
Control: tags -1 - moreinfo
On Sat, Dec 30, 2023 at 01:06:20AM +0100, Samuel Wolf wrote:
> Hi Salvatore,
>
> > So it would be welcome if you find time to make it possible to test it by saturday evening.
>
> my test was quicker than expected since i found a way to reproduce the
> issue on my test server.
>
> Behind the Adaptec 8805 is a raid6 storage with 54TB and LUKS encrypted.
> As soon I open and mount the LUKS drive with kernel 6.1.67-1 the
> controller hang:
>
> [ 480.888273] aacraid: Host adapter abort request.
> aacraid: Outstanding commands on (0,0,3,0):
> [ 480.902784] aacraid: Host bus reset request. SCSI hang ?
> [ 480.902933] aacraid 0000:02:00.0: outstanding cmd: midlevel-0
> [ 480.902935] aacraid 0000:02:00.0: outstanding cmd: lowlevel-0
> [ 480.902936] aacraid 0000:02:00.0: outstanding cmd: error handler-0
> [ 480.902936] aacraid 0000:02:00.0: outstanding cmd: firmware-251
> [ 480.902937] aacraid 0000:02:00.0: outstanding cmd: kernel-0
> [ 480.916921] aacraid 0000:02:00.0: Controller reset type is 3
> [ 480.917076] aacraid 0000:02:00.0: Issuing IOP reset
> [ 517.004437] aacraid 0000:02:00.0: IOP reset succeeded
> [ 517.029007] aacraid: Comm Interface type2 enabled
> [ 529.479247] aacraid 0000:02:00.0: Scheduling bus rescan
> [ 539.678274] aacraid 0000:02:00.0: DDR cache data recovered successfull
>
> This is reproducible with every luksClose and luksOpen mount.
>
> Now I booting into your test kernel 6.1.67-1a~test and try the same again:
>
> [ 9.610151] IPv6: ADDRCONF(NETDEV_CHANGE): enp1s0: link becomes ready
> [ 81.503552] EXT4-fs (dm-0): mounted filesystem with ordered data
> mode. Quota mode: none.
> [ 119.133460] EXT4-fs (dm-0): unmounting filesystem.
> [ 138.547366] sd 0:0:3:0: [sda] Very big device. Trying to use READ
> CAPACITY(16).
> [ 139.214205] EXT4-fs (dm-0): mounted filesystem with ordered data
> mode. Quota mode: none.
> [ 162.376044] EXT4-fs (dm-0): unmounting filesystem.
> [ 182.222397] sd 0:0:3:0: [sda] Very big device. Trying to use READ
> CAPACITY(16).
> [ 182.913977] EXT4-fs (dm-0): mounted filesystem with ordered data
> mode. Quota mode: none.
> [ 217.611072] EXT4-fs (dm-0): unmounting filesystem.
> [ 230.778060] sd 0:0:3:0: [sda] Very big device. Trying to use READ
> CAPACITY(16).
> [ 231.386349] EXT4-fs (dm-0): mounted filesystem with ordered data
> mode. Quota mode: none.
>
> No errors and the LUKS device is opened in ~1 second not like before
> in 1 minute.
>
> Since I can not technical overview the patch/revert, is this enough
> testing for you?
>
> Thanks for the test kernel.
Thanks for your testing! Yes this is enough from your side, thanks a
lot for taking the time for the explict test rounds!
Regards,
Salvatore
Reply to: