Re: New test kernel - second attempt



Hi,

On Tue, 2025-09-09 at 19:34 +0200, Riccardo Mottola wrote:
> John Paul Adrian Glaubitz wrote:
> > https://people.debian.org/~glaubitz/sparc64/
> > 
> > Don't mind the weird version number, it's a 6.12.43 kernel:
> > 
> > linux (6.12.43-1+nothp1) UNRELEASED; urgency=medium
> > 
> >    * Add sparc64_fix_hugetlb.patch
> >    * Add sparc64_fix_copyops.patch
> >    * Disable CONFIG_TRANSPARENT_HUGEPAGE
> >    * Disable CONFIG_TRANSPARENT_HUGEPAGE_ALWAYS
> 
> Sorry, but this is not a good one. On the Sun Fire T2000 I used the
> "smp" kernel and it crashes on boot:
> Loading Linux 6.12+unreleased-sparc64-smp ...
> Loading initial ramdisk ...
> [    2.919995] pci 0000:02:00.0: failed to populate child OF nodes
> (-19)
> [    2.920055] pci 0000:03:01.0: failed to populate child OF nodes
> (-19)
> [    2.920103] pci 0000:03:02.0: failed to populate child OF nodes
> (-19)
> [    2.920150] pci 0000:03:08.0: failed to populate child OF nodes
> (-19)
> [    2.920197] pci 0000:03:09.0: failed to populate child OF nodes
> (-19)
> [    2.995757] pci 0001:02:00.0: failed to populate child OF nodes
> (-19)
> [    2.995812] pci 0001:03:01.0: failed to populate child OF nodes
> (-19)
> [    2.995860] pci 0001:03:02.0: failed to populate child OF nodes
> (-19)
> [    2.995907] pci 0001:03:08.0: failed to populate child OF nodes
> (-19)
> [    2.995954] pci 0001:03:09.0: failed to populate child OF nodes
> (-19)
> [    2.996002] pci 0001:04:00.0: failed to populate child OF nodes
> (-19)
> [    2.996048] pci 0001:04:00.2: failed to populate child OF nodes
> (-19)
> [   10.810326] mptsas 0000:07:00.0: Unable to change power state from 
> D3cold to D0, device inaccessible
> [   11.147798] NON-RESUMABLE ERROR: Reporting on cpu
> 24
> [   11.147975] NON-RESUMABLE ERROR: TPC [0x000000001017e034] 
> <MakeIocReady+0x10/0x298 [mptbase]>
> [   11.148142] NON-RESUMABLE ERROR: RAW 
> [1810000000000001:00000010e6e37b64:0000000202000004:000000ea00300000
> [   11.148226] NON-RESUMABLE ERROR: 
> 0000000000180000:0000000000000000:0000000000000000:0000000000000000]
> [   11.148307] NON-RESUMABLE ERROR: handle [0x1810000000000001] stick 
> [0x00000010e6e37b64]
> [   11.148382] NON-RESUMABLE ERROR: type [precise nonresumable]
> [   11.148444] NON-RESUMABLE ERROR: attrs [0x02000004] < PIO sp-faulted 
> priv >
> [   11.148568] NON-RESUMABLE ERROR: raddr [0x000000ea00300000].
> [   11.149692] Kernel panic - not syncing: Non-resumable error.
> [   11.149772] CPU: 24 UID: 0 PID: 405 Comm: (udev-worker) Not tainted 
> 6.12+unreleased-sparc64-smp #1  Debian 6.12.43-1+nothp1
> [   11.149898] Call Trace:
> [   11.149951] [<0000000000eff2b4>] dump_stack+0x8/0x18
> [   11.150042] [<0000000000ef1930>] panic+0xf4/0x398
> [   11.150121] [<000000000042a48c>] sun4v_nonresum_error+0x16c/0x240
> [   11.150217] [<0000000000406eb8>] sun4v_nonres_mondo+0xc8/0xd8
> [   11.150320] [<000000001017e034>] MakeIocReady+0x10/0x298 [mptbase]
> [   11.150426] [<000000001017e4b4>] mpt_do_ioc_recovery+0x9c/0x1110 
> [mptbase]
> [   11.150532] [<000000001017d6f8>] mpt_attach+0xb58/0xd20 [mptbase]
> [   11.150636] [<0000000010283f30>] mptsas_probe+0x10/0x440 [mptsas]
> [   11.150761] [<0000000000ad1fac>] pci_device_probe+0xac/0x180
> [   11.150862] [<0000000000b8b8e8>] really_probe+0xc8/0x400
> [   11.150956] [<0000000000b8bcac>] __driver_probe_device+0x8c/0x160
> [   11.151050] [<0000000000b8be68>] driver_probe_device+0x28/0x100
> [   11.151145] [<0000000000b8c11c>] __driver_attach+0xbc/0x1e0
> [   11.151238] [<0000000000b8927c>] bus_for_each_dev+0x5c/0xc0
> [   11.151328] [<0000000000b8b09c>] driver_attach+0x1c/0x40
> [   11.151419] [<0000000000b8a860>] bus_add_driver+0x180/0x240
> [   11.772806] Press Stop-A (L1-A) from sun keyboard or send break
> [   11.772806] twice on console to return to the boot prom
> [   11.773113] ---[ end Kernel panic - not syncing: Non-resumable error. 
> ]---
> 
> 
> The first part, "failed to populate child OF nodes", also happens with
> kernels that do boot and run successfully.

Keep in mind you may have to keep the machine off for a longer time or reset
the NV-RAM. We've got multiple reports now of machines that became stable
after that.

Adrian

-- 
 .''`.  John Paul Adrian Glaubitz
: :' :  Debian Developer
`. `'   Physicist
  `-    GPG: 62FF 8A75 84E0 2956 9546  0006 7426 3B37 F5B5 F913

