[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: 5.10.0-4-sparc64-smp #1 Debian 5.10.19-1 crashes on T2000



Hi Adrian,

John Paul Adrian Glaubitz wrote:
> Did you forget to create an initrd? After installing the kernel, run:
> 
> $ update-initramfs -k KERNEL_VERSION -c

I did not run it this way, will do.

I had it however, of a very big size:
316M Jan 14 17:15 initrd.img-5.9.0-rc1+

which filled up my /boot

I removed it, regenerated with your command, but I get dropped into
initramfs with no modules found. Hmm..

> 
>> The good news is that latest kernel installed seems to boot and takes
>> all CPUs online. How stable it is I don't know, it needs to be tested.
> Please run some stress tests such as stress-ng and report back.

Not nice. I started compiling some stuff and the box froze, I connected
serial console and could not resume due to Fast Data Access MMU miss"

I will now stress things again, but keeping serial console attached with
another computer and see.

UP to last week with the old 5.9 kernel I had no issues compiling even
large things as gecko based ArcticFox or the Linux kernel itself. So if
the Fire didn't fail over the weekend.... it smells as kernel instability.

What should I use in stress-ng? I just tried "--all 8 --timeout 120s"

and the machine locked up after a little and in the serial console I see:

[ 8563.833509] current->{active_,}mm->context = 0000000000000fcb

[ 8563.833523] current->{active_,}mm->pgd = ffff8000d35c8000

[ 8563.846347] Unable to handle kernel NULL pointer dereference in mna
handler
[ 8563.846365]  at virtual address 00000000000000e7

[ 8563.846380] current->{active_,}mm->context = 0000000000000fcc

[ 8563.846395] current->{active_,}mm->pgd = ffff8000d2d3c000

[ 8563.856171] Unable to handle kernel NULL pointer dereference

[ 8563.863274] tsk->{mm,active_mm}->context = 0000000000000fd2

[ 8563.863294] tsk->{mm,active_mm}->pgd = ffff8000d3fc0000

[ 8563.928911] Unable to handle kernel NULL pointer dereference in mna
handler
[ 8563.928935]  at virtual address 00000000000000e7

[ 8563.928955] current->{active_,}mm->context = 0000000000000fde

[ 8563.928972] current->{active_,}mm->pgd = ffff8000d32e8000

[ 8563.952221] Unable to handle kernel NULL pointer dereference in mna
handler
[ 8563.952244]  at virtual address 00000000000000e7

[ 8563.952261] current->{active_,}mm->context = 0000000000000fe3

[ 8563.952278] current->{active_,}mm->pgd = ffff8000d2f54000

[ 8563.954004] Unable to handle kernel NULL pointer dereference in mna
handler
[ 8563.954022]  at virtual address 00000000000000e7

[ 8563.954037] current->{active_,}mm->context = 0000000000000fe5

[ 8563.954053] current->{active_,}mm->pgd = ffff8000d2d5c000

[ 8563.972643] Unable to handle kernel NULL pointer dereference

[ 8563.972660] tsk->{mm,active_mm}->context = 0000000000000fea

[ 8563.972677] tsk->{mm,active_mm}->pgd = ffff8000d31300

These are kernel messages, not OF, so it looks like a kernel problem

Riccardo


Reply to: