[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Unable to handle kernel NULL pointer dereference



i looked around but could not find any real informnation
on this message. i am trying to re-deploy an older server
with different cpus and different ram(the rest is the
same) yet i am getting these kernel oopses. the ram and
cpus came from other systems running on the same brand/model
of motherboard and tehy ran 24/7/365 for over a year so i
believe they are functional.

a couple months back i ran memtest86 on this machine for
2 weeks straight and it did not detect a single error. i
just started it up again to see if anything has changed.
before it went through a couple hundred passes of the
1GB of memory in the system.

the full kernel oops:

Unable to handle kernel NULL pointer dereference at virtual address
0000006c
current->tss.cr3 = 3ba07000, %cr3 = 3ba07000
*pde = 3ba11067
*pte = 00000000
Oops: 0000
CPU:    1
EIP:    0010:[<c011aaf6>]
EFLAGS: 00010207
eax: 00000000   ebx: ffffffff   ecx: 00000000   edx: 00000000
esi: 00000146   edi: f98d2000   ebp: f98d2000   esp: f9c0ff30
ds: 0018   es: 0018   ss: 0018
Process bash (pid: 278, process nr: 42, stackpage=f9c0f000)
Stack: c01ef7c9 00000146 f9531000 bffffa60 c01ef7b1 f98d2000 bffff9d8
00000007
       f9082800 080bcc9c fb7ff2f0 f9082800 c01efb93 f98d2000 f98d2000
bffff9d8
       f9c0e000 fa350f80 bffff9d8 00005410 ffffffe7 f9c0e000 00000008
00000000
Call Trace: [<c01ef7c9>] [<c01ef7b1>] [<c01efb93>] [<c013404f>]
[<c0109395>] [<c010927c>]
Code: 8b 42 6c 89 c1 85 c0 7e ec 39 72 64 75 e0 89 c3 f0 ff 0d 88


the oops loops over and over until the machine locks up
each time the call trace gets bigger(i guess thats normal?)
e.g.:

(this is the last oops before the lockup)
Unable to handle kernel NULL pointer dereference at virtual address
00000074
current->tss.cr3 = 00101000, %cr3 = 00101000
*pde = 00000000
Oops: 0000
CPU:    1
EIP:    0010:[<c011adf8>]
EFLAGS: 00010207
eax: 0000000b   ebx: 00000000   ecx: f9c0e4c8   edx: 00000000
esi: f9c0e000   edi: 00000074   ebp: c02a4000   esp: f9c0e694
ds: 0018   es: 0018   ss: 0018
Process bash (pid: 278, process nr: 42, stackpage=f9c0f000)
Stack: 00000074 f9c0e000 f9c0e000 00000000 c010b68d c011b30b f9c0e754
00000000
       00000074 f9c0e000 00000074 f9c0e000 00000001 c0100018 f9c00018
ffffffff
       c0109780 c0109787 0000000b c024020e 00000000 f9c0e000 00000000
00000001
Call Trace: [<c010b68d>] [<c011b30b>] [<c0100018>] [<c0109780>]
[<c0109787>] [<c024020e>] [<c02401c0>]
       [<c0111789>] [<c024020e>] [<c01114a8>] [<c0109395>] [<c0110018>]
[<c011adf8>] [<c010b68d>] [<c011b30b>]
       [<c0100018>] [<c0109780>] [<c0109787>] [<c024020e>] [<c02401c0>]
[<c0111789>] [<c024020e>] [<c01114a8>]
       [<c0109395>] [<c0110018>] [<c011adf8>] [<c010b68d>] [<c011b30b>]
[<c0100018>] [<c0109780>] [<c0109787>]
       [<c024020e>] [<c02401c0>] [<c0111789>] [<c024020e>] [<c01114a8>]
[<c0109395>] [<c0110018>] [<c011adf8>]
       [<c010b68d>] [<c011b30b>] [<c0100018>] [<c0109780>] [<c0109787>]
[<c024020e>] [<c02401c0>] [<c0111789>]
       [<c024020e>] [<c01114a8>] [<c0109395>] [<c0110018>] [<c011adf8>]
[<c010b68d>] [<c011b30b>] [<c0100018>]
       [<c0109780>] [<c0109787>] [<c024020e>] [<c02401c0>] [<c0111789>]
[<c024020e>] [<c01114a8>] [<c0109395>]
       [<c0110018>] [<c011adf8>] [<c010b68d>] [<c011b30b>] [<c0100018>]
[<c0109780>] [<c0109787>] [<c024020e>]
       [<c02401c0>] [<c0111789>] [<c024020e>] [<c01114a8>] [<c0114dc4>]
[<c0109395>] [<c011adf8>] [<c011b30b>]
       [<c0100018>] [<c0109780>] [<c0109787>] [<c024020e>] [<c02401c0>]
[<c0111789>] [<c024020e>] [<c01114a8>]
       [<c0114dc4>] [<c0109395>] [<c011adf8>] [<c011b30b>] [<c0100018>]
[<c0109780>] [<c0109787>] [<c024020e>]
       [<c02401c0>] [<c0111789>] [<c024020e>] [<c01114a8>] [<c0109395>]
[<c0110018>] [<c011adf8>] [<c010b68d>]
       [<c011b30b>] [<c0100018>] [<c0109780>] [<c0109787>] [<c024020e>]
[<c02401c0>] [<c0111789>] [<c024020e>]
       [<c01114a8>] [<c0114dc4>] [<c0109395>] [<c011adf8>] [<c011b30b>]
[<c0100018>] [<c0109780>] [<c0109787>]
       [<c024020e>] [<c02401c0>] [<c0111789>] [<c024020e>] [<c01114a8>]
[<c0114dc4>] [<c0109395>] [<c011adf8>]
       [<c011b30b>] [<c0100018>] [<c0109780>] [<c0109787>] [<c024020e>]
[<c02401c0>] [<c0111789>] [<c024020e>]
       [<c01114a8>] [<c0109395>] [<c0110018>] [<c011adf8>] [<c010b68d>]
[<c011b30b>] [<c0100018>] [<c0109780>]
       [<c0109787>] [<c024020e>] [<c02401c0>] [<c0111789>] [<c024020e>]
[<c01114a8>] [<c0109395>] [<c0110018>]
       [<c011adf8>] [<c010b68d>] [<c011b30b>] [<c0100018>] [<c0109780>]
[<c0109787>] [<c024020e>] [<c02401c0>]
       [<c0111789>] [<c024020e>] [<c01114a8>] [<c0114dc4>] [<c0109395>]
[<c011adf8>] [<c011b30b>] [<c0100018>]
       [<c0109780>] [<c0109787>] [<c024020e>] [<c02401c0>] [<c0111789>]
[<c024020e>] [<c01114a8>] [<c0109395>]
       [<c0110018>] [<c011adf8>] [<c010b68d>] [<c011b30b>] [<c0100018>]
[<c0109780>] [<c0109787>] [<c024020e>]
       [<c02401c0>] [<c0111789>] [<c024020e>] [<c01114a8>] [<c0109395>]
[<c0110018>] [<c011adf8>] [<c010b68d>]
       [<c011b30b>] [<c0100018>] [<c0109780>] [<c0109787>] [<c024020e>]
[<c02401c0>] [<c0111789>] [<c024020e>]
       [<c01114a8>] [<c0114dc4>] [<c0109395>] [<c011adf8>] [<c011b30b>]
[<c0100018>] [<c0109780>] [<c0109787>]
       [<c024020e>] [<c02401c0>] [<c0111789>] [<c024020e>] [<c01114a8>]
[<c0109395>] [<c0110018>] [<c011adf8>]
       [<c010b68d>] [<c011b30b>] [<c0100018>] [<c0109780>] [<c0109787>]
[<c024020e>] [<c02401c0>] [<c0111789>]
       [<c024020e>] [<c01114a8>] [<c0114dc4>] [<c0109395>] [<c011adf8>]
[<c011b30b>] [<c0100018>] [<c0109780>]
       [<c0109787>] [<c024020e>] [<c02401c0>] [<c0111789>] [<c024020e>]
[<c01114a8>] [<c0109395>] [<c0110018>]
       [<c011adf8>] [<c010b68d>] [<c011b30b>] [<c0100018>] [<c0109780>]
[<c0109787>] [<c024020e>] [<c02401c0>]
       [<c0111789>] [<c024020e>] [<c01114a8>] [<c0114dc4>] [<c0109395>]
[<c011adf8>] [<c011b30b>] [<c0100018>]
       [<c0109780>] [<c0109787>] [<c024020e>] [<c02401c0>] [<c0111789>]
[<c024020e>] [<c01114a8>] [<c0114dc4>]
       [<c0109395>] [<c011adf8>] [<c011b30b>] [<c0100018>] [<c0109780>]
[<c0109787>] [<c024020e>] [<c02401c0>]
       [<c0111789>] [<c024020e>] [<c01114a8>] [<c0109395>] [<c0110018>]
[<c011adf8>] [<c010b68d>] [<c011b30b>]
       [<c0100018>] [<c0109780>] [<c0109787>] [<c024020e>] [<c02401c0>]
[<c0111789>] [<c024020e>] [<c01114a8>]
       [<c0114dc4>] [<c010b5ea>] [<c0109395>] [<c011adf8>] [<c011b30b>]
[<c0100018>] [<c0109780>] [<c0109787>]
       [<c024020e>] [<c02401c0>] [<c0111789>] [<c024020e>] [<c01114a8>]
[<c0109395>] [<c011adf8>] [<c010b68d>]
       [<c011b30b>] [<c0109780>] [<c0109787>] [<c024020e>] [<c02401c0>]
[<c0111789>] [<c024020e>] [<c01114a8>]
       [<c0109395>] [<c011aaf6>] [<c01ef7c9>] [<c01ef7b1>] [<c01efb93>]
[<c013404f>] [<c0109395>] [<c010927c>]
Code: 39 73 74 75 2d c7 43 50 11 00 00 00 ff 83 dc 04 00 00 a1 d0


it is quite easy to trigger, theres no one program that
does it, i have had a lot of these crashes over the past
week while stress testing the machine. last week i ran
2 instances of cpuburn(1 for each cpu), as well as put bonnie++
in a 150-pass loop writing 1800MB worth of data to the 3ware
raid array. it ran fine for probably 2-3 days(all 150 passes
of bonnie++ were successful), then on the weekend it crashed.
I did not have a serial console hooked up then so i couldn't
see the full error i believe it is the same as this one though.

I think something else might be the problem ..but am not sure
how to determine it. i guess if i can't i can put the old cpu
back in and see what happens ..but i don't expect a change
as these cpus worked fine ...

system config:

Intel L440GX+ Motherboard
Dual P3-550Mhz Processors (or 500Mhz i forget)
4 x 256MB sticks o ram
3Ware 6800 Series IDE Raid controller
6 x maxtor 80GB ide drives connected to controller in raid10(~220GB)
1 x maxtor 30GB ide drive connected to motherboard's PIIX4 ide controller
(the OS is on the 30GB drive)
1 x cdrom connected to motherboard's PIIX4 ide controller
floppy
PC Power & Cooling Turbocool 450Watt ATX power supply
huge 4U chassis with tons and tons of fans blowing, all components
are cool to the touch
running in a climate controlled enviornment(68F)
debian 2.2r5
linux 2.2.19(custom kernel)
1GB swap located on the raid array

with the exception of the serial console support, the kernel
is identical to the kernel I am running in 2 other systems, with
identical hardware with the exception that this one has different
cpus and ram.

in the past when i got a message like this it usually was from
some kernel driver acting up, but the drivers i am using in this
system are all pretty standard, the same ones i'm using on probably
a dozen other systems(same version even)..so i am a little lost
as to how to narrow down this problem.

thanks

nate





-- 
To UNSUBSCRIBE, email to debian-user-request@lists.debian.org 
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org



Reply to: