[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: panics on powerbook3400 & misc problems



On Thu, 2006-06-29 at 13:55 +0200, Michael Schmitz wrote:
> > > ppc603ev@180mhz/80MB RaM/1.GB (after subtracting OS9) hd
> > > macsense aerocard (intersil4) pcmcia + bus card install.
> > > openbox+gnustep(well some of it ...), gtkmm, ...
> >
> > I have a 3400 in a drawer, I'll give it a spin asap with recent kernels.
> 
> On the topic of boot panics: I just noticed cold booting 2.6.17 (pulled
> Tuesday) panics somewhere during knfsd or samba startup. The oops does end
> up in the log so it does not seem fatal
> 
> 2.6.17-rc5 does boot fine under the same circumstances.

Ouch... looks bad... looks like memory corruption to me.

Ben.

> Sample oops:
> 
> 
> 
> Jun 28 14:13:58 michael kernel: input: Mouseemu virtual keyboard as /class/input/input8
> Jun 28 14:13:58 michael kernel: input: Mouseemu virtual mouse as /class/input/input9
> Jun 28 14:14:05 michael kernel: NET: Registered protocol family 5
> Jun 28 14:14:05 michael kernel: Installing knfsd (copyright (C) 1996 okir@monad.swb.de).
> Jun 28 14:14:05 michael kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory
> Jun 28 14:14:05 michael kernel: NFSD: starting 90-second grace period
> Jun 28 14:14:08 michael kernel: Unable to handle kernel paging request for data at address 0x008ee004
> Jun 28 14:14:08 michael kernel: Faulting instruction address: 0xc0067e6c
> Jun 28 14:14:08 michael kernel: Oops: Kernel access of bad area, sig: 11 [#1]
> Jun 28 14:14:08 michael kernel:
> Jun 28 14:14:08 michael kernel: Modules linked in: nfsd exportfs lockd appletalk psnap llc sunrpc uinput i2c_dev snd_powermac therm_adt746x cpufreq_userspace cpufreq_powersave usbhid pcmcia ohci1394 snd_aoa_i2sbus snd_pcm_oss snd_mixer_oss ieee1394 sungem evdev snd_pcm snd_timer snd_page_alloc sungem_phy snd ehci_hcd ohci_hcd yenta_socket rsrc_nonstatic pcmcia_core i2c_powermac uninorth_agp usbcore ide_cd soundcore agpgart snd_aoa_soundbus cdrom unix
> Jun 28 14:14:08 michael kernel: NIP: C0067E6C LR: C0068234 CTR: C0007248
> Jun 28 14:14:08 michael kernel: REGS: efa69d80 TRAP: 0300   Not tainted  (2.6.17)
> Jun 28 14:14:08 michael kernel: MSR: 00001032 <ME,IR,DR>  CR: 28000888  XER: 00000000
> Jun 28 14:14:08 michael kernel: DAR: 008EE004, DSISR: 42000000
> Jun 28 14:14:08 michael kernel: TASK = efe422b0[2345] 'nmbd' THREAD: efa68000
> Jun 28 14:14:08 michael kernel: GPR00: 00100100 EFA69E30 EFE422B0 EFFF6CC0 C08C36E0 EF937D4C EFFF4DA0 EF937000
> Jun 28 14:14:08 michael kernel: GPR08: EFA15000 EF93701C 008EE000 00200200 0FE6A000 10112F3C 100D0000 100D0000
> Jun 28 14:14:08 michael kernel: GPR16: 00000000 100F1848 100D0000 100B0000 10010000 00000001 10010000 7F81CDBC
> Jun 28 14:14:08 michael kernel: GPR24: FFFFFFFF 10010000 10010000 0000003C 00000000 0000002F EFFEECCC EFFF6CC0
> Jun 28 14:14:08 michael kernel: NIP [C0067E6C] free_block+0x90/0x15c
> Jun 28 14:14:08 michael kernel: LR [C0068234] cache_flusharray+0x78/0xac
> Jun 28 14:14:08 michael kernel: Call Trace:
> Jun 28 14:14:08 michael kernel: [EFA69E30] [C0059038] free_pgd_range+0x150/0x184 (unreliable)
> Jun 28 14:14:08 michael kernel: [EFA69E50] [C0068234] cache_flusharray+0x78/0xac
> Jun 28 14:14:08 michael kernel: [EFA69E70] [C0067CD0] kmem_cache_free+0x9c/0xcc
> Jun 28 14:14:08 michael kernel: [EFA69E90] [C005A824] remove_vma+0x58/0x70
> Jun 28 14:14:08 michael kernel: [EFA69EA0] [C005B020] exit_mmap+0xb4/0xe8
> Jun 28 14:14:08 michael kernel: [EFA69ED0] [C0026A98] mmput+0x3c/0xd0
> Jun 28 14:14:08 michael kernel: [EFA69EE0] [C002A2FC] exit_mm+0x174/0x188
> Jun 28 14:14:08 michael kernel: [EFA69F00] [C002B25C] do_exit+0x190/0x77c
> Jun 28 14:14:08 michael kernel: [EFA69F30] [C002B8CC] sys_exit_group+0x0/0x8
> Jun 28 14:14:08 michael kernel: [EFA69F40] [C00110CC] ret_from_syscall+0x0/0x38
> Jun 28 14:14:08 michael kernel: --- Exception: c01 at 0xfc31bd0
> Jun 28 14:14:08 michael kernel:     LR = 0xfcf2250
> Jun 28 14:14:08 michael kernel: Instruction dump:
> Jun 28 14:14:08 michael kernel: 80e4001c 5789103a 3c000010 7d29fa14 3d600020 81470000 60000100 616b0200
> Jun 28 14:14:08 michael kernel: 81070004 80c90014 3927001c 91480000 <910a0004> 90070000 91670004 8167000c
> Jun 28 14:14:08 michael kernel:  <1>Fixing recursive fault but reboot is needed!
> Jun 28 14:14:13 michael kernel: Unable to handle kernel paging request for data at address 0x008ee004
> Jun 28 14:14:13 michael kernel: Faulting instruction address: 0xc0067e6c
> Jun 28 14:14:13 michael kernel: Oops: Kernel access of bad area, sig: 11 [#2]
> Jun 28 14:14:13 michael kernel:
> Jun 28 14:14:13 michael kernel: Modules linked in: nfsd exportfs lockd appletalk psnap llc sunrpc uinput i2c_dev snd_powermac therm_adt746x cpufreq_userspace cpufreq_powersave usbhid pcmcia ohci1394 snd_aoa_i2sbus snd_pcm_oss snd_mixer_oss ieee1394 sungem evdev snd_pcm snd_timer snd_page_alloc sungem_phy snd ehci_hcd ohci_hcd yenta_socket rsrc_nonstatic pcmcia_core i2c_powermac uninorth_agp usbcore ide_cd soundcore agpgart snd_aoa_soundbus cdrom unix
> Jun 28 14:14:13 michael kernel: NIP: C0067E6C LR: C0067FD0 CTR: 00000000
> Jun 28 14:14:13 michael kernel: REGS: effcde50 TRAP: 0300   Not tainted  (2.6.17)
> Jun 28 14:14:13 michael kernel: MSR: 00001032 <ME,IR,DR>  CR: 28008088  XER: 20000000
> Jun 28 14:14:13 michael kernel: DAR: 008EE004, DSISR: 42000000
> Jun 28 14:14:13 michael kernel: TASK = effc0b90[3] 'events/0' THREAD: effcc000
> Jun 28 14:14:13 michael kernel: GPR00: 00100100 EFFCDF00 EFFC0B90 EFFF6CC0 C08C36E0 EF937D4C EFFF4DA0 EF937000
> Jun 28 14:14:13 michael kernel: GPR08: EFA15000 EF93701C 008EE000 00200200 00000000 00000000 00000000 00000000
> Jun 28 14:14:13 michael kernel: GPR16: 00000000 00000000 00000000 00000000 00000000 41400000 0164F790 018C8028
> Jun 28 14:14:13 michael kernel: GPR24: 00000000 002C4000 41400000 00000018 00000000 00000017 EFFEEC6C EFFF6CC0
> Jun 28 14:14:13 michael kernel: NIP [C0067E6C] free_block+0x90/0x15c
> Jun 28 14:14:13 michael kernel: LR [C0067FD0] drain_array+0x98/0xd8
> Jun 28 14:14:13 michael kernel: Call Trace:
> Jun 28 14:14:13 michael kernel: [EFFCDF00] [C0067848] kmem_freepages+0x98/0xdc (unreliable)
> Jun 28 14:14:13 michael kernel: [EFFCDF20] [C0067FD0] drain_array+0x98/0xd8
> Jun 28 14:14:13 michael kernel: [EFFCDF40] [C006805C] cache_reap+0x4c/0x1ac
> Jun 28 14:14:13 michael kernel: [EFFCDF60] [C0039B74] run_workqueue+0xa4/0x108
> Jun 28 14:14:13 michael kernel: [EFFCDF70] [C0039D88] worker_thread+0xe4/0x12c
> Jun 28 14:14:13 michael kernel: [EFFCDFC0] [C003DAA0] kthread+0xc4/0x100
> Jun 28 14:14:13 michael kernel: [EFFCDFF0] [C0012288] kernel_thread+0x44/0x60
> Jun 28 14:14:13 michael kernel: Instruction dump:
> Jun 28 14:14:13 michael kernel: 80e4001c 5789103a 3c000010 7d29fa14 3d600020 81470000 60000100 616b0200
> Jun 28 14:14:13 michael kernel: 81070004 80c90014 3927001c 91480000 <910a0004> 90070000 91670004 8167000c
> Jun 28 14:14:13 michael kernel:  BUG: events/0/3, lock held at task exit time!
> Jun 28 14:14:13 michael kernel:  [c025e238] {cache_chain_mutex}
> Jun 28 14:14:13 michael kernel: .. held by:          events/0:    3 [effc0b90, 110]
> Jun 28 14:14:13 michael kernel: ... acquired at:               cache_reap+0x1c/0x1ac
> 
> Slightly different one:
> 
> Jun 28 14:15:48 michael kernel: input: Mouseemu virtual keyboard as /class/input/input8
> Jun 28 14:15:48 michael kernel: input: Mouseemu virtual mouse as /class/input/input9
> Jun 28 14:15:55 michael kernel: NET: Registered protocol family 5
> Jun 28 14:15:55 michael kernel: Installing knfsd (copyright (C) 1996 okir@monad.swb.de).
> Jun 28 14:15:56 michael kernel: kernel BUG in page_remove_rmap at mm/rmap.c:522!
> Jun 28 14:15:56 michael kernel: Oops: Exception in kernel mode, sig: 5 [#1]
> Jun 28 14:15:56 michael kernel:
> Jun 28 14:15:56 michael kernel: Modules linked in: nfsd exportfs appletalk psnap llc lockd sunrpc uinput i2c_dev snd_powermac therm_adt746x cpufreq_userspace cpufreq_powersave usbhid pcmcia snd_aoa_i2sbus snd_pcm_oss snd_mixer_oss ohci1394 ehci_hcd ohci_hcd ieee1394 snd_pcm evdev i2c_powermac sungem sungem_phy yenta_socket rsrc_nonstatic pcmcia_core usbcore snd_timer snd_page_alloc uninorth_agp agpgart ide_cd snd soundcore snd_aoa_soundbus cdrom unix
> Jun 28 14:15:56 michael kernel: NIP: C005E284 LR: C00594A0 CTR: 00000006
> Jun 28 14:15:56 michael kernel: REGS: efa3fd60 TRAP: 0700   Not tainted  (2.6.17)
> Jun 28 14:15:56 michael kernel: MSR: 00029032 <EE,ME,IR,DR>  CR: 22004448  XER: 00000000
> Jun 28 14:15:56 michael kernel: TASK = efe4e2d0[2308] 'grep' THREAD: efa3e000
> Jun 28 14:15:56 michael kernel: GPR00: FFFFFFFF EFA3FE10 EFE4E2D0 C02F0400 30000000 2F9FE000 00000001 40000000
> Jun 28 14:15:56 michael kernel: GPR08: 00000000 00000001 00009032 FFFFFFFE 0FFE98C8 10031130 C029BDE0 00000000
> Jun 28 14:15:56 michael kernel: GPR16: FFFFFFFF EFA1E300 30019000 00378EE5 30019000 EFC79DE0 00000000 EFA3E000
> Jun 28 14:15:56 michael kernel: GPR24: 00FA0307 00000000 EFA25F44 30019000 30000000 C02F0400 FFFFFFFD EF9FE000
> Jun 28 14:15:56 michael kernel: NIP [C005E284] page_remove_rmap+0x38/0x58
> Jun 28 14:15:56 michael kernel: LR [C00594A0] unmap_vmas+0x388/0x5f4
> Jun 28 14:15:56 michael kernel: Call Trace:
> Jun 28 14:15:56 michael kernel: [EFA3FE10] [C006086C] free_page_and_swap_cache+0x50/0x64 (unreliable)
> Jun 28 14:15:56 michael kernel: [EFA3FE20] [C00594A0] unmap_vmas+0x388/0x5f4
> Jun 28 14:15:56 michael kernel: [EFA3FEA0] [C005AFCC] exit_mmap+0x60/0xe8
> Jun 28 14:15:56 michael kernel: [EFA3FED0] [C0026A98] mmput+0x3c/0xd0
> Jun 28 14:15:56 michael kernel: [EFA3FEE0] [C002A2FC] exit_mm+0x174/0x188
> Jun 28 14:15:56 michael kernel: [EFA3FF00] [C002B25C] do_exit+0x190/0x77c
> Jun 28 14:15:56 michael kernel: [EFA3FF30] [C002B8CC] sys_exit_group+0x0/0x8
> Jun 28 14:15:56 michael kernel: [EFA3FF40] [C00110CC] ret_from_syscall+0x0/0x38
> Jun 28 14:15:56 michael kernel: --- Exception: c01 at 0xff2cbd0
> Jun 28 14:15:56 michael kernel:     LR = 0xffed250
> Jun 28 14:15:56 michael kernel: Instruction dump:
> Jun 28 14:15:56 michael kernel: 39230008 90010014 3800ffff 7d604828 7d605a14 7d60492d 40a2fff4 2f8b0000
> Jun 28 14:15:56 michael kernel: 40bc0020 81230008 39290001 55290ffe <0f090000> 38600010 3880ffff 4bfeea91
> Jun 28 14:15:56 michael kernel:  <1>Fixing recursive fault but reboot is needed!
> Jun 28 14:15:56 michael kernel: BUG: scheduling while atomic: grep/0x00000001/2308
> Jun 28 14:15:56 michael kernel: Call Trace:
> Jun 28 14:15:56 michael kernel: [EFA3FBC0] [C0007D9C] show_stack+0x54/0x174 (unreliable)
> Jun 28 14:15:56 michael kernel: [EFA3FBF0] [C01E2910] schedule+0x48/0x608
> Jun 28 14:15:56 michael kernel: [EFA3FC20] [C002B198] do_exit+0xcc/0x77c
> Jun 28 14:15:56 michael kernel: [EFA3FC50] [C000FA84] kernel_bad_stack+0x0/0x4c
> Jun 28 14:15:56 michael kernel: [EFA3FC70] [C000FC4C] _exception+0x38/0xd8
> Jun 28 14:15:56 michael kernel: [EFA3FD10] [C0010390] program_check_exception+0x4bc/0x4e0
> Jun 28 14:15:56 michael kernel: [EFA3FD50] [C0011728] ret_from_except_full+0x0/0x4c
> Jun 28 14:15:56 michael kernel: --- Exception: 700 at page_remove_rmap+0x38/0x58
> Jun 28 14:15:56 michael kernel:     LR = unmap_vmas+0x388/0x5f4
> Jun 28 14:15:56 michael kernel: [EFA3FE10] [C006086C] free_page_and_swap_cache+0x50/0x64 (unreliable)
> Jun 28 14:15:56 michael kernel: [EFA3FE20] [C00594A0] unmap_vmas+0x388/0x5f4
> Jun 28 14:15:56 michael kernel: [EFA3FEA0] [C005AFCC] exit_mmap+0x60/0xe8
> Jun 28 14:15:56 michael kernel: [EFA3FED0] [C0026A98] mmput+0x3c/0xd0
> Jun 28 14:15:56 michael kernel: [EFA3FEE0] [C002A2FC] exit_mm+0x174/0x188
> Jun 28 14:15:56 michael kernel: [EFA3FF00] [C002B25C] do_exit+0x190/0x77c
> Jun 28 14:15:56 michael kernel: [EFA3FF30] [C002B8CC] sys_exit_group+0x0/0x8
> Jun 28 14:15:56 michael kernel: [EFA3FF40] [C00110CC] ret_from_syscall+0x0/0x38
> Jun 28 14:15:56 michael kernel: --- Exception: c01 at 0xff2cbd0
> Jun 28 14:15:56 michael kernel:     LR = 0xffed250
> 
> I've even got it to confess to a kernel stack overrun.
> 
> Any obvious changes to look at?
> 
> 	Michael



Reply to: