Bug#275999: Reboot fails with ROOT on a raid1 partition with a kernel 2.6
Package: installation-reports
Debian-installer-version: PRE RC-2
uname -a: Linux zegal 2.6.8-1-386 #1 Mon Sep 13 23:29:55 EDT 2004 i586 GNU/Linux
Date: 11 oct 2004 ~ 11 H 00 (GMT+2)
Method: expert26
NetInstall
From SCSI CDRW Drive
Local Network without DHCP and without Proxy
Machine: PackardBell MultimediaGL. I changed some hardware on this
box in order to have a new gateway for my personnal usage.
Processor: Pentium 200 MMX
Memory: 48 Mb
Root Device: IDE /dev/md0
Root Size/partition table:
Disk /dev/hda: 4311 MB, 4311982080 bytes
15 heads, 63 sectors/track, 8912 cylinders, total 8421840 sectors
Units = sectors of 1 * 512 = 512 bytes
Device Boot Start End Blocks Id System
/dev/hda1 * 63 408239 204088+ fd Linux raid autodetect
/dev/hda2 408240 8421839 4006800 f W95 Ext'd (LBA)
/dev/hda5 408303 3568319 1580008+ fd Linux raid autodetect
/dev/hda6 3568383 4931954 681786 83 Linux
/dev/hda7 4932018 5201279 134631 82 Linux swap
/dev/hda8 5201343 5443199 120928+ 83 Linux
/dev/hda9 5443263 8421839 1489288+ 83 Linux
Disk /dev/hdc: 2160 MB, 2160431104 bytes
15 heads, 63 sectors/track, 4465 cylinders, total 4219592 sectors
Units = sectors of 1 * 512 = 512 bytes
Device Boot Start End Blocks Id System
/dev/hdc1 63 3161024 1580481 fd Linux raid autodetect
/dev/hdc2 3161025 3525794 182385 fd Linux raid autodetect
/dev/hdc3 3525795 4219424 346815 83 Linux
content of /etc/mdadm/mdadm.conf:
DEVICE partitions
ARRAY /dev/md1 level=raid1 num-devices=2 UUID=7e1d9927:1620cd05:82e23a2f:eb86dbd0
devices=/dev/hda5,/dev/hdc1
ARRAY /dev/md0 level=raid1 num-devices=2 UUID=520029b9:0014b5b3:75abf49a:67bf05f6
devices=/dev/hda1,/dev/hdc2
# /etc/fstab: static file system information.
#
# <file system> <mount point> <type> <options> <dump> <pass>
proc /proc proc defaults 0 0
/dev/md0 / ext3 defaults,errors=remount-ro 0 1
/dev/hda9 /home ext3 defaults 0 2
/dev/hda8 /tmp ext3 defaults 0 2
/dev/md1 /usr ext3 defaults 0 2
/dev/hda6 /var ext3 defaults 0 2
/dev/hda7 none swap sw 0 0
/dev/scd0 /media/cdrom0 iso9660 ro,user,noauto 0 0
/dev/fd0 /media/floppy0 auto rw,user,noauto 0 0
Output of lspci:
0000:00:00.0 Host bridge: Intel Corp. 430VX - 82437VX TVX [Triton VX] (rev 02)
0000:00:07.0 ISA bridge: Intel Corp. 82371SB PIIX3 ISA [Natoma/Triton II] (rev 01)
0000:00:07.1 IDE interface: Intel Corp. 82371SB PIIX3 IDE [Natoma/Triton II]
0000:00:07.2 USB Controller: Intel Corp. 82371SB PIIX3 USB [Natoma/Triton II] (rev 01)
0000:00:08.0 VGA compatible controller: S3 Inc. 86c764/765 [Trio32/64/64V+] (rev 54)
0000:00:11.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8029(AS)
0000:00:13.0 SCSI storage controller: Advanced System Products, Inc ABP940-U / ABP960-U (rev 03)
Output of lspci -n:
0000:00:00.0 0600: 8086:7030 (rev 02)
0000:00:07.0 0601: 8086:7000 (rev 01)
0000:00:07.1 0101: 8086:7010
0000:00:07.2 0c03: 8086:7020 (rev 01)
0000:00:08.0 0300: 5333:8811 (rev 54)
0000:00:11.0 0200: 10ec:8029
0000:00:13.0 0100: 10cd:1300 (rev 03)
Base System Installation Checklist:
[O] = OK, [E] = Error (please elaborate below), [ ] = didn't try it
Initial boot worked: [O]
Configure network HW: [O]
Config network: [O]
Detect CD: [O]
Load installer modules: [O]
Detect hard drives: [O]
Partition hard drives: [O]
Create file systems: [O]
Mount partitions: [O]
Install base system: [O]
Install boot loader: [O]
Reboot: [E] Kernel hangs on raid1 root partition[1]
Comments/Problems:
The installation failed with the kernel 2.6.8-1-386 and a raid1 root
partition. The kernel hangs[1] with the default initird. To solve the
problem, I booted on a root backup stored on /dev/hdc3. Then I made
a mkinitrd as it's written in the documentation of mdadm. That is to say:
1) add md and raid1 into /etc/mkinitrd/modules
md
raid1
2) change ROOT on /etc/mkinitrd.conf
ROOT=/dev/md0
3) run mkinitrd -o /boot/initrd.img-2.6.8-raid
4) modify /boot/grub/menu.lst to specify to use the initrd I had
created
[1] :
[...]
serio: i8042 AUX port at 0x60,0x64 irq 12
serio: i8042 KBD port at 0x60,0x64 irq 1
input: AT Translated Set 2 keyboard on isa0060/serio0
EISA: Probing bus 0 at eisa0
EISA: Detected 9 cards.
NET: Registered protocol family 2
IP: routing cache hash table of 512 buckets, 4Kbytes
TCP: Hash tables configured (established 4096 bind 8192)
NET: Registered protocol family 8
NET: Registered protocol family 20
RAMDISK: cramfs filesystem found at block 0
RAMDISK: Loading 4360 blocks [1 disk] into ram disk... done.
VFS: Mounted root (cramfs filesystem) readonly.
Unable to handle kernel NULL pointer dereference at virtual address 0000000c
printing eip:
c024b6c8
*pde = 00000000
Oops: 0000 [#1]
PREEMPT
Modules linked in:
CPU: 0
EIP: 0060:[<c024b6c8>] Not tainted
EFLAGS: 00010002 (2.6.8-1-386)
EIP is at __down+0x40/0xef
eax: 00000008 exb: 00000286 ecx: 00000000 edx: c294a000
esi: 00000000 edi: c294be20 ebp: c2fa0b90 esp: c294be0c
ds: 007b es: 007b ss: 0068
Process mount (pid: 194, threadinfo=c294a000 task=c2fa0b90)
Stack: 00000001 c2fa0b90 c01161b1 00000000 00000000 00000001 00000000 00000282
c10555e0 c028d8a0 c2aaf000 0037d926 c024b8a0 00000000 00000a64 02aaf000
c017fa61 c2ba4800 00000000 c10555e0 c29503b8 00000000 c2b9e420 c012df97
Call Trace:
[<c01161b1>] default_wake_function+0x0/0x12
[<c024b8a0>] __down_failed+0x8/0xc
[<c017fa61>] .text.lock.inode+0x7d/0x8c
[<c012df97>] page_cache_read+0x53/0xb7
[<c012e1c3>] filemap_nopage+0x1c8/0x30d
[<c01393f2>] do_no_page+0xa6/0x2cf
[<c013974f>] handle_mm_fault+0x6c/0x125
[<c0114be1>] do_page_fault+0x14d/0x49f
[<c011990c>] profile_hook+0x1c/0x34
[<c0112ac4>] smp_local_timer_interrupt+0xc/0x90
[<c018e852>] copy_from_user+0x2e/0x54
[<c0104a5f>] sys_set_thread_area+0x72/0x1ac
[<c0114a94>] do_page_fault+0x0/0x49f
[<c0106a19>] error_code+0x2d/0x38
Code: 8b 50 04 8d 4c 24 0c 89 48 04 89 44 24 0c 89 54 24 10 89 0a
<6>note: mount[194] exited with preempt_count 1
Reply to: