Re: nasty bug in /usr/sbin/grub-probe



On 4/1/22 22:03, Stan Johnson wrote:
Hi Dennis,

Unless you already know that your system's memory is ok...


SPARC machines generally have ECC memory, and the firmware diagnostics are
quite well trusted.

However, just for giggles (yes, the NVRAM battery is crap), here is a full
power cycle with POST:


root@hades:~#
root@hades:~# shutdown -h 'now'
root@hades:~# [  OK  ] Removed slic         Stopping Rescue Shell...
         Stopping Load/Save Random Seed...
[  OK  ] Stopped Rescue Shell.
[  OK  ] Stopped target System Initialization.
[  OK  ] Unset automount Arbitrary …s File System Automount Point.
[  OK  ] Stopped target Local Encrypted Volumes.
[  OK  ] Stopped Dispatch Password …ts to Console Directory Watch.
[  OK  ] Stopped target Local Integrity Protected Volumes.
[  OK  ] Stopped target Swaps.
[  OK  ] Stopped target Local Verity Protected Volumes.
         Deactivating swap /dev/dis…_3CD0ZHE200007120K5BC-part2...
         Stopping Record System Boot/Shutdown in UTMP...
[  OK  ] Deactivated swap /dev/disk…00:01:02.0-scsi-0:0:0:0-part2.
[  OK  ] Deactivated swap /dev/disk…6G_3CD0ZHE200007120K5BC-part2.
[  OK  ] Deactivated swap /dev/sda2.
[  OK  ] Stopped ifup for eth0.
[  OK  ] Stopped Load/Save Random Seed.
[  OK  ] Deactivated swap /dev/disk…2-e35a-4ff4-b7f5-f7d028c4ca2c.
[  OK  ] Stopped Record System Boot/Shutdown in UTMP.
[  OK  ] Stopped Apply Kernel Variables.
[  OK  ] Stopped Load Kernel Modules.
[  OK  ] Stopped Create Volatile Files and Directories.
[  OK  ] Stopped target Local File Systems.
         Unmounting /boot...
         Unmounting /home...
         Unmounting /run/credentials/systemd-sysusers.service...
         Unmounting /usr/local...
[  OK  ] Unmounted /boot.
[  OK  ] Unmounted /home.
[  OK  ] Unmounted /run/credentials/systemd-sysusers.service.
[  OK  ] Unmounted /usr/local.
[  OK  ] Reached target Unmount All Filesystems.
[  OK  ] Stopped File System Check …6-88fb-4134-88c5-485c34d4614c.
[  OK  ] Stopped File System Check …f-421e-477d-b5e3-830365a86b19.
[  OK  ] Stopped File System Check …9-00f1-497b-8b83-d726e914d044.
[  OK  ] Removed slice Slice /system/systemd-fsck.
[  OK  ] Stopped target Preparation for Local File Systems.
[  OK  ] Stopped Create Static Device Nodes in /dev.
[  OK  ] Stopped Create System Users.
[  OK  ] Stopped Remount Root and Kernel File Systems.
[  OK  ] Reached target System Shutdown.
[  OK  ] Reached target Late Shutdown Services.
[  OK  ] Finished System Power Off.
[  OK  ] Reached target System Power Off.
[ 4732.049137] systemd-shutdown[1]: Syncing filesystems and block devices.
[ 4732.541119] systemd-shutdown[1]: Sending SIGTERM to remaining processes...
[ 4732.650924] systemd-journald[183]: Received SIGTERM from PID 1 (systemd-shutdow).
[ 4732.886493] systemd-shutdown[1]: Sending SIGKILL to remaining processes...
[ 4732.995664] systemd-shutdown[1]: Unmounting file systems.
[ 4733.075318] [308]: Remounting '/' read-only with options 'errors=remount-ro'.
[ 4733.223777] EXT4-fs (sda4): re-mounted. Opts: errors=remount-ro. Quota mode: none.
[ 4733.335759] systemd-shutdown[1]: All filesystems unmounted.
[ 4733.409207] systemd-shutdown[1]: Deactivating swaps.
[ 4733.474932] systemd-shutdown[1]: All swaps deactivated.
[ 4733.543741] systemd-shutdown[1]: Detaching loop devices.
[ 4733.614459] systemd-shutdown[1]: All loop devices detached.
[ 4733.687858] systemd-shutdown[1]: Stopping MD devices.
[ 4733.755436] systemd-shutdown[1]: All MD devices stopped.
[ 4733.825318] systemd-shutdown[1]: Detaching DM devices.
[ 4733.893652] systemd-shutdown[1]: All DM devices detached.
[ 4733.964778] systemd-shutdown[1]: All filesystems, swaps, loop devices, MD devices and DM devices detached.
[ 4734.136742] systemd-shutdown[1]: Syncing filesystems and block devices.
[ 4734.227936] systemd-shutdown[1]: Powering off.
[ 4734.286653] sd 0:0:1:0: [sdb] Synchronizing SCSI cache
[ 4734.354622] sd 0:0:0:0: [sda] Synchronizing SCSI cache
[ 4734.423032] reboot: Power down
lom>
LOM event: power off
lom>

Get a coffee or whiskey or both and let the electrons settle...


lom>
LOM event: power on

ps/2 kbd check: 0000.0000.0000.00fe
Checking Sun KB Done
%o0 = 0000.0000.0055.4001

Executing Power On SelfTest


SPARCengine(tm)Ultra CP 1500 POST 1.17 ME created 03/06/00
 WARRNING: NVRAM battery is either bad or just replaced!
Time Stamp [hour:min:sec] 33:30:02

Init POST BSS
        Init System BSS

Probing system keyboard : Done
DMMU TLB Tags
        DMMU TLB Tag Access Test
DMMU TLB RAM
        DMMU TLB RAM Access Test
Ecache Tests
        Probe Ecache
        ecache_size = 0x00200000
        Ecache RAM Addr Test
        Ecache Tag Addr Test
        Ecache RAM Test
        Ecache Tag Test
        Invalidate Ecache Tags
All CPU Basic Tests
        V9 Instruction Test
        CPU Tick and Tick Compare Reg Test
        CPU Soft Trap Test
        CPU Softint Reg and Int Test
All Basic MMU Tests
        DMMU Primary Context Reg Test
        DMMU Secondary Context Reg Test
        DMMU TSB Reg Test
        DMMU Tag Access Reg Test
        DMMU VA Watchpoint Reg Test
        DMMU PA Watchpoint Reg Test
        IMMU TSB Reg Test
        IMMU Tag Access Reg Test
        IMMU TLB RAM Access Test
        IMMU TLB Tag Access Test
All Basic Cache Tests
        Dcache RAM Test
        Dcache Tag Test
        Icache RAM Test
        Icache Tag Test
        Icache Next Test
        Icache Predecode Test
UltraSPARC IIi MCU Control & Status Regs Init and Tests
        Init UltraSPARC IIi MCU Control & Status Regs
        CPU speed : 440 Mhz, mc1 set : 0x544cb9dd
Memory Probe and Init
        Probe Memory
                INFO: All the memory Group in 10 bit column mode
                Group 0: 256MB
                Group 1: 256MB
                Group 2: 256MB
                Group 3: 256MB
        Malloc Post Memory
        Init Post Memory
..........
        Memory Addr w/ Ecache
        Map PROM/STACK/NVRAM in DMMU
        Load Post In Memory
        Run POST from MEM
        ..........
        loaded POST in memory
        Update Master Stack/Frame Pointers
All FPU Basic Tests
        FPU Regs Test
        FPU State Reg Test
        FPU Functional Test
        FPU Trap Test
Memory Tests
        Init Memory
...............
................
................
................
................
................
................
................
        Memory Addr w/ Ecache Test
        ECC Memory Addr Test
        Block Memory Addr Test
        Block Memory Test
...............
...............

................
................

................
................

................
................

................
................

................
................

................
................

................
................

        ECC Blk Memory Test
...............
...............

................
................

................
................

................
................

................
................

................
................

................
................

................
................

All Basic UltraSPARC IIi PBM Tests
        Init UltraSPARC IIi PBM
        PIO Decoder and BCT Test
        PCI Byte Enable Test
        UltraSPARC IIi IOMMU Regs Test
        UltraSPARC IIi IOMMU RAM NTA Test
        UltraSPARC IIi IOMMU CAM NTA Test
        UltraSPARC IIi IOMMU RAM Address Test
        UltraSPARC IIi IOMMU CAM Address Test
        IOMMU TLB Compare Test
        IOMMU TLB Flush Test
        PBM Control/Status Reg Test
        PBM Diag Reg Test
        UltraSPARC IIi PBM Regs Test
All Advanced CPU Tests
        DMMU Hit/Miss Test
        DMMU Little Endian Test
        IU ASI Access Test
        FPU ASI Access Test
        Ecache Thrash Test
All CPU Error Reporting Tests
        CPU Addr Align Trap Test
        DMMU Access Priv Page Test
        DMMU Write Protected Page Test
All Advanced UltraSPARC IIi PBM Tests
        Init UltraSPARC IIi PBM
        Consist DMA Wr, IOMMU hit Ebus Test
All Basic Cheerio Tests
        Cheerio Ebus PCI Config Space Test
        Cheerio Ethernet PCI Config Space Test
        Cheerio Ebus Engine Reg Test
        Cheerio Init
All Basic I2c Tests
        Init i2c bus
        Thermister Reading Test
        Thermister Position       Readings (in Hex)
        CPU                       0x4e
All Basic PCI-PCI Bridge Tests
        PCI-PCI Bridge Config Space Test
All Basic Symbios 875 SCSI controller Tests
        Symbios 875 SCSI controller PCI Config Space Test

Extended POST:
Start Extended POST : No EXT POST is found


Power On Selftest Completed
    Status  = 0000.0000.0000.0000 ffff.ffff.f100.1db0 019f.3333.3a50.0011

Software Power ON

@(#) SPARCengine(tm)Ultra CP 1500  3.10.27 ME created 2000/06/22 16:45
Enter Checking KB
ps/2 kbd check: 0000.0000.0000.00fe
Checking Sun KB
Clearing E$ Tags  Done
Clearing I/D TLBs Done
Probing Memory
Group Info[0000.0000.0000.0003] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0002] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0001] : 0000.0000.0000.0110
Group Info[0000.0000.0000.0000] : 0000.0000.0000.0110
Done
Clearing Memory...Done
MEM BASE = 0000.0000.3800.0000
MEM SIZE = 0000.0000.0800.0000
MMUs ON
Copy Done
PC = 0000.01ff.f000.30dc
PC = 0000.0000.0000.3120
Decompressing into Memory Done
Size = 0000.0000.0008.7710
ttya initialized
flashprom flashprom Incorrect configuration checksum;
Setting NVRAM parameters to default values.
Setting diag-switch? NVRAM parameter to true
Reset Control: BXIR:0 BPOR:0 SXIR:0 SPOR:1 POR:0
UltraSPARC-IIi Version 9.1 (E$=2 MB) 2-2 module
Advanced PCI Bridge Version 1.3
Probing Memory Group #0 128 + 128 : 256 Megabytes
Probing Memory Group #1 128 + 128 : 256 Megabytes
Probing Memory Group #2 128 + 128 : 256 Megabytes
Probing Memory Group #3 128 + 128 : 256 Megabytes
Initialise 2nd I2c controller
Environmental monitoring:  Enabled
i2c adc gpio gpio
i2c Probing Floppy: No drives detected
Probing /pci@1f,0/pci@1,1 at Device 1  network
Probing /pci@1f,0/pci@1,1 at Device 2  scsi disk tape
Probing /pci@1f,0/pci@1,1 at Device 3  network
Probing /pci@1f,0/pci@1 at Device 1  pci
Probing /pci@1f,0/pci@1/pci@1 at Device 0  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 1  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 2  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 3  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 4  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 5  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 6  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 7  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 8  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device 9  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device a  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device b  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device c  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device d  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device e  Nothing there
Probing /pci@1f,0/pci@1/pci@1 at Device f  Nothing there

Netra t1 (UltraSPARC-IIi 440MHz), No Keyboard
OpenBoot 3.10.27 ME, 1024 MB memory installed, Serial #12731976.
Ethernet address 8:0:20:c2:46:48, Host ID: 80c24648.



Boot device: net  File and args:
Using External Transceiver - Link Up.
3a000
Server IP address: 172.16.35.58
Client IP address: 172.16.35.33
Using External Transceiver - Link Up.
ramdisk-root |

Well, at this point the machine tries to boot the diag-device, which is
simply "net", because the bad NVRAM checksum reset the parameters to their
defaults and set diag-switch? to true. And yes, the machine can netboot fine.

I will break in, stop it, and then boot the local Linux install, which
gives us a machine with well-tested ECC memory.
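
For anyone who wants to sanity-check a captured console log like the one
above, here is a minimal sketch in Python. It assumes the POST output uses
the "Group N: 256MB" format and the OpenBoot banner uses the "... MB memory
installed" format seen in this log. The function names are hypothetical and
are not part of any Sun or Debian tooling. It only confirms that the memory
groups POST reported add up to what OpenBoot says is installed; it does not
test the memory itself.

```python
import re

# Matches POST "Probe Memory" lines such as "Group 0: 256MB".
# It deliberately does not match "Group Info[...]" or "Probing Memory Group #0".
GROUP_RE = re.compile(r"^\s*Group\s+(\d+):\s*(\d+)MB\s*$")

# Matches the OpenBoot banner, e.g. "OpenBoot 3.10.27 ME, 1024 MB memory installed".
BANNER_RE = re.compile(r"(\d+)\s*MB memory installed")


def post_memory_total(log_text):
    """Return the sum, in MB, of all memory groups reported by POST."""
    total = 0
    for line in log_text.splitlines():
        match = GROUP_RE.match(line)
        if match:
            total += int(match.group(2))
    return total


def banner_memory(log_text):
    """Return the MB figure from the OpenBoot banner, or None if absent."""
    match = BANNER_RE.search(log_text)
    return int(match.group(1)) if match else None


# Lines copied from the console log in this message.
SAMPLE = """\
                Group 0: 256MB
                Group 1: 256MB
                Group 2: 256MB
                Group 3: 256MB
OpenBoot 3.10.27 ME, 1024 MB memory installed, Serial #12731976.
"""

if __name__ == "__main__":
    # Both numbers should agree if POST and OpenBoot see the same DIMMs.
    print(post_memory_total(SAMPLE), banner_memory(SAMPLE))
```

On the lines quoted here both figures come out to 1024, so POST and
OpenBoot agree on what is installed.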


--
Dennis Clarke
RISC-V/SPARC/PPC/ARM/CISC
UNIX and Linux spoken
GreyBeard and suspenders optional

