[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#827557: Acknowledgement (linux-image-4.6.0-1-armmp-lpae: 4.6 armmp kernel fails to boot on nvidia jetson (4.5 works))



On 2016-06-17 18:27 +0000, Debian Bug Tracking System wrote:

Using the latest 4.6.3-1 version of this package does not change the issue.

Here is the output from the failed boot:

----------------------
MMC: no card present
switch to partitions #0, OK
mmc0(part 0) is current device
Scanning mmc 0...
Found U-Boot script /boot.scr
2440 bytes read in 27 ms (87.9 KiB/s)
## Executing script at 90000000
3637048 bytes read in 169 ms (20.5 MiB/s)
64138 bytes read in 86 ms (727.5 KiB/s)
16334316 bytes read in 480 ms (32.5 MiB/s)
Booting Debian 4.6.0-1-armmp from mmc 0:1...
Kernel image @ 0x81000000 [ 0x000000 - 0x377f38 ]
## Flattened Device Tree blob at 82000000
   Booting using the fdt blob at 0x82000000
   Using Device Tree in place at 82000000, end 82012a89
Starting kernel ...

[    0.000000] Tegra clk 127: register failed with -17
[    0.068144] /cpus/cpu@0 missing clock-frequency property
[    0.068194] /cpus/cpu@1 missing clock-frequency property
[    0.068239] /cpus/cpu@2 missing clock-frequency property
[    0.068286] /cpus/cpu@3 missing clock-frequency property
[    0.158867] +USB0_VBUS_SW: Failed to request enable GPIO108: -517
[    0.158885] reg-fixed-voltage regulators:regulator@7: Failed to register regulator: -517
[    0.159005] +5V_USB_HS: Failed to request enable GPIO109: -517
[    0.159021] reg-fixed-voltage regulators:regulator@8: Failed to register regulator: -517
[    0.159243] +1.05V_RUN_AVDD_HDMI_PLL: Failed to request enable GPIO63: -517
[    0.159258] reg-fixed-voltage regulators:regulator@11: Failed to register regulator: -517
[    0.159375] +5V_HDMI_CON: Failed to request enable GPIO86: -517
[    0.159391] reg-fixed-voltage regulators:regulator@12: Failed to register regulator: -517
[    0.159508] +5V_SATA: Failed to request enable GPIO242: -517
[    0.159523] reg-fixed-voltage regulators:regulator@13: Failed to register regulator: -517
[    0.159638] +12V_SATA: Failed to request enable GPIO242: -517
[    0.159654] reg-fixed-voltage regulators:regulator@14: Failed to register regulator: -517
[    1.720038] tegra-pcie 1003000.pcie-controller: Failed to get supply 'avddio-pex': -517
[    1.918190] tegra-pcie 1003000.pcie-controller: Slot present pin change, signature: 00000008
[    2.330537] tegra-pcie 1003000.pcie-controller: link 0 down, retrying
[    2.743653] tegra-pcie 1003000.pcie-controller: link 0 down, retrying
[    3.156872] tegra-pcie 1003000.pcie-controller: link 0 down, retrying
[    3.167450] tegra-pcie 1003000.pcie-controller: Slot present pin change, signature: 00000000
[    3.589677] mmc0: Unknown controller version (3). You may experience problems.
[    3.650454] mmc1: Unknown controller version (3). You may experience problems.
  WARNING: Failed to connect to lvmetad. Falling back to device scanning.
  /dev/mmcblk1rpmb: read failed after 0 of 4096 at 0: Input/output error
  /dev/mmcblk1rpmb: read failed after 0 of 4096 at 4128768: Input/output error
  /dev/mmcblk1rpmb: read failed after 0 of 4096 at 4186112: Input/output error
  /dev/mmcblk1rpmb: read failed after 0 of 4096 at 4096: Input/output error
  /dev/mapper/tegra-debian--root: recovering journal
  /dev/mapper/tegra-debian--root: clean, 39411/1831424 files, 434011/7323648 blocks
[   13.311582] tegra-snd-rt5640 sound: ASoC: CPU DAI (null) not registered
[   13.318332] tegra-snd-rt5640 sound: snd_soc_register_card failed (-517)
[   13.328334] tegra-snd-rt5640 sound: ASoC: CPU DAI (null) not registered
[   13.335000] tegra-snd-rt5640 sound: snd_soc_register_card failed (-517)
[   13.625987] tegra-snd-rt5640 sound: ASoC: CPU DAI (null) not registered
[   13.632660] tegra-snd-rt5640 sound: snd_soc_register_card failed (-517)
-----------------------

At which point it seems to have hung.


A successful boot with 4.5.0 looks like this:

-----------------------
MMC: no card present
switch to partitions #0, OK
mmc0(part 0) is current device
Scanning mmc 0...
Found U-Boot script /boot.scr
2445 bytes read in 27 ms (87.9 KiB/s)
## Executing script at 90000000
** File not found /vmlinuz-4.6.0-1-armmp-lpae **
3655488 bytes read in 217 ms (16.1 MiB/s)
64138 bytes read in 112 ms (558.6 KiB/s)
16263429 bytes read in 472 ms (32.9 MiB/s)
Booting Debian from mmc 0:1...
Kernel image @ 0x81000000 [ 0x000000 - 0x37c740 ]
## Flattened Device Tree blob at 82000000
   Booting using the fdt blob at 0x82000000

EHCI failed to shut down host controller.
   Using Device Tree in place at 82000000, end 82012a89

Starting kernel ...

[    0.000000] L2C: failed to init: -19
[    0.000000] Tegra clk 127: register failed with -17
[    0.055528] /cpus/cpu@0 missing clock-frequency property
[    0.055576] /cpus/cpu@1 missing clock-frequency property
[    0.055621] /cpus/cpu@2 missing clock-frequency property
[    0.055666] /cpus/cpu@3 missing clock-frequency property
[    0.141445] +USB0_VBUS_SW: Failed to request enable GPIO108: -517
[    0.141599] reg-fixed-voltage regulators:regulator@7: Failed to register regulator: -517
[    0.141900] +5V_USB_HS: Failed to request enable GPIO109: -517
[    0.142048] reg-fixed-voltage regulators:regulator@8: Failed to register regulator: -517
[    0.142453] +1.05V_RUN_AVDD_HDMI_PLL: Failed to request enable GPIO63: -517
[    0.142598] reg-fixed-voltage regulators:regulator@11: Failed to register regulator: -517
[    0.142893] +5V_HDMI_CON: Failed to request enable GPIO86: -517
[    0.143036] reg-fixed-voltage regulators:regulator@12: Failed to register regulator: -517
[    0.143331] +5V_SATA: Failed to request enable GPIO242: -517
[    0.143475] reg-fixed-voltage regulators:regulator@13: Failed to register regulator: -517
[    0.143770] +12V_SATA: Failed to request enable GPIO242: -517
[    0.143913] reg-fixed-voltage regulators:regulator@14: Failed to register regulator: -517
[    1.430560] kvm [1]: HYP mode not available
[    1.454606] tegra-pcie 1003000.pcie-controller: Failed to get supply 'avddio-pex': -517
[    1.648853] tegra-pcie 1003000.pcie-controller: Slot present pin change, signature: 00000008
[    2.062181] tegra-pcie 1003000.pcie-controller: link 0 down, retrying
[    2.475511] tegra-pcie 1003000.pcie-controller: link 0 down, retrying
[    2.888914] tegra-pcie 1003000.pcie-controller: link 0 down, retrying
[    2.899486] tegra-pcie 1003000.pcie-controller: Slot present pin change, signature: 00000000
[    3.265821] mmc0: Unknown controller version (3). You may experience problems.
[    3.318975] mmc1: Unknown controller version (3). You may experience problems.
  WARNING: Failed to connect to lvmetad. Falling back to device scanning.
  /dev/mmcblk1rpmb: read failed after 0 of 4096 at 0: Input/output error
  /dev/mmcblk1rpmb: read failed after 0 of 4096 at 4128768: Input/output error
  /dev/mmcblk1rpmb: read failed after 0 of 4096 at 4186112: Input/output error
  /dev/mmcblk1rpmb: read failed after 0 of 4096 at 4096: Input/output error
/dev/mapper/tegra-debian--root: recovering journal
/dev/mapper/tegra-debian--root: clean, 39411/1831424 files, 434011/7323648 blocks
[   15.632476] r8169 0000:01:00.0: firmware: failed to load rtl_nic/rtl8168g-2.fw (-2)

Debian GNU/Linux stretch/sid debian-jetson ttyS0

debian-jetson login: 
------------------------

The obvious difference is some output from loading tegra-snd-rt5640
But I don't know if that's why it is dying.


Here is the uboot output for both cases in case it matters:

-----------------
Hit any key to stop autoboot:  0
Tegra124 (Jetson TK1) #
U-Boot SPL 2014.10-rc2-00002-g105d2f3 (Jun 25 2015 - 15:19:50)


U-Boot 2014.10-rc2-00002-g105d2f3 (Jun 25 2015 - 15:19:50)

TEGRA124
Board: NVIDIA Jetson TK1
I2C:   ready
DRAM:  2 GiB
MMC:   Tegra SD/MMC: 0, Tegra SD/MMC: 1
tegra-pcie: PCI regions:
tegra-pcie:   I/O: 0x12000000-0x12010000
tegra-pcie:   non-prefetchable memory: 0x13000000-0x20000000
tegra-pcie:   prefetchable memory: 0x20000000-0x40000000
tegra-pcie: 2x1, 1x1 configuration
��Ʌ�pcie: probing port 0, using 2 lanes
tegra-pcie: link 0 down, retrying
tegra-pcie: link 0 down, retrying
tegra-pcie: link 0 down, retrying
tegra-pcie: link 0 down, ignoring
tegra-pcie: probing port 1, using 1 lanes
In:    serial
Out:   serial
Err:   serial
Net:   RTL8169#0
Hit any key to stop autoboot:  0
------------------


lsmod after working boot:
Module                  Size  Used by
snd_hda_codec_hdmi     41587  1
snd_hda_tegra           5908  0
snd_hda_codec          96551  2 snd_hda_codec_hdmi,snd_hda_tegra
snd_hda_core           39592  3 snd_hda_codec_hdmi,snd_hda_codec,snd_hda_tegra
snd_hwdep               5791  1 snd_hda_codec
snd_pcm                79829  4 snd_hda_codec_hdmi,snd_hda_codec,snd_hda_tegra,snd_hda_core
sg                     22235  0
tegra_drm             134421  1
evdev                  11844  1
host1x                 93415  1 tegra_drm
snd_timer              19728  1 snd_pcm
snd                    53893  6 snd_hwdep,snd_timer,snd_hda_codec_hdmi,snd_pcm,snd_hda_codec,snd_hda_tegra
soundcore               5507  1 snd
drm_kms_helper        105176  1 tegra_drm
at24                    5585  0
drm                   276666  3 tegra_drm,drm_kms_helper
autofs4                31151  2
ext4                  559450  3
ecb                     2191  0
crc16                   1274  1 ext4
mbcache                 9488  1 ext4
jbd2                   95959  1 ext4
crc32c_generic          1862  4
dm_mod                 98607  10
sd_mod                 32241  3
ahci_tegra              3707  2
libahci_platform        6494  1 ahci_tegra
libahci                22865  2 libahci_platform,ahci_tegra
libata                181479  3 libahci,libahci_platform,ahci_tegra
sdhci_tegra             4932  0
sdhci_pltfm             3786  1 sdhci_tegra
phy_tegra_usb           8822  0
usb_common              3659  1 phy_tegra_usb
sdhci                  39431  2 sdhci_pltfm,sdhci_tegra
scsi_mod              188568  3 sg,libata,sd_mod
r8169                  79227  0
mii                     4102  1 r8169
at24                    5585  0
drm                   276666  3 tegra_drm,drm_kms_helper
autofs4                31151  2
ext4                  559450  3
ecb                     2191  0
crc16                   1274  1 ext4
mbcache                 9488  1 ext4
jbd2                   95959  1 ext4
crc32c_generic          1862  4
dm_mod                 98607  10
sd_mod                 32241  3
ahci_tegra              3707  2
libahci_platform        6494  1 ahci_tegra
libahci                22865  2 libahci_platform,ahci_tegra
libata                181479  3 libahci,libahci_platform,ahci_tegra
sdhci_tegra             4932  0
sdhci_pltfm             3786  1 sdhci_tegra
phy_tegra_usb           8822  0
usb_common              3659  1 phy_tegra_usb
sdhci                  39431  2 sdhci_pltfm,sdhci_tegra
scsi_mod              188568  3 sg,libata,sd_mod
r8169                  79227  0
mii                     4102  1 r8169


enabling ignore_loglevel we find out some more detail about where it goes wrong:
[   15.926802] tegra-snd-rt5640 sound: ASoC: CPU DAI (null) not registered
[   15.933496] tegra-snd-rt5640 sound: snd_soc_register_card failed (-517)
[   15.966289] tegra-snd-rt5640 sound: ASoC: CPU DAI (null) not registered
[   15.973030] tegra-snd-rt5640 sound: snd_soc_register_card failed (-517)
[   15.982022] at24 0-0056: 256 byte 24c02 EEPROM, writable, 8 bytes/write
[   16.007589] tegra-snd-rt5640 sound: ASoC: CPU DAI (null) not registered
[   16.014306] tegra-snd-rt5640 sound: snd_soc_register_card failed (-517)
[  OK  ] Found device /dev/mmcblk1p1.
[   16.263028] tegra-snd-rt5640 sound: ASoC: CPU DAI (null) not registered
[   16.263440] input: tegra-hda HDMI/DP,pcm=3 as /devices/soc0/70030000.hda/sound/card0/input1
[   16.278098] tegra-snd-rt5640 sound: snd_soc_register_card failed (-517)
         Starting File System Check on /dev/mmcblk1p1...
[   16.475671] tegra-snd-rt5640 sound: ASoC: CPU DAI (null) not registered
[   16.482357] tegra-snd-rt5640 sound: snd_soc_register_card failed (-517)
[  OK  ] Started File System Check Daemon to report status.
[  OK  ] Found device RTL8111/8168/8411 PCI ...ess Gigabit Ethernet Controller.

Sugesting that in fact it is initialising the ethernet controller that is going wrong?

further back in the log we have:
[    4.763377] mii: module verification failed: signature and/or required key missing - tainting kernel
[    4.781616] r8169 Gigabit Ethernet driver 2.3LK-NAPI loaded
[    4.792208] r8169 0000:01:00.0: enabling device (0140 -> 0143)
[    4.838932] r8169 0000:01:00.0 eth0: RTL8168g/8111g at 0xf087c000, 00:04:4b:25:ca:3f, XID 0c000800 IRQ 388
[    4.848662] r8169 0000:01:00.0 eth0: jumbo features [frames: 9200 bytes, tx checksumming: ko]   

so the kernel-level ethernet init was fine. This looks like the systemd-level init. Right?

I've managed to avoid systemd so far, but I guess I'm going to have to find out how it works.

Why would this work OK with kernel 4.5, but not 4.6? The userspace
should be the same, although I presume we are still in the initrd at
this point, and those could differ in some important way?

Clues welcome.

Wookey
-- 
Principal hats:  Linaro, Debian, Wookware, ARM
http://wookware.org/

Attachment: signature.asc
Description: Digital signature


Reply to: