[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#696554: System freezes and lockups



Okay, system idling with browser for email, rhythmbox playing music and a terminal to tail syslog and watch sensors:

adt7473-i2c-1-2e
Adapter: nouveau-0000:01:00.0-3
in1:          +3.00 V  (min =  +0.00 V, max =  +2.99 V)
+3.3V:        +3.30 V  (min =  +0.00 V, max =  +4.39 V)
fan1:         674 RPM  (min =    0 RPM)
fan2:           0 RPM  (min =    0 RPM)
fan3:           0 RPM  (min =  164 RPM)  ALARM
temp1:        +41.5°C  (low  = +65.0°C, high = +85.0°C)  ALARM
                       (crit = +97.0°C, hyst = +97.0°C)
Board Temp:   +41.5°C  (low  = +20.0°C, high = +60.0°C)
                       (crit = +100.0°C, hyst = +100.0°C)
temp3:        +41.8°C  (low  = +80.0°C, high = +105.0°C)  ALARM
                       (crit = +107.0°C, hyst = +103.0°C)


Errors are piling up... I feel a crash is imminent!

Dec 25 21:56:55 humonculux kernel: [ 4669.516767] [drm] nouveau 0000:01:00.0: Restoring mode...
Dec 25 22:24:10 humonculux kernel: [ 6306.928469] [drm] nouveau 0000:01:00.0: PFIFO_DMA_PUSHER - Ch 2 Get 0x0020032920 Put 0x0020034f48 IbGet 0x00000c6f IbPut 0x00000c74 State 0x8000f354 (err: INVALID_CMD) Push 0x00400040
Dec 25 22:24:10 humonculux kernel: [ 6306.931752] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR INVALID_ENUM
Dec 25 22:24:10 humonculux kernel: [ 6306.931765] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:10 humonculux kernel: [ 6306.931767] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1344 data 0x0008e6a0
Dec 25 22:24:10 humonculux kernel: [ 6306.931775] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR INVALID_ENUM
Dec 25 22:24:10 humonculux kernel: [ 6306.931776] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:10 humonculux kernel: [ 6306.931778] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1348 data 0x000700b5
Dec 25 22:24:10 humonculux kernel: [ 6306.931785] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR INVALID_ENUM
Dec 25 22:24:10 humonculux kernel: [ 6306.931786] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:10 humonculux kernel: [ 6306.931788] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x134c data 0x00200170
Dec 25 22:24:10 humonculux kernel: [ 6306.931795] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR INVALID_ENUM
Dec 25 22:24:10 humonculux kernel: [ 6306.931796] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:10 humonculux kernel: [ 6306.931797] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1350 data 0x0004e680
Dec 25 22:24:45 humonculux kernel: [ 6341.625573] [drm] nouveau 0000:01:00.0: PFIFO_DMA_PUSHER - Ch 2 Get 0x002002ea3c Put 0x0020030260 IbGet 0x00000d9e IbPut 0x00000d9f State 0x8000e6a4 (err: INVALID_CMD) Push 0x00702031
Dec 25 22:24:45 humonculux kernel: [ 6341.638265] [drm] nouveau 0000:01:00.0: PFIFO_DMA_PUSHER - Ch 2 Get 0x002000f1ec Put 0x002000f1f0 IbGet 0x00000d9f IbPut 0x00000da1 State 0x40000030 (err: INVALID_MTHD) Push 0x00400040
Dec 25 22:24:45 humonculux kernel: [ 6341.639475] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR INVALID_BITFIELD
Dec 25 22:24:45 humonculux kernel: [ 6341.639477] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:45 humonculux kernel: [ 6341.639479] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x15e0 data 0x00000030
Dec 25 22:24:45 humonculux kernel: [ 6341.656092] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR BEGIN_END_ACTIVE
Dec 25 22:24:45 humonculux kernel: [ 6341.656097] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:45 humonculux kernel: [ 6341.656107] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1360 data 0x00000001
Dec 25 22:24:45 humonculux kernel: [ 6341.656115] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR BEGIN_END_ACTIVE
Dec 25 22:24:45 humonculux kernel: [ 6341.656116] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:45 humonculux kernel: [ 6341.656117] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1340 data 0x00008006
Dec 25 22:24:45 humonculux kernel: [ 6341.656125] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR BEGIN_END_ACTIVE
Dec 25 22:24:45 humonculux kernel: [ 6341.656126] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:45 humonculux kernel: [ 6341.656127] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1344 data 0x00004001
Dec 25 22:24:45 humonculux kernel: [ 6341.656135] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR BEGIN_END_ACTIVE
Dec 25 22:24:45 humonculux kernel: [ 6341.656136] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:45 humonculux kernel: [ 6341.656137] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1348 data 0x00004303
Dec 25 22:24:45 humonculux kernel: [ 6341.656144] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR BEGIN_END_ACTIVE
Dec 25 22:24:45 humonculux kernel: [ 6341.656145] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:45 humonculux kernel: [ 6341.656147] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x134c data 0x00008006
Dec 25 22:24:45 humonculux kernel: [ 6341.656154] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR BEGIN_END_ACTIVE
Dec 25 22:24:45 humonculux kernel: [ 6341.656155] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:45 humonculux kernel: [ 6341.656156] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1350 data 0x00004001
Dec 25 22:24:45 humonculux kernel: [ 6341.656164] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR BEGIN_END_ACTIVE
Dec 25 22:24:45 humonculux kernel: [ 6341.656165] [drm] nouveau 0000:01:00.0: PGRAPH - DATA_ERROR
Dec 25 22:24:45 humonculux kernel: [ 6341.656166] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x1358 data 0x00004303
Dec 25 22:29:06 humonculux kernel: [ 6602.329083] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 1 - Unknown fault at address 004f260100
Dec 25 22:29:06 humonculux kernel: [ 6602.329085] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 1 - e0c: 00000000, e18: 00000000, e1c: 00060500, e20: 00002a00, e24: 00030000
Dec 25 22:29:06 humonculux kernel: [ 6602.329095] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 2 - Unknown fault at address 004f261000
Dec 25 22:29:06 humonculux kernel: [ 6602.329096] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 2 - e0c: 00000000, e18: 00000000, e1c: 00020510, e20: 00002a00, e24: 00030000
Dec 25 22:29:06 humonculux kernel: [ 6602.329105] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 3 - Unknown fault at address 004f262100
Dec 25 22:29:06 humonculux kernel: [ 6602.329106] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 3 - e0c: 00000000, e18: 00000000, e1c: 00060520, e20: 00002a00, e24: 00030000
Dec 25 22:29:06 humonculux kernel: [ 6602.329116] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 5 - Unknown fault at address 004f263000
Dec 25 22:29:06 humonculux kernel: [ 6602.329117] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 5 - e0c: 00000000, e18: 00000000, e1c: 00020530, e20: 00002a00, e24: 00030000
Dec 25 22:29:06 humonculux kernel: [ 6602.329126] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 6 - Unknown fault at address 004f264000
Dec 25 22:29:06 humonculux kernel: [ 6602.329127] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 6 - e0c: 00000000, e18: 00000000, e1c: 00020540, e20: 00002a00, e24: 00030000
Dec 25 22:29:06 humonculux kernel: [ 6602.329136] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 7 - Unknown fault at address 004f265000
Dec 25 22:29:06 humonculux kernel: [ 6602.329138] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 7 - e0c: 00000000, e18: 00000000, e1c: 00020550, e20: 00002a00, e24: 00030000
Dec 25 22:29:06 humonculux kernel: [ 6602.329147] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 8 - Unknown fault at address 004f266000
Dec 25 22:29:06 humonculux kernel: [ 6602.329148] [drm] nouveau 0000:01:00.0: PGRAPH_TRAP_TPDMA_RT - TP 8 - e0c: 00000000, e18: 00000000, e1c: 00020562, e20: 00002a00, e24: 00030000
Dec 25 22:29:06 humonculux kernel: [ 6602.329150] [drm] nouveau 0000:01:00.0: PGRAPH - TRAP
Dec 25 22:29:06 humonculux kernel: [ 6602.329152] [drm] nouveau 0000:01:00.0: PGRAPH - ch 2 (0x0001256000) subc 7 class 0x8397 mthd 0x15e0 data 0x00000000
Dec 25 22:29:06 humonculux kernel: [ 6602.329161] [drm] nouveau 0000:01:00.0: VM: trapped write at 0x004f260100 on ch 2 [0x00001256] PGRAPH/PROP/RT0 reason: PAGE_NOT_PRESENT





----- Original Message -----
> From: Steven Chamberlain <steven@pyro.eu.org>
> To: Mar Mel <marmel6942@yahoo.com>
> Cc: "696554@bugs.debian.org" <696554@bugs.debian.org>
> Sent: Tuesday, December 25, 2012 9:34 PM
> Subject: Re: Bug#696554: System freezes and lockups
> 
> Hi,
> 
> I wonder, if these errors only appear after some time - and if you've
> had problems with the non-free driver too -  maybe it has to do with the
> buildup of heat?
> 
> Are you able to check temperature sensors on the card?
> 
> I don't know how to do that for your particular model, but on my card
> (NV92 / 8800 GT) I can do it like this (as root) :
> 
> # apt-get install lm-sensors
> # modprobe adt7473
> # modprobe adt7475
> # sensors
> adt7473-i2c-4-2e
> Adapter: nouveau-0000:18:00.0-2
> in1:          +3.00 V  (min =  +0.00 V, max =  +2.99 V)
> +3.3V:        +3.34 V  (min =  +0.00 V, max =  +4.39 V)
> fan1:        2277 RPM  (min = 2000 RPM)
> fan2:           0 RPM  (min =    0 RPM)
> fan3:           0 RPM  (min =  164 RPM)  ALARM
> temp1:        +52.5°C  (low  = +20.0°C, high = +68.0°C)
>                        (crit = +100.0°C, hyst = +98.0°C)
> Board Temp:   +48.0°C  (low  = +20.0°C, high = +60.0°C)
>                        (crit = +100.0°C, hyst = +96.0°C)
> temp3:        +52.5°C  (low  = +20.0°C, high = +68.0°C)
>                        (crit = +136.0°C, hyst = +132.0°C)
> 
> Regards,
> -- 
> Steven Chamberlain
> steven@pyro.eu.org
>


Reply to: