Re: ext4 FS Crash





On Wed, Dec 4, 2024 at 11:43 PM Michael Stone <mstone@debian.org> wrote:
On Wed, Dec 04, 2024 at 05:11:47PM +0000, Daniel Harris wrote:
>Thanks for all your replies.
>As far as I can tell there are no errors reported using fsck or smartctl or
>nvme, and the firmware is the correct and newest version so no problems there.
>
>The following are the messages that appear but only taken from my phone and
>copied from the photo (lots of scrolling errors repeating over).
>I thought these new drives were supposed to last longer than older moving HDD
>but obviously not

Is this during boot? The messages indicate a corrupted journal, which
generally means a device error, or maybe a device which lost power while
writing. It should be possible to mount read-only without replaying the
journal for recovery purposes, but it's basically unfixable.
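
For reference, a minimal sketch of such a read-only mount, assuming an ext4 filesystem. The `noload` option tells ext4 not to load (and so not to replay) the journal. The device `/dev/nvme0n1p2` and the mount point `/mnt/recovery` are placeholders, not taken from this thread, and `ro_mount_cmd` is a made-up helper that only prints the command so it can be checked before being run as root.

```shell
# Print (do not run) a read-only, no-journal-replay mount command.
# Both arguments are placeholders; substitute your own device and mount point.
ro_mount_cmd() {
    dev="$1"
    mnt="$2"
    # ro     : mount read-only
    # noload : ext4 option, do not load the journal on mount
    printf 'mount -t ext4 -o ro,noload %s %s\n' "$dev" "$mnt"
}

ro_mount_cmd /dev/nvme0n1p2 /mnt/recovery
```

Because the journal is skipped, the filesystem may look inconsistent, so this is only suitable for copying data off.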

So it's not actually a crash. On the two occasions it has happened, I had
been away from my computer for a while, and when I returned and moved the
mouse, I saw messages scrolling on a black screen (no X running). I can
switch to a new VT but I cannot log in; when I try, I just get the errors
repeating on the screen. After a hard reset everything works perfectly,
with no errors anywhere.
 
>I guess it's time to buy a new drive :(

Did you try "nvme smart-log /dev/nvme0" to look for issues?


Seems normal to me:

 Smart Log for NVME device:nvme0 namespace-id:ffffffff
critical_warning                        : 0
temperature                             : 31°C (304 Kelvin)
available_spare                         : 100%
available_spare_threshold               : 10%
percentage_used                         : 0%
endurance group critical warning summary: 0
Data Units Read                         : 807,634 (413.51 GB)
Data Units Written                      : 5,680,746 (2.91 TB)
host_read_commands                      : 6,573,734
host_write_commands                     : 75,990,191
controller_busy_time                    : 1,145
power_cycles                            : 618
power_on_hours                          : 197
unsafe_shutdowns                        : 21
media_errors                            : 0
num_err_log_entries                     : 0
Warning Temperature Time                : 0
Critical Composite Temperature Time     : 0
Temperature Sensor 1           : 31°C (304 Kelvin)
Temperature Sensor 2           : 38°C (311 Kelvin)
Thermal Management T1 Trans Count       : 0
Thermal Management T2 Trans Count       : 0
Thermal Management T1 Total Time        : 0
Thermal Management T2 Total Time        : 0
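
As a rough sanity check on the byte totals in the log above: the NVMe specification counts data units in thousands of 512-byte blocks, so one unit is 512,000 bytes. The sketch below redoes that conversion; `data_units_to_gb` is a made-up helper name, and it expects the unit count without the thousands separators.

```shell
# Convert an NVMe "Data Units" count to decimal gigabytes.
# One data unit = 1000 * 512 bytes = 512,000 bytes.
data_units_to_gb() {
    awk -v units="$1" 'BEGIN { printf "%.2f\n", units * 512000 / 1e9 }'
}

data_units_to_gb 807634     # Data Units Read
data_units_to_gb 5680746    # Data Units Written
```

This gives 413.51 and 2908.54 (GB), matching the 413.51 GB and 2.91 TB that nvme-cli prints.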


Thanks,
Dan
