Re: SSD Optimization - Crucial CT1000MX500SSD1

To: debian-user@lists.debian.org
Subject: Re: SSD Optimization - Crucial CT1000MX500SSD1
From: David Christensen <dpchrist@holgerdanske.com>
Date: Mon, 3 Oct 2022 19:56:10 -0700
Message-id: <b56c3620-c87e-f5e8-2664-76b5d105d50b@holgerdanske.com>
In-reply-to: <4e48427b-c46c-cfc9-43ee-954fc91a54c7@gmx.com>
References: <YzmP8aU23XzTkBMg@marcelo> <13d37333-6862-dd6a-4970-32c3b327489f@holgerdanske.com> <4e48427b-c46c-cfc9-43ee-954fc91a54c7@gmx.com>

On 10/3/22 09:23, piorunz wrote:

On 02/10/2022 21:33, David Christensen wrote:

On 10/2/22 06:19, Marcelo Laia wrote:

# cat /etc/debian_version ; uname -a


bookworm/sid
Linux marcelo 5.19.0-2-amd64 #1 SMP PREEMPT_DYNAMIC Debian 5.19.11-1
(2022-09-24) x86_64 GNU/Linux



Please install Debian Stable.


Why would he?
I have exactly the same SSD (two of them) in my machine, on Debian
Testing, drives in BTRFS Raid1 mode, everything works perfect. But I
have good SATA cables.
OS version has nothing to do with cabling errors in SSD drive SMART log.
He may as well be using DOS, Windows, FreeBSD, any Linux - cabling errors
must never happen.

  uname -a

Linux ryzen 5.19.0-2-amd64 #1 SMP PREEMPT_DYNAMIC Debian 5.19.11-1(2022-09-24) x86_64 GNU/Linux


$ sudo smartctl /dev/sda --all | grep "Device Model\|SATA_Interfac\|DMA_CRC_Error"
Device Model:     CT1000MX500SSD1
183 SATA_Interfac_Downshift 0x0032   100   100   000    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       0

$ sudo smartctl /dev/sdb --all | grep "Device Model\|SATA_Interfac\|DMA_CRC_Error"
Device Model:     CT1000MX500SSD1
183 SATA_Interfac_Downshift 0x0032   100   100   000    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       0

Even if you and the OP ran identical OS instances (e.g. clones), I do not believe you two have the same make and model of computer. Therefore, different code paths will be executed -- e.g. device drivers. So, the OP's computer may be hitting a bug that your computer does not.

I am applying a troubleshooting strategy -- change one variable, apply a stimulus, and measure the result. If the result is the same as it was before, then the result is unlikely to be related to the variable and/or change. But if the result is different, then the result is likely to be related to the variable and/or change.

Of course, this is all premised upon devising a stimulus that reliably reproduces the result. When my HDDs/SSDs were having SATA cable and/or drive rack problems, reading 10 GB from them typically produced at least one error.
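
A read stimulus of this kind can be sketched as a small shell script. This is only an illustration under assumptions: the function names `crc_count` and `read_test`, the 10 GiB read size, and the exact `dd` invocation are mine, not the commands actually used in this thread. It also assumes `smartctl` and `awk` are installed and the script is run as root.

```shell
#!/bin/sh
# Sketch only: function names and the dd parameters are illustrative assumptions.

# Print the raw value (last column) of SMART attribute 199
# (UDMA_CRC_Error_Count) from `smartctl -A` output on stdin.
crc_count() {
    awk '$1 == 199 { print $NF }'
}

# Read about 10 GiB sequentially from the device and report how many
# new UDMA CRC errors the drive logged during the read.
read_test() {
    dev=$1
    before=$(smartctl -A "$dev" | crc_count)
    dd if="$dev" of=/dev/null bs=1M count=10240
    after=$(smartctl -A "$dev" | crc_count)
    echo "UDMA_CRC_Error_Count: before=$before after=$after new=$((after - before))"
}

# Run the test only when a device is given on the command line.
if [ "$#" -ge 1 ]; then
    read_test "$1"
fi
```

Saved under a hypothetical name such as crc-test.sh, it would be run as `sudo sh crc-test.sh /dev/sda`. A non-zero "new" count means the stimulus reproduced the problem.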

When the OP read 10 GB of the SSD using the d-i rescue shell, he was applying a stimulus after changing the variable "OS instance". The result was different. Therefore, the SATA UDMA CRC errors are likely related to the OS instance.

But, the above experiment has significant flaws (here are a few; I expect there are more):


1. We cannot reproduce the OP's hardware and software.

2. We do not know what Debian installer the OP used (but we could obtain it if he told us).

3. The stimulus read from the SSD. The UDMA CRC errors may only occur during writes.

4. The SMART reports indicate 38 UDMA CRC errors for 1296000877 Logical Sectors Written and 801097450 Logical Sectors Read. So, an average of 1 error per 5.52E+7 sectors. The test read 2.05E+7 sectors. That might be too few sectors.

5. Similarly, for Number of Read Commands -- 1 error per 4.43E+5 commands vs. 1.02E+4 test commands.

6. The Debian installer rescue shell is single-user (single-process?), but the UDMA errors were seen during multi-user operation (SMP). If the SATA UDMA errors are caused by concurrency/parallel execution, the d-i rescue shell environment may not be capable of reproducing the error.
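
To put a number on the "too few sectors" concern in the list above, here is a rough sketch of the arithmetic. It assumes the errors are spread evenly and independently over all sectors transferred (reads and writes alike), which is a simplification and may well not hold. The function name `expected_errors` is hypothetical.

```shell
#!/bin/sh
# Sketch only: assumes CRC errors occur at a uniform, independent rate per
# sector transferred, so the count in a test is roughly Poisson distributed.

# usage: expected_errors ERRORS SECTORS_WRITTEN SECTORS_READ TEST_SECTORS
# prints: sectors-per-error, expected errors in the test, P(no errors in the test)
expected_errors() {
    awk -v e="$1" -v w="$2" -v r="$3" -v t="$4" 'BEGIN {
        per_error = (w + r) / e      # lifetime sectors transferred per error
        expected  = t / per_error    # errors expected during the test
        p_zero    = exp(-expected)   # Poisson chance of seeing none
        printf "%.2e %.2f %.2f\n", per_error, expected, p_zero
    }'
}

# Figures from the SMART report and the 10 GB read test
expected_errors 38 1296000877 801097450 20500000
```

With the figures above this prints `5.52e+07 0.37 0.69`: under that assumption the test would expect well under one error, and would show none about two times out of three even if nothing had changed. So a clean 10 GB read is weak evidence on its own.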

If the OP installs Debian Stable on the SSD, runs the 10 GB sequential read test, uses the system interactively, and the SATA UDMA errors are not seen for some period of time (a week?), then I would be reasonably confident the problem was the SSD Debian Testing instance. But if the errors persist, then we will have to think up another hypothesis and experiment.



David
