Re: Causes, cures and prevention of orphaned inodes?
On Mon, 2018-02-05 at 10:15 +1300, Ben Caradoc-Davies wrote:
> On 05/02/18 09:49, Stephen P. Molnar wrote:
> > They seem to happen when I am
> > running four or more apps at the same time.
>
> I would never expect to see orphaned inodes except after a system
> crash or kernel memory corruption. How did you test your CPU and RAM?
> Do you see any other symptoms such as segfaults that could suggest
> memory problems under concurrent load? How long have you seen this
> problem? I see you are using ext4; are the inodes on these
> filesystems? ext4 is very well tested and robust.
>
> My preferred memory test for my 4-core (8-thread) Kaby Lake i7 is to
> run concurrent "memtester" instances equal to the number of cores (4
> in my case), concurrent with "stress" equal to the number of cores
> ("stress -c 4" in my case). This workout detected memory problems not
> found by other tools such as "memtest86+" or "mprime -t".
>
> Other hardware issues to consider are overheating (addressed with
> better cooling and thermald) and power supply problems which may only
> be evident at load. Is your system prime stable (i.e. runs with
> "mprime -t" (with AVX disabled) for many hours)? How do you monitor
> system temperature?
>
> Kind regards,
>
Thanks for your reply.
As a matter of fact, I did get an overheating warning from the OS when
running a calculation on a rather large organic molecule with the ORCA
package using 8 threads. I upgraded the CPU cooler to a Hyper 212 EVO
and the problem went away.
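
On the question of monitoring temperature, here is a minimal Python
sketch, assuming a Linux kernel that exposes thermal zones under
/sys/class/thermal with readings in millidegrees Celsius. The function
names and the 85 C limit are made up for illustration and are not taken
from any tool mentioned in this thread.

```python
from pathlib import Path


def read_zone_temps(base="/sys/class/thermal"):
    """Return {zone_name: degrees_C} for each readable thermal zone under base."""
    temps = {}
    for zone in sorted(Path(base).glob("thermal_zone*")):
        try:
            raw = (zone / "temp").read_text().strip()
            # The kernel reports millidegrees Celsius, e.g. 45000 for 45 C.
            temps[zone.name] = int(raw) / 1000.0
        except (OSError, ValueError):
            # Skip zones that are missing, unreadable, or not numeric.
            continue
    return temps


def hot_zones(temps, limit_c=85.0):
    """Return the zones at or above limit_c (an arbitrary, assumed limit)."""
    return {name: t for name, t in temps.items() if t >= limit_c}


if __name__ == "__main__":
    for name, t in read_zone_temps().items():
        print(f"{name}: {t:.1f} C")
```

Running this in a loop alongside a heavy job would show whether the
readings climb under load. Which zone corresponds to the CPU package
varies between machines, so treat the zone names as something to check
locally.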
--
Stephen P. Molnar, Ph.D.
Consultant
www.molecular-modeling.net
(614)312-7528 (c)
Skype: smolnar1