Re: interpreting Gkrellm charts

To: debian-user@lists.debian.org
Subject: Re: interpreting Gkrellm charts
From: Gary Dale <garydale@torfree.net>
Date: Sun, 01 Jun 2014 14:18:56 -0400
Message-id: <[🔎] 538B6E90.7080806@torfree.net>
Reply-to: gary@extremeground.com
In-reply-to: <[🔎] 538B6817.9080702@torfree.net>
References: <nnUkq-3mT-11@gated-at.bofh.it> <20140531164343.095CBCCAAE@wormhole.physics.ubc.ca> <538A0BC0.9010501@torfree.net> <CAMPXz=qg1f6Wie9f445t2kgkktfWtfkXit0p8_srxRHhXUvaHQ@mail.gmail.com> <538A4076.7070307@torfree.net> <[🔎] CAMPXz=ri__ZG3muR4_YoSMN5bmEwgbPDRr92U9jgnRed4o6GnA@mail.gmail.com> <[🔎] 538AA92A.8050207@torfree.net> <[🔎] 538B6817.9080702@torfree.net>

On 01/06/14 01:51 PM, Gary Dale wrote:

On 01/06/14 12:16 AM, Gary Dale wrote:
On 31/05/14 11:05 PM, David wrote:
On 1 June 2014 06:49, Gary Dale <garydale@torfree.net> wrote:
On 31/05/14 03:25 PM, David wrote:
On 1 June 2014 03:05, Gary Dale <garydale@torfree.net> wrote:
On 31/05/14 12:43 PM, William Unruh wrote:
In linux.debian.user, you wrote:
I'm running Debian/Jessie on an AMD64 system using KDE. My system
periodically grinds to a halt for a minute or so then resumesas ifnothing had happened. This only happens when I'm running KDE.Gnome and
xfce work properly, even with the same applications open.
I recently installed Gkrellm (using default settings) to seewhat ishappening when my system grinds to a halt. The only unusualpart I seein it is that the procs box has the brown line climbing to thetop of
the chart.  Interestingly, the slope of the brown line continues
throughout the slowdown, which suggests that whatever it ismeasuring
is
continuing to increase.
That is the number of processes that are running
The blue/green things there are how many forks there are withinsome
process.
Possibly not. Sorry, I'm actually using the "prev" theme, not thedefault
one (right-click on the header, select theme | prev). This shows the
number
of procs as a number. The number remains fairly steady over time.Under
xfce
(which I am currently using - this KDE problem is just tooannoying), the(proc) brown line floats around a bit while the blue chart showslots ofspikes. Under KDE, the brown line goes well above the bluespikes. On thedisk chart, the brown and blue charts show spikes in xcfe butjump to asolid high level under KDE during the slowdown - although I dohave one
saved screenshot where the disk activity shows a high number but the
brown
and blue charts are both at a low level.
My interest in reading and helping with the specifics of your problem
pretty much evaporated when you persist in using "brown" and"blue" as
identifiers, even after you realise that the colors change depending
on the particular theme you are using. I think you are more likely to
receive help if you make the effort to learn what all the gkrellm
plots represent and present your problem in those terms instead of
talking about the pretty colors. That will both improve your
understanding of what is happening, and make it easier for people to
help you.
If you right-click on the proc plot you can discover that onecurve is
"load" and the other is "forks". The fact that one may or may not be
above the other is irrelevant because they both autoscale
independently.
Actually I don't discover that at all. That information is hiddenaway in aninfo tab when I right-click on the Proc name bar, not in the plotarea. Thename bar doesn't respond to left-clicks, just to right clicks,while thechart area responds to left clicks by turning the procs and usersinfo on
and off.
It takes a fair amount of interpretation to guess that the line isreportingforks while the vertical bars are possibly procs (or vice-versa)since thethe line graph goes up and down while the number of procs reportedstays
constant. Similarly the spikes in the vertical bar chart don't seem to
reflect the stable number of procs being reported.
It would be helpful, but would require a larger interface, to haveon-screenlabels for the various graphs, such as a tool-tip style popuptelling you
what the line or bar is measuring.
In the gkrellm configuration for the proc builtin, you can read the
info tab that explains more about that. Also in the setup of the proc
builtin I have entered this format string
\w1000\e$p\f procs\w1000\e$l\f load\n\e$u\f users\w1000\e$f\f forks
My proc setup was \w88\a$p\f procs\n\e$u\f users. When I try yours,it showstwo more numbers in the right side of the chart area. The topnumber goesbetween 0.6 and 0.8 from time to time while the bottom number jumpsaround
between 1 and 6.
It gives more information, but nowhere near what running 'top'offers.
I guess that the gkrellm curve you care about is "load", so you
probably need to look at the "load average" numbers in top. Searching
for information on this will find useful links like these:
https://www.linux.com/component/content/article/174-tutorials/42048-uncover-the-meaning-of-tops-statistics
http://www.linuxjournal.com/article/9001
Also 'iotop' can tell you what process is doing disk read/writes,thismight be helpful if you feel that the slowdown is correlated withsome
process that is disk-intensive.

Once you have the process ID numbers, you can use a tool like 'pstree
-p' to better see what initiates the offending process.

I have no idea about KDE.
Unfortunately, neither top nor iotop identifies a process as doinganythingremotely strenuous. However, iotop does confirm that the total i/ogoing onwhen the computer is rather a lot. This is what GKrellm also showsin anicer format. The processes that iotop shows as doing a little diskactivityare the same whether the computer is running slow or runningnormally. It
doesn't seem to show what is doing the large amounts of disk I/O.



--
To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org witha subject
of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: https://lists.debian.org/538A4076.7070307@torfree.net
On 1 June 2014 06:49, Gary Dale <garydale@torfree.net> wrote:
On 31/05/14 03:25 PM, David wrote:
If you right-click on the proc plot you can discover that onecurve is
"load" and the other is "forks". The fact that one may or may not be
above the other is irrelevant because they both autoscale
independently.
Actually I don't discover that at all. That information is hiddenaway in aninfo tab when I right-click on the Proc name bar, not in the plotarea. Thename bar doesn't respond to left-clicks, just to right clicks,while thechart area responds to left clicks by turning the procs and usersinfo on
and off.
When I wrote 'If you right-click on the proc plot you can discoverthat onecurve is "load" and the other is "forks"', I imagined that you wouldrealisethat you can confirm which is which by simply changing theline-style of
one of them, and observing which one's line-style is affected.
It takes a fair amount of interpretation to guess that the line isreportingforks while the vertical bars are possibly procs (or vice-versa)since thethe line graph goes up and down while the number of procs reportedstays
constant. Similarly the spikes in the vertical bar chart don't seem to
reflect the stable number of procs being reported.
In its proc pane gkrellm provides two plots: "load" and "forks". Idon't know
why you are confusing yourself by thinking one of the plots might show
procs. I'm not aware that gkrellm mentions a plot of procs anywhere.
Also, "load" is short for "load average", the references I gaveexplain thatthis a running average, so it won't ever look spiky. And the loadplot will
match the numeric value of load displayed nearby, and will also closely
track the load averages reported by top. Forks on the other hand are
typically spiky events, unless something uncontrolled is occurring.
It would be helpful, but would require a larger interface, to haveon-screenlabels for the various graphs, such as a tool-tip style popuptelling you
what the line or bar is measuring.
Perhaps, but it's unecessary for anyone who understands what's being
plotted. There are two plots of things that show very differentbehaviours.One reports instantaneous data, the other is a running average. Bothplota numeric value displayed adjacent if using the format string Iprovided.Popup tool tips are typically triggered by mousing over static GUIobjects,but a few colored pixels on a plot is not a GUI object, so thatwould becomplex to implement. gkrellm is a tool for sysadmins. They havelearnt to
read documentation, and to think analytically. They don't need spoon
feeding obvious information.
My proc setup was \w88\a$p\f procs\n\e$u\f users. When I try yours,it showstwo more numbers in the right side of the chart area. The topnumber goesbetween 0.6 and 0.8 from time to time while the bottom number jumpsaround
between 1 and 6.
Here, what you call the new "top number" is labelled "load", andwhat you callthe "bottom number" is labelled "forks"? Do you see those labels? Ifyou do
see them, omitting them from this discussion and your thinking is not
helping it.
If you don't see them, perhaps you need to increase the pixel widthof your
gkrellm display.
iotop does confirm that the total i/o going on when the computer israther a lot.
[snip]
It doesn't seem to show what is doing the large amounts of disk I/O.
There appears to be a contradiction in those two statements, becauseiotopusually shows both the total, and what comprises the total. Sothat's strange.What happens if you run iotop in "only" mode while the problem isoccurring?
OK, I'm back in KDE and the problem just recurred. GKrellm showed theprocs steady at around 640, a constant 5 users, the load ramped up tomore than 10 but the forks never really changed much - bouncingaround between 0 and 5. None of the CPUs showed any significantactivity.
The disk i/o was between 10M and 20M but iotop -o didn't show anyprocesses doing any i/o for most of the slowdown. Occasionally a procwould show doing a little i/o, but most of the time nothing showed up.
iotop shows the disk i/o in the totals above the column header line,but I'm not sure the number it shows match what Gkrellm shows. Theyare in the same order of magnitude however.
---------------------------
Just had another episode. The load this time climbed to about 7before eventually tapering off - it usually drops rapidly, but notthis time. It took a couple of minutes for it to drop back below 1.
The disk i/o showed as high as 31M while iotop -o showed only theActual DISK WRITE as high as 7M/s. The other values were usually 0.Again the disk i/o stopped (went back to normal) quite suddenly
Curious. I noticed that nepomuk-server and dolphin both had a numberof zombie processes (around 16 each) attached. I killed the parentprocesses then did a reinstall of dolphin and libnepomukcore4.Nepomuk-server isn't running but I did restart dolphin. No currentzombie processes after a couple of hours and no slowdowns either.
I'm going to watch for this next time I reboot to see if it happensagain, because the slowdowns definitely survived reboots in the past.

Curiouser still. After a reboot, nepomukserver didn't start. Dolphin isstill creating zombie processes but without nepomukserver, I haven'tseen any fresh slowdowns. I'm not sure why nepomukserver isn't starting,but that's not a priority issue for me. :)

Reply to:

Follow-Ups:
- Re: interpreting Gkrellm charts
  - From: Steve Litt <slitt@troubleshooters.com>

References:
- Re: interpreting Gkrellm charts
  - From: David <bouncingcats@gmail.com>
- Re: interpreting Gkrellm charts
  - From: Gary Dale <garydale@torfree.net>
- Re: interpreting Gkrellm charts
  - From: Gary Dale <garydale@torfree.net>

Prev by Date: Re: Remove unwanted, orphaned files and dependencies
Next by Date: Re: interpreting Gkrellm charts
Previous by thread: Re: interpreting Gkrellm charts
Next by thread: Re: interpreting Gkrellm charts
Index(es):
- Date
- Thread