Thread $task61.1 receives the message to handle the SIGALRM signal for $task61. Signal handling requires the main thread to be aborted by calling thread_abort($task61.0). Thread $task61.1 then enters thread_sleep(), waiting for thread $task61.0 to become suspended. At this point thread $task61.1 has a state of TH_WAIT|TH_RUN.
The stressor program ($task60) reaches its "limit of patience" and sends SIGKILL to the worker ($task61). SIGKILL is handled specially, so $task60 calls task_terminate($task61) directly. An early phase of this calls task_hold_locked(), which calls thread_hold() on each thread in $task61. It is at this point that $task61.1 (still sleeping) changes to TH_WAIT|TH_SUSP, before it emerges from thread_sleep().
When $task61.1 is woken up by thread_wakeup_prim(), it matches the case where it is "Either already running, or suspended", so the code only removes the TH_WAIT state and the thread is not set running. It remains in this state because there are no further triggers to set it running again so that it can tidy its state before termination. The most important piece of state that cannot be tidied is the remaining reference it holds on thread $task61.0.
Task $task60 is now spinning, continuously attempting to terminate the first thread in the list ($task61.0), but that thread never terminates because of the reference held by $task61.1. Both $task60 and $task61 are now permanently stuck in their current states.
My suggestion is to alter the code within task_terminate() to call thread_force_terminate() for each thread in the list rather than just the head, then repeat the iteration over the remaining threads until none remain. The existing code, by contrast, relies on the threads being terminable in sequence, which (if my analysis above is correct) is not always possible.
I've attached my proposal for review. The solution appears clumsy, but I could not see a simpler way of traversing the list safely with the locks necessarily released for the call to thread_force_terminate(). I wouldn't be at all surprised if what I have proposed is unsafe, so I'd welcome any feedback to improve it, or to find an error in my analysis. I've kept the original code under '#if 0' for easier comparison rather than providing a diff at this stage. In any case, even if my suggestion finds favour, I'd imagine this alteration would want much scrutiny before being released into what is a critical part of gnumach.
I have tested this patch with my stress-ng test case and it has not failed for more than 90 minutes now, which is some kind of record. It usually fails within 20 minutes, and often much sooner.
Regards,
Mike.

On 23/07/2025 20:13, Michael Kelly wrote:
Some additional context for consideration. The thread 0xf60f9170 has a reference count of 1, so presumably, other than during repetitions of the while loop in $task60.0, the only reference is held by $task61.1. That thread is sleeping, waiting for TH_EV_WAKE_ACTIVE on thread 0xf60f9170. That wakeup event presumably never arrives. Is that down to the task it is associated with being terminated?

On 22/07/2025 20:14, Michael Kelly wrote:

Hi All,

I've been experimenting with stress-ng for some time to stress test my hurd virtual machine. This has already exposed a few problems, but here is another. Sorry for the long explanation, but it might be necessary to make sense of the problem. The scenario under test goes something like this:

1) The top-level supervisory process 'stress-ng' begins execution.

2) It forks N times, one per stressor under test (in my case 64 times). Call these processes 'stressor'.

3) The particular tests I am running are stress-vm and stress-mmap. In these tests each stressor process forks again so that it can be supervised and restart the test should it run out of resources. Call these processes 'worker'.

4) Each stressor sets a timeout using alarm() and then waits for the worker to terminate by calling waitpid().

5) The stressor's SIGALRM handler sets a variable tested occasionally within the worker. If the worker tests that variable quickly, it exits normally. If it does not, the stressor sends a series of signals: SIGALRM (4 times), SIGTERM, then finally SIGKILL, with a short time gap between them.

The test scenario I set up uses all the vm's real memory and a certain portion of swap. Consequently, when the timeout expires, many of the processes are paged out and do not respond quickly, which means that many workers receive all 6 signals.
Occasionally, one of the stressor processes gets stuck within this while loop within task_terminate ($task60.0):

	while (!queue_empty(list)) {
		thread = (thread_t) queue_first(list);
		/* thread is 0xf60f9170 and is within the worker process */
		...
		thread_force_terminate(thread);
		...
	}

thread_force_terminate(thread) calls thread_halt(thread, TRUE) and in this instance does very little, as the thread is already halted; it simply increases the thread's suspend_count (currently standing at 0x64c0fc8e!). The thread is not removed from the list and is repeatedly processed in the loop.

The thread 0xf60f9170 is in $task61 (the worker) and is the main thread which does all the stress testing. Examining its state suggests it is already halted, with a state of 0x112 (TH_SUSP|TH_HALTED|TH_SWAPPED).

All stack traces are attached and are annotated with extra context.

I'm trying to make sense of the thread code, but as it's rather complex I thought it might save time to ask if anyone had any input to make. In particular, what do I need to look at or consider to determine why the state has ended up this way? Better yet, someone might immediately see the cause of the problem. I have a virtual machine snapshot of this moment saved, so I can easily relay any additional information required.

There is a 2nd thread ($task60.1) in the stressor process which is also looping, but I think that is just stuck waiting for the task_terminate() to complete. (This 2nd thread is processing a secondary timeout set up by the stressor using alarm(1), but I don't think that is necessarily relevant.)

None of the threads in $task61 appear to be active, based on their 'last updated' time reported by the kernel debugger.

Any ideas?
#if 0
	while (!queue_empty(list)) {
		thread = (thread_t) queue_first(list);
		thread_reference(thread);
		task_unlock(task);
		thread_force_terminate(thread);
		thread_deallocate(thread);
		thread_block(thread_no_continuation);
		task_lock(task);
	}
#else
	while (!queue_empty(list)) {
		thread = (thread_t) queue_first(list);
		thread_reference(thread);
		do {
			thread_t next = (thread_t) queue_next(&thread->thread_list);

			if (!queue_end(list, (queue_entry_t) next))
				thread_reference(next);
			task_unlock(task);
			thread_force_terminate(thread);
			thread_deallocate(thread);
			thread_block(thread_no_continuation);
			thread = next;
			task_lock(task);
		} while (!queue_end(list, (queue_entry_t) thread));
	}
#endif