Re: Transitioning from existence-based lockfiles and /var/lock to flock

To: debian-policy@lists.debian.org
Subject: Re: Transitioning from existence-based lockfiles and /var/lock to flock
From: Simon McVittie <smcv@debian.org>
Date: Mon, 13 Oct 2025 19:55:55 +0100
Message-id: <aO1LO87dy-6kXHL5@remnant.pseudorandom.co.uk>
In-reply-to: <aO0_FT9-iz46vTr_@localhost>
References: <aO0_FT9-iz46vTr_@localhost>

On Mon, 13 Oct 2025 at 11:04:05 -0700, Josh Triplett wrote:

> - Software should not use existence-based lockfiles (where the existence
>   of the lockfile constitutes holding the lock); software should use
>   file-based locking (`flock`) on an appropriate file instead.

There are several orthogonal advisory lock mechanisms, and I don't think Policy should take a general position on which one should be used, as long as all programs that might want to exclude each other by holding a lock can agree on which one they are going to use. The ones I know about are:


* flock(2) (BSD-derived, not specified by POSIX) and its command-line interface flock(1) (util-linux)
* fcntl F_SETLK and friends (POSIX)
* fcntl F_OFD_SETLK and friends (Linux-specific)
* lockf(3) (POSIX)
  - which wraps one of the fcntl locks on GNU/Linux, but might be something
    else on other kernel/libc combinations

There might be others.

In general it would be a significant bug to replace one of these with another of these without domain-specific attention being paid to the subtleties of their semantics in terms of which ones exclude each other, which ones can be inherited from parent to child, which ones are scoped to a process or a thread or an open file description, and so on.
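
As an illustration of the scoping point, here is a minimal sketch, assuming Linux and a local (non-NFS) filesystem. It uses Python's `fcntl.flock` (which calls flock(2)) and `fcntl.lockf` (which, despite its name, wraps the fcntl record-locking calls). The helper name `second_lock_succeeds` is made up for this example. It takes a lock through one file descriptor, then tries to take the same lock again through a second, independently opened descriptor in the same process:

```python
import fcntl
import os
import tempfile


def second_lock_succeeds(lock_call, path):
    """Lock `path` via one descriptor, then retry via a second one.

    Both descriptors belong to the same process but refer to
    different open file descriptions.
    """
    first = os.open(path, os.O_RDWR)
    second = os.open(path, os.O_RDWR)
    try:
        lock_call(first, fcntl.LOCK_EX | fcntl.LOCK_NB)
        try:
            lock_call(second, fcntl.LOCK_EX | fcntl.LOCK_NB)
            return True
        except OSError:
            # The non-blocking request was refused: the lock is held elsewhere.
            return False
    finally:
        os.close(second)
        os.close(first)


if __name__ == "__main__":
    with tempfile.NamedTemporaryFile() as tmp:
        # fcntl locks are owned by the process, so the second request
        # is not treated as a conflict.
        print("fcntl-style, same process:", second_lock_succeeds(fcntl.lockf, tmp.name))
        # flock locks are owned by the open file description, so the
        # second descriptor is refused.
        print("flock, same process:", second_lock_succeeds(fcntl.flock, tmp.name))
```

Under those assumptions the fcntl-style request goes through and the flock request is refused, so code that relied on one behaviour would silently change meaning if it were switched to the other mechanism.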

For example, when Flatpak wants to prevent a concurrent Flatpak process from deleting a runtime that is in use by an app, it implements that by locking the file ${runtime}/.ref with fcntl F_SETLK. It would be fine for a program that interacts with Flatpak (or a newer version of Flatpak itself) to use either F_SETLK or F_OFD_SETLK on ${runtime}/.ref, because F_OFD_SETLK is documented to be mutually exclusive with an incompatible F_SETLK, but it would be a potentially serious bug for it to use flock(2), because it is unspecified whether flock(2) and F_SETLK exclude each other (and on Linux they don't, unless NFS happens to be involved).
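
The mismatch described above can be probed with a short sketch. This is not Flatpak's code: a temporary file stands in for ${runtime}/.ref, and the helper names `try_lock` and `probe_from_child` are hypothetical. The parent process holds an fcntl-style lock (Python's `fcntl.lockf` wraps the fcntl locking calls), and a forked child then tries both an fcntl-style lock and a flock(2) lock on the same file:

```python
import fcntl
import os
import tempfile


def try_lock(lock_call, path):
    """Open `path` and attempt a non-blocking exclusive lock."""
    fd = os.open(path, os.O_RDWR)
    try:
        lock_call(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
        return True
    except OSError:
        return False
    finally:
        os.close(fd)


def probe_from_child(path):
    """Hold an fcntl lock here, then let a child try both mechanisms.

    Returns (fcntl_acquired, flock_acquired) as seen by the child.
    """
    holder = os.open(path, os.O_RDWR)
    fcntl.lockf(holder, fcntl.LOCK_EX)  # fcntl-style lock held by the parent
    read_end, write_end = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child: fcntl locks are not inherited across fork().
        try:
            os.close(read_end)
            outcome = bytes([
                int(try_lock(fcntl.lockf, path)),
                int(try_lock(fcntl.flock, path)),
            ])
            os.write(write_end, outcome)
        finally:
            os._exit(0)
    os.close(write_end)
    data = os.read(read_end, 2)
    os.waitpid(pid, 0)
    os.close(read_end)
    os.close(holder)  # closing the descriptor also drops the parent's lock
    return bool(data[0]), bool(data[1])


if __name__ == "__main__":
    with tempfile.NamedTemporaryFile() as tmp:
        fcntl_acquired, flock_acquired = probe_from_child(tmp.name)
        print("child acquired fcntl lock:", fcntl_acquired)
        print("child acquired flock lock:", flock_acquired)
```

The child's fcntl-style attempt should be refused. Whether its flock attempt is refused depends on the kernel and filesystem, which is exactly why the two sides have to agree on one mechanism.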

Similarly, it would be a potentially serious bug if one program locked the file ${runtime}/.ref, but another took out a lock on the directory itself, ${runtime}, intending to exclude the other program. Either one of those two locking disciplines is OK in isolation, but the two programs must agree on which one they are going to use. Clusters of closely-cooperating programs can just agree this among themselves without any special coordination and without any Policy involvement, but broader or looser categories of programs could benefit from coordination in Policy.


In particular:

Policy does specify (in §11.6) how to lock the mailboxes in /var/mail/, because that is an example of a single domain-specific context where it's necessary that everything agrees. (It already calls for this to be done inside /var/mail/ rather than involving /var/lock/ or /run/lock/, so it's out-of-scope for #1115317.)

According to #1110980 and #1110981, the FHS, which Policy incorporates by reference, specifies the use of lock files in /var/lock/ for serial ports. If we want programs like uucp to prefer to use flock or fcntl locks for this purpose, then we will need to document an FHS exception in Policy for this, and specify which of the various advisory locking mechanisms is to be used for it - preferably one that is already supported in software that locks serial ports, or already used in other distros. In #1110980, Luca recommended "BSD locks" and mentioned that some serial-port-related software already supports those, but I'm not sure which specific API that was intended to refer to - as approximately POSIX-compliant OS distributions, the BSDs presumably support both flock(2) and fcntl F_SETLK, and possibly others.

I think it would be best to have a specific, narrowly-scoped bug to agree on how programs like uucp should lock serial ports, with its conclusion documented in Policy. I don't know whether there are other non-closely-cooperating groups of programs currently using /run/lock/ or (equivalently) /var/lock/ that need similar coordination.


    smcv
