Re: Dropping live image generation and testing for oldstable ?

To: debian-cd@lists.debian.org
Subject: Re: Dropping live image generation and testing for oldstable ?
From: Simon McVittie <smcv@debian.org>
Date: Mon, 12 Jan 2026 00:34:53 +0000
Message-id: <aWRBrfCfzIzgnWGg@remnant.pseudorandom.co.uk>
In-reply-to: <68dc4ae6-8161-4f26-8286-0821c21604b3@rclobus.nl>
References: <aWKFeQwkfGQ3qeAX@einval.com> <68dc4ae6-8161-4f26-8286-0821c21604b3@rclobus.nl>

[Reducing Cc to debian-cd as suggested]

On Sun, 11 Jan 2026 at 21:25:23 +0100, Roland Clobus wrote:

I understand that the testing effort (8 live images for amd64) is huge. Many of the tests are really slow to perform (i.e. they take more than 10 minutes each, with lots of waiting in-between).

Each live-image test done by the images testing group is really 7 steps, after the images have been built:


1. download image and write it to bootable media (can be slow depending
   on network connection and media available)
2. boot the live image (quick) and check the desktop environment works
3. install with Calamares (slow), reboot and check the installed desktop
4. reinstall with the included copy of d-i (slow), reboot and check the
   installed desktop
5-7: repeat 2-4 with BIOS rather than UEFI boot mode, often done in
     parallel if enough test machines and/or testers are available
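To put a rough number on the steps above, here is a minimal sketch that enumerates the test plan. The image count (8) and the 7 steps come from this thread; the step labels and the function name are only illustrative, and the sketch ignores any parallelism between testers.

```python
from itertools import product

# Figures taken from the thread: 8 live images for amd64, two boot modes.
N_IMAGES = 8
BOOT_MODES = ("UEFI", "BIOS")
# Steps 2-4 of the list above, repeated once per boot mode (labels are illustrative).
STEPS_PER_MODE = (
    "boot live image",
    "install with Calamares",
    "reinstall with d-i",
)


def test_plan(n_images=N_IMAGES):
    """Return one (image, boot_mode, step) tuple per manual test step."""
    plan = []
    for image in range(1, n_images + 1):
        # Step 1 happens once per image, independent of boot mode.
        plan.append((image, None, "download and write to media"))
        for mode, step in product(BOOT_MODES, STEPS_PER_MODE):
            plan.append((image, mode, step))
    return plan


plan = test_plan()
slow = [p for p in plan if "install" in p[2]]
print(len(plan), "steps in total,", len(slow), "of them installs")
```

With these figures that is 56 steps per point release, 32 of which are full installations.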

So, yes, this is certainly time-consuming (and considerably more time-consuming than the tests we do on each debian-cd installer image, which normally only get one install per image per boot mode).

*Generating* the live images is also quite time-consuming, particularly if it fails and has to be retried. As you mentioned elsewhere in your message, by the time we get to the 13th point release, I wouldn't expect there to be many surprises remaining - and yet, this time, the live image builds failed.

One factor potentially contributing to that is that the live images appear to be built by the latest live-build from git, and not from a stable branch of live-build into which only relevant fixes are cherry-picked. While trying to help to diagnose the failing build, I looked at the diff between the live-build that was used for 12.12 and the live-build that was used for 12.13. Nothing jumped out at me as a likely root cause for the failure, but I did notice that some of the changes seemed like the sort of change that, if it was up to me, I wouldn't be applying to a stable branch (for example fixes for bugs that only affect unofficial/customized images and not our official images, or for build environments other than the official one).

debian-cd has a semi-frozen production branch for each Debian major release, with the bar for backporting changes becoming increasingly high as the branches become older (for example if I understand correctly, 13.3 images were built with https://salsa.debian.org/images-team/debian-cd/-/tree/buildd/trixie?ref_type=heads but the 12.13 images produced the same day were built with https://salsa.debian.org/images-team/debian-cd/-/tree/buildd/bookworm?ref_type=heads which contains fewer commits). Could live-build do the same? I think that would be good for robustness.

I also wonder whether building (but not publishing!) a set of 12.x live images during the "quiet period" in the week before the point release (perhaps with bookworm-proposed-updates included in its apt sources to smoke-test the pending changes) would have already exhibited the build regression that we saw on point release day, allowing it to be detected and investigated before it was too late to do anything about it.
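As an illustration of what "included in its apt sources" could mean for such a smoke-test build, here is a small sketch that generates one-line-style apt source entries. The helper name, the mirror URL and the single "main" component are assumptions made for the example; a real build would also need the security and updates suites and whatever components the official images use, and this is not how live-build itself is configured.

```python
def apt_sources(suite, include_proposed_updates=False,
                mirror="http://deb.debian.org/debian",
                components=("main",)):
    """Return apt source lines for a suite (hypothetical helper).

    The mirror and components defaults are assumptions for illustration.
    """
    suites = [suite]
    if include_proposed_updates:
        # e.g. bookworm -> bookworm-proposed-updates
        suites.append(f"{suite}-proposed-updates")
    comps = " ".join(components)
    return [f"deb {mirror} {s} {comps}" for s in suites]


for line in apt_sources("bookworm", include_proposed_updates=True):
    print(line)
```

The only difference between the release-day build and the smoke-test build in this sketch is the extra proposed-updates line.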

Or just a handful of random tests on real hardware could be performed instead of all of them. I am aware that the virtual environment used by openQA will not catch hardware-related issues (especially hardware requiring kernel modules and/or firmware).

The kernel is one of the components most likely to be updated in an oldstable point release, so that's significant.

The images testing group specifically doesn't use virtual machines to test live images, precisely because in the past there have been regressions that broke them on real hardware but didn't affect installing into a VM. This is unlike the debian-cd (d-i) images, for which testing "most" images in a VM is usually considered to be sufficient (the only ones that are always tested on real hardware are those with text-to-speech, I think).

Also, for bookworm the point release is number 13. Would it be possible for the release team to do the full manual tests for the first few point releases instead of for all of them?


(I assume you mean the images team rather than the release team.)

My suggestion would be to produce live images and do manual testing for the lifetime of the stable release, but stop when it becomes oldstable. Concretely, for bookworm, this would have meant generating and testing these releases:


12.0: debian-cd + live
12.1: debian-cd + live
...
12.11: debian-cd + live
(at this point, 13.0 was released and 12.x became oldstable)
12.12: debian-cd only
12.13: debian-cd only
12.14 (hasn't happened yet): debian-cd only
12.15 (hasn't happened yet): debian-cd only
(at this point, we expect bookworm to be handed over to the LTS team)

My reasoning for this is that our stated reason for continuing to support oldstable for 1 year after the stable release is to give Debian users a grace period of 1 year to upgrade from oldstable to stable - but live images aren't really required for that.
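The proposed rule is simple enough to state as a function. This is only a sketch of the suggestion above: the function name is made up, and the cut-off of 11 is the bookworm-specific value from the list (12.11 was the last point release before 13.0).

```python
def artifacts_for(point_release, last_point_while_stable=11):
    """Return the image sets to build and test for bookworm 12.<point_release>.

    last_point_while_stable is the last point release made while the suite
    was still stable; 11 is the bookworm value quoted in this thread.
    """
    if point_release <= last_point_while_stable:
        return ("debian-cd", "live")
    # Once the suite is oldstable, only the installer images are produced.
    return ("debian-cd",)


for n in (0, 11, 12, 13):
    print(f"12.{n}:", " + ".join(artifacts_for(n)))
```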


    smcv

References:
- Dropping live image generation and testing for oldstable ?
  - From: "Andrew M.A. Cater" <amacater@einval.com>
- Re: Dropping live image generation and testing for oldstable ?
  - From: Roland Clobus <rclobus@rclobus.nl>
