Re: COVID [was: Re: DebConf 25 Daily announcements - 2025.07.15 - Daytrip information && DebConf Day 2]

To: Alex Lieflander <public@atlief.com>, debconf-discuss@lists.debian.org
Subject: Re: COVID [was: Re: DebConf 25 Daily announcements - 2025.07.15 - Daytrip information && DebConf Day 2]
From: Julien Plissonneau Duquène <sre4ever@free.fr>
Date: Sat, 26 Jul 2025 14:12:59 +0200
Message-id: <[🔎] fe4970f753d0b3b8c7dc00870dd2f0bf@free.fr>
In-reply-to: <[🔎] 1926163.tdWV9SEqCh@yogatl>
References: <47cxe7oyoyjixyqathko5tu47ukheqnv6ungblrriij5mwdz3t@rpkdawvwpsws> <22822798.EfDdHjke4D@yogatl> <[🔎] b45b7a3c5112e12c38987cabd1c7e30b@free.fr> <[🔎] 1926163.tdWV9SEqCh@yogatl>

CW: wall of text, probabilities, and some code
TL/DR: GIGO

Hi Alex,

Le 2025-07-18 22:43, Alex Lieflander a écrit :

Here's an example calculation of the risk with some **rough**approximations. Let's assume that:

Let me first point out some flaws in your analysis, without evendiscussing the numbers yet.

Risk of being infected is a function of many parameters, includingexposure to pathogens in the air, which is itself some integration of afunction of the distance and time spent at that distance from a pathogenemitter (but also: strain of virus, temperature, sun, relative humidity,air circulation, use of mask by none/one/both persons involved, type ofmask, is the mask worn properly, has the bearer of a FFP2 mask a beard,vaccination status, ...) and your model doesn't account for distancing(which means raising the distance and minimizing the time spent close toothers). It's also likely that a short but intense exposure will resultin a worse outcome than the same amount of exposure over an extendedperiod of time, but as I could not find any data about this I willignore this possibility as well.

Your reasoning about probabilities is also wrong. Let's assume, to keepyour 5-on-1 ratio, that members of unmasked COVID-infected folks (let'scall them group A, 6 of them) are each emitting pathogens in ambient airat 5 times the rate of mask-bearing knowingly-COVID-positive folks(let's call them group B, 14 of them). Group A is then accountable for6×5 / (6×5 + 14) = 0.68182 of pathogens in ambient air, while group B isfor the remainder = 0.31818. Then, for a single pathogen emitted byeither group, the probability that it was emitted by any single memberof that group is a flat distribution. That is, for a member of A,0.11364, and for a member of B, 0.022727.

Now if we remove entirely group B, the amount of pathogens that arepresent in ambient air would go down accordingly: it would only be68.182% of what it was with group B present. Making a few more unwiseand unrealistic assumptions, such as the initial overall probability foran attendee to become infected at this event being 6×0.05 + 14×0.01 =0.44, and the probability of becoming infected being a linear functionof the exposure, removing entirely group B would lower the overallprobability of any attendee becoming infected to 0.3.

Your own calculation was the probability for someone to not get infectedafter attending a number of successive events each with one masked orunmasked sars-cov-2 emitter; this is not the same thing as attending asingle event with all emitters being simultaneously present, as weassume that n times the pathogen concentration in ambient air implies ntimes the contamination risk (and at some point we would reach asaturation value, where the pathogen concentration is so high thatending up infected is a certainty).

By chance your own calculations often end up close to my own results,but some more fun awaits below. I would not dare to say that I masteredthat one course on probabilities that I had to take maybe some 25 yearsago, I just did better than my peers at that time, and that's not worthmuch, trust me on this.


Let's now have a look at some of your numbers that are clearly off:

- 20% of people with COVID are either asymptomatic or ignore theirsymptoms

I would bet that this one is way over 50%, maybe up to 80% with thecurrent variants. That is, for every single COVID-infected person thathas symptoms strong enough that they can't be ignored, you could have 4other persons that have no symptoms, or symptoms so weak that they wouldnot even suspect that they could have COVID.

For a start, every COVID-infected people start with being asymptomaticfor a few days while already spreading the disease; then it wasconfirmed by local health officials that a good fraction ofCOVID-spreading persons won't experience symptoms (or symptoms so lightthat they won't detect them). And then over that there are people thatwill deliberately ignore the symptoms. As a concrete example of lackingsymptoms, at least one of the COVID-positive persons at this DebConfonly tested because they spent some time close to me at some point; theydidn't actually ever experience any significant symptoms, but decided toself-isolate nonetheless.

- The probability of a particular person being infected by an unmaskedperson with COVID is 5%- The probability of a particular person being infected by a COVIDpositive person wearing a mask is 1%

If this is read incorrectly (as you did) as “the probability of eachsingle unmasked/masked person with COVID infecting any single attendeeis 5%/1%” then (ignoring a few details, like that an infected personcannot be infected again, or that a newly infected person can now spreadthe disease after one or two days) with your other hypotheses theoverall probability for an attendee to be infected at this event wouldbe 6 × 5% + 14 × 1% = 44%.

And thankfully that nightmare 44% (or even 26% with your model)contagion scenario did not actually happen, far from it AFAIK. As Isuspect that there were actually far more unsuspecting (and unmasked)COVID-infected people present than in your hypothesis, this would meanthat the actual contagion probabilities were much (as in: several ordersof magnitude) lower.

Finally let's have some fun with fuzzing: assuming any of the roughhypotheses above could be off by, say, half an order of magnitude (thatis, 10^0.5 = 3.16), let's run both your and my model with lower andhigher values and see which numbers we end up with.

I've published the corresponding Kotlin code [1]. Unfortunately theKotlin playground [2] won't run it as is as it uses full reflectionwhich is not allowed in the playground, but on a Debian system you caninstall the kotlin package and follow the instructions in the header tocompile and run it (or just remove the fuzzing code to run the models inthe web playground).


[1]: https://salsa.debian.org/-/snippets/793
[2]: https://play.kotlinlang.org/

#0 - There are 400 Debconf attendees
#1 - 5% of Debconf attendees currently have COVID
#2 - 20% of people with COVID are either asymptomatic or ignore theirsymptoms
#3 - COVID tests give a false negative 10% of the time
#4 - 100% of people who know they have COVID wear a mask at all times
#5 - 0% of unknowingly COVID-positive people wear masks
#6 - The probability of a particular person being infected by anunmasked person with COVID is 5%#7 - The probability of a particular person being infected by a COVIDpositive person wearing a mask is 1%

I kept #0 constant, made the "upper" bound of #5 100% (would haveremained 0 otherwise), and #4 has no "upper" bound as it's already atits max.


This yields, indeed, "interesting" results.

Depending on the combination and the model, the number of newly infectedattendees (without self-isolation) could be as low as 9 (both models,p=0.0229) or with my model as high as 387 (p=0.98473) or 364 with yours(p=0.95789). And the difference in probability of getting infectedintroduced by self-isolation ranges all the way from 0 (in several highcontamination by unknowingly infected people combinations, e.g. byraising #1 to #3) to 0.99211 (by lowering #1 to#3 and #7, and raising #5and #6).

My conclusion is that this estimate of yours isn't worth anything, andneither are mine in this message. Actual contagion probabilities aredepending on too many factors, most of them being impossible to evenestimate reasonably approximately, and AFAICT researchers in the fielddo not even try to figure them out. That's why they focus on othermetrics such as R numbers, relative risk (or odds) factors, and costassessments (i.e. cost of not implementing a policy vs cost ofimplementing it). We should keep that in mind while discussing thispolicy.


Cheers,

--
Julien Plissonneau Duquène

Reply to:

Follow-Ups:
- Re: COVID [was: Re: DebConf 25 Daily announcements - 2025.07.15 - Daytrip information && DebConf Day 2]
  - From: Alex Lieflander <public@atlief.com>

References:
- Re: COVID [was: Re: DebConf 25 Daily announcements - 2025.07.15 - Daytrip information && DebConf Day 2]
  - From: Julien Plissonneau Duquène <sre4ever@free.fr>
- Re: COVID [was: Re: DebConf 25 Daily announcements - 2025.07.15 - Daytrip information && DebConf Day 2]
  - From: Alex Lieflander <public@atlief.com>

Prev by Date: Re: [staff] Debian Install workshop in Brest
Next by Date: Re: COVID [was: Re: DebConf 25 Daily announcements - 2025.07.15 - Daytrip information && DebConf Day 2]
Previous by thread: Re: COVID [was: Re: DebConf 25 Daily announcements - 2025.07.15 - Daytrip information && DebConf Day 2]
Next by thread: Re: COVID [was: Re: DebConf 25 Daily announcements - 2025.07.15 - Daytrip information && DebConf Day 2]
Index(es):
- Date
- Thread