Re: Journald's qualities

To: debian-user@lists.debian.org
Subject: Re: Journald's qualities
From: Stefan Monnier <monnier@iro.umontreal.ca>
Date: Fri, 23 Feb 2024 23:17:27 -0500
Message-id: <jwvzfvqlcw6.fsf-monnier+gmane.linux.debian.user@gnu.org>
References: <ZdhwQX1feB4oseHJ@phare.normalesup.org> <20240223124225.6f5374dd@hydra.home.zxz.li> <ZdiJOCqxiaBbs0de@phare.normalesup.org> <20240223134341.18be2048@hydra.home.zxz.li> <jwv1q93usq9.fsf-monnier+gmane.linux.debian.user@gnu.org> <20240223181018.7ea534d3@hydra.home.zxz.li> <jwvmsrrlz4y.fsf-monnier+gmane.linux.debian.user@gnu.org> <65D93434.1090103@fastmail.fm>

>>>> but what are the advantages of journald's representation compared
>>>> to a naive one?
>>> 
>>> in short: queryability without text parsing. That's about it.
>> 
>> They have to parse the binary format, so that's not in and of itself
>> an upside compared to parsing CSV.
>> 
>> I've made my share of bad design decisions that don't pan out. But
>> there's always an upside to my decision (even when it turns out it 
>> speeds up only those cases which can never occur, because of some
>> other aspect of the system).
>> 
>> AFAICT the format is *not* just a plain sequence of log entries, so 
>> there's some additional structure which is intended to speed up some
>> operations.
>> 
>> IOW, even if contrived, there should be *some* use case where it
>> does better than CSV, no?
>
> I can think of two possibilities, just offhand, in no particular order:
>
> * No need to parse the timestamps, et cetera, and take the risk that
> someone put in one that's in a format you don't expect; the times are
> stored internally in a consistent guaranteed format, so you can just use
> internal reader functions (paired with, and updated alongside, the
> internal writer functions) and be done with it.

I can't think of any reason why the same wouldn't apply to CSV: the
reader and writer functions can be paired there too, and if someone
messes up the timestamps by hand, they're on their own.
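
To illustrate, here is a minimal sketch of a paired writer and reader
over plain CSV.  The function names and the three-column layout are
made up for the example, and storing the time as integer microseconds
since the epoch is just one possible convention, not a claim about how
any existing tool lays out its files.

```python
import csv
import io
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)
ONE_MICROSECOND = timedelta(microseconds=1)


def write_entry(out, when, unit, message):
    """Append one entry; `when` must be a timezone-aware datetime."""
    # Integer division of timedeltas is exact, so no float rounding occurs.
    micros = (when - EPOCH) // ONE_MICROSECOND
    csv.writer(out).writerow([micros, unit, message])


def read_entries(src):
    """Yield (datetime, unit, message) tuples written by write_entry."""
    for micros, unit, message in csv.reader(src):
        yield EPOCH + int(micros) * ONE_MICROSECOND, unit, message


if __name__ == "__main__":
    buf = io.StringIO(newline="")
    stamp = datetime(2024, 2, 23, 23, 17, 27, 123456, tzinfo=timezone.utc)
    write_entry(buf, stamp, "cron.service", "job started")
    buf.seek(0)
    for entry in read_entries(buf):
        print(entry)
```

As long as only write_entry produces the file, read_entries never has
to guess at a timestamp format.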

> * No need to worry about handling log entries that *contain* commas, or
> whatever other element was chosen as the separator.

That's just a very minor convenience issue and it does not require
a structure any more complex than a plain sequence of log entries.
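
As a sketch of why the separator problem is minor: standard CSV quoting
already round-trips fields containing commas, quotes and newlines, and
the file is still just one record after another.  The helper names and
sample entries below are invented for the example.

```python
import csv
import io


def dump(entries):
    """Serialize (unit, message) pairs as CSV text."""
    buf = io.StringIO(newline="")
    csv.writer(buf).writerows(entries)
    return buf.getvalue()


def load(text):
    """Parse CSV text produced by dump() back into a list of tuples."""
    return [tuple(row) for row in csv.reader(io.StringIO(text, newline=""))]


if __name__ == "__main__":
    entries = [
        ("sshd", 'Failed password for "root", port 22'),
        ("app", "line one\nline two"),
    ]
    # The separator, the quote character and the embedded newline all survive.
    print(load(dump(entries)) == entries)
```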

Same for FSS (Forward Secure Sealing): it doesn't seem to require the
more complex structure used by journald.  There must have been some
other use case they had in mind, where they thought they could avoid
the linear-time scan or something, in a way that they expected would be
algorithmically beneficial.  I just can't see what it is they had in mind.


        Stefan
