
Re: GIT for pdiff generation



>> Right now the source contents of unstable is, unpacked, 220MB. (Packed
>> with gzip it's 28MB, while the binary contents per arch each have 18MB
>> packed.)
> That should not be a problem in any non-joke box.  Unless you run it
> in a memory-constrained VM or something.

Well. For our archives it is turned on for the main and the backports
one. I don't think main will ever run into trouble there (free(1)
output, in kB):
             total       used       free     shared    buffers     cached
Mem:      33006584   29241780    3764804          0    2343936   20783680

while the backports box isn't as big, but still large enough:
             total       used       free     shared    buffers     cached
Mem:       8198084    7352164     845920          0    1063012    5650672

>> Let's add a safety margin: 350MB is a good guess for the largest.
>> A Packages file hardly counts compared to them; unpacked it's just
>> some 34MB.
> I.e. something very easy to keep in RAM on a "server class" or "desktop
> class" box.

Yes.

>> > Other than that, git loads entire objects to memory to manipulate them,
>> > which AFAIK CAN cause problems in datasets with very large files (the
>> > problem is not usually the size of the repository, but rather the size
>> > of the largest object).  You probably want to test your use case with
>> > several worst-case files AND a large safety margin to ensure it won't
>> > break on us anytime soon, using something to track git memory usage.
>> Well, yes.
> At the sizes you explained now (I thought it would deal with objects 7GB
> in size, not 7GB worth of objects at most 0.5GB in size), it should not
> be a problem in any box with a reasonable amount of free RAM and VM
> space (say, 1GB).

Right, could have written that better.
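
For the record, here is a rough way to check that point empirically.
This is a sketch only: it assumes Linux, Python 3 and git in PATH,
reuses the 350MB safety-margin guess from above, and the file name and
commit identity are made up for the test. It commits one poorly
compressible worst-case file into a scratch repository and reports the
peak RSS of the git child processes:

  #!/usr/bin/env python3
  # Sketch: peak git memory while committing one worst-case object.
  # 350MB is the safety-margin guess from this thread, not a hard limit.
  import os
  import resource
  import subprocess
  import tempfile

  WORST_CASE_BYTES = 350 * 1024 * 1024

  def git(*args, cwd):
      subprocess.run(("git",) + args, cwd=cwd, check=True,
                     stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)

  with tempfile.TemporaryDirectory() as repo:
      git("init", cwd=repo)
      # Random data, so the blob is poorly compressible (worst case).
      with open(os.path.join(repo, "Contents"), "wb") as f:
          for _ in range(WORST_CASE_BYTES // (1 << 20)):
              f.write(os.urandom(1 << 20))
      git("add", "Contents", cwd=repo)
      git("-c", "user.name=test", "-c", "user.email=test@example.invalid",
          "commit", "-m", "worst-case object", cwd=repo)
      # ru_maxrss: peak RSS over all exited children; kB on Linux.
      peak = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
      print("peak git child RSS: %d MB" % (peak // 1024))

ru_maxrss only covers children that have already exited, which is fine
here since subprocess.run waits for each git invocation to finish.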

-- 
bye, Joerg
<liw> I'm a blabbermouth

