[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Mirror split stuff



On 1/13/06, Anthony Towns <aj@azure.humbug.org.au> wrote:
> Hey all,
>
> First, the executive summary for mirror operators reading this: we'll be
> switching the primary mirror stuff for Debian to be for a small number
> of architectures rather than all of them; initially this will just be
> i386, but will probably expand to include amd64. Single architecture
> mirrors appear to need about 60GB of disk space, dual architecture
> mirrors will need about 80GB of disk space. A full mirror at the moment
> requires slightly over 170GB. We'll be trying to get in touch with you
> all further over the next week or two to make sure the changes go as
> smoothly as possible.
>
> Second, there's a call for help down below. Particularly for people with
> some web programming skillz.
>
> Okay, so!
>
> You've probably heard about this mirror split thing, also known as "scc",
> "second class citizens", and "the evil plot by the cabal to make poor
> blighted amd64 developers and users suffer unduly".
>
> At present, most mirrors mirror all the archive, and at present almost all
> Debian users that use those mirrors use i386 [0]. But with the archive
> growing almost daily (recently breaking the 170GB mark, up from 130GB
> in July iirc), and mirrors often finding it hard to keep up, that's a
> bad tradeoff. That tradeoff becomes even worse when we're unwilling to
> add interesting new architectures because of the immediate increase in
> archive size they cause, in addition to the ongoing accretion.
>
> So obviously that tradeoff's changing in favour of partial mirrors,
> particularly by architecture.
>
> People have been able to do partial mirrors for a while now, and the
> anonftpsync [1] tool we offer includes explicit support for that. In
> further support of partial mirrors, we'll be doing these things:
>
>         (a) allowing and in some cases encouraging official mirrors to
>             mirror a limited number of archictures
>
>         (b) limiting ftp.debian.org to i386 only (probably to also
>             include amd64, depending on how popular that architecture
>             actually is)
>
>         (c) providing simple recommendations on running arch-specific
>             mirrors
>
>         (d) providing <cc>.<arch>.mirror.debian.org aliases to make it
>             easy for users to find a local mirror that supports their
>             preferred architecture
>
> As well as allowing additional architectures, these changes should
> make it plausible to think about a few other things, notably including
> additional suites in the archive (such as "volatile" or "backports"),
> and having the archive pulse occur more frequently than daily.
>
> But one of the things all this stuff requires is some good communication
> with our mirrors. That can use some work at the moment, and one thing
> that would be particularly helpful is some better tracking of who's
> mirroring what. At present we have the "Mirrors masterlist" [2], the
> mirrors pseudopackage on the BTS [3] (and its corresponding mailing list,
> which has public archives on master [4]) and the Debian mirror checker
> [5].
>
> What would be helpful would to improve our tracking stuff so that:
>
>         (a) the mirror checker can provide more detailed information on
>             mirror stability such as that offered by the apache tracker [6]
>
>         (b) the mirror checker script can verify which architectures a
>             mirror actually carries
>
>         (c) there's some easy way for mirror admins to add, remove and
>             update their details in the masterlist, rather than waiting
>             for a developer to review the changes (with additions
>             confirmed automatically by the checker, ideally)
>
>         (d) there are status fields to indicate whether the mirror supports
>             pushing downstream mirrors by ssh trigger, in the same manner as
>             the top level Debian mirrors
>
>         (e) what mirror each mirror mirrors from (ideally checked
>             automatically by looking in project/trace; and ideally with
>             a pretty graph generated from the data, highlighting mirrors
>             that are out of date)
>
>         (f) whether the mirror is able to accept *.mirror.debian.org
>             requests, so that if, eg, "at.alpha.mirror.debian.org"
>             goes down, it can automatically be pointed at another site
>             in Austria, or if none are available, another nearby alpha
>             mirror.
>
> But that's not really something I'm good at, and the existing folks working
> on organising the mirrors haven't had time to magic it up either, so if other
> folks would like to try their hand at whipping something up that can be
> made official that'd be great.
>
> Having it be something that can be run with minimal priveleges, not
> require PHP or a large framework, something that supports the existing
> Mirrors masterlist format for input and output, and be something that
> is written to run efficiently would be great. Naturally it needs to be
> free software. Followups on #debian-mirrors on OFTC, or to -devel I guess


Reply to: