Re: Dreamhost dumps Debian

To: debian-devel@lists.debian.org
Subject: Re: Dreamhost dumps Debian
From: Russ Allbery <rra@debian.org>
Date: Tue, 27 Aug 2013 13:47:01 -0700
Message-id: <[🔎] 8738pu6euy.fsf@windlord.stanford.edu>
In-reply-to: <[🔎] 1377633770-sup-2279@fewbar.com> (Clint Byrum's message of "Tue, 27 Aug 2013 13:20:45 -0700")
References: <[🔎] 87wqnhpkn3.fsf@jidanni.org> <[🔎] m338q5lbk6.fsf@neo.luffy.cx> <[🔎] 1376942753-sup-2696@fewbar.com> <[🔎] m3y57xjsg6.fsf@neo.luffy.cx> <[🔎] 20130820000440.GA25760@falafel.plessy.net> <[🔎] 21011.32310.864168.568030@chiark.greenend.org.uk> <[🔎] 20130820145339.GA2363@angband.pl> <[🔎] 21011.34255.48484.270530@chiark.greenend.org.uk> <[🔎] CAKcBoksQWK7a-vyeCBm+XE34=gh9nbUM0W2+S7zYzyw_n76xoQ@mail.gmail.com> <[🔎] 20130820200214.GD26098@simplex.0x539.de> <[🔎] 887559.77544.bm@smtp140.mail.ir2.yahoo.com> <[🔎] 521A9510.6060405@debian.org> <[🔎] 575000.76357.bm@smtp104.mail.ir2.yahoo.com> <[🔎] 1377633770-sup-2279@fewbar.com>

Clint Byrum <spamaps@debian.org> writes:

> Perhaps you missed the blog post [1] details?

> "About ten months ago, we realized that the next installation of Debian
> was upcoming, and after upgrading about 20,000 machines since Debian 6
> (aka Squeeze) was released, we got pretty tired."

> Even if the script is _PERFECT_ and handles all of the changes in
> wheezy, just scheduling downtime and doing basic sanity checks on 20,000
> machines would require an incredible effort. If you started on release
> day, and finished 2-3 machines per hour without taking any weekend days
> off, you would just barely finish in time for oldstable to reach EOL. I
> understand that they won't be done in a linear fashion, and some will
> truly be a 5 minute upgrade/reboot, but no matter how you swing it you
> are talking about a very expensive change.

A few comments here from an enterprise administration perspective:

First, if you have 20,000 machines, it's highly unlikely that each system
will be a special snowflake.  In that environment, you're instead talking
about large swaths of systems that are effectively identical.  You
therefore don't have to repeat your sanity checking on each individual
system, just on representives of the class, while using your configuration
management system to ensure that all the systems in a class are identical.
And in many cases you won't have to arrange downtime at all (because the
systems are part of redundant pools).

Second, with 20,000 machines, there is no way that I would upgrade the
systems.  Debian's upgrade support is very important for individual
systems, personal desktops, and smaller-scale environments, but even when
you're at the point of several dozen systems, I would stop doing upgrades.
At Stanford, we have a general policy that we rebuild systems from FAI for
new Debian releases.  All local data is kept isolated from the operating
system (or, ideally, not even on that system, which is the most common
case -- data is on separate database servers or on the network file
system) so that you can just wipe the disk, build a new system on the
current stable, and put the data back on (after performing whatever
related upgrade process you need to perform).  There's up-front
development required for your new service model for the new operating
system release, which you validate outside of production, and then the
production rollout is mechanical system rebuilds (which usually take under
10 minutes with FAI and are parallelizable).

My personal opinion is that if someone is scripting an upgrade to 20,000
systems and running it on those systems one-by-one, they're doing things
at the wrong scale and with the wrong tools for that sort of environment.

-- 
Russ Allbery (rra@debian.org)               <http://www.eyrie.org/~eagle/>

Reply to:

Follow-Ups:
- Re: Dreamhost dumps Debian
  - From: Clint Byrum <spamaps@debian.org>

References:
- Dreamhost dumps Debian
  - From: jidanni@jidanni.org
- Re: Dreamhost dumps Debian
  - From: Vincent Bernat <bernat@debian.org>
- Re: Dreamhost dumps Debian
  - From: Clint Byrum <spamaps@debian.org>
- Re: Dreamhost dumps Debian
  - From: Vincent Bernat <bernat@debian.org>
- Re: Dreamhost dumps Debian
  - From: Charles Plessy <plessy@debian.org>
- Re: Dreamhost dumps Debian
  - From: Ian Jackson <ijackson@chiark.greenend.org.uk>
- Re: Dreamhost dumps Debian
  - From: Adam Borowski <kilobyte@angband.pl>
- Re: Dreamhost dumps Debian
  - From: Ian Jackson <ijackson@chiark.greenend.org.uk>
- Re: Dreamhost dumps Debian
  - From: Pau Garcia i Quiles <pgquiles@elpauer.org>
- Re: Dreamhost dumps Debian
  - From: Philipp Kern <pkern@debian.org>
- Re: Dreamhost dumps Debian
  - From: Kevin Chadwick <ma1l1ists@yahoo.co.uk>
- Re: Dreamhost dumps Debian
  - From: Thomas Goirand <zigo@debian.org>
- Re: Dreamhost dumps Debian
  - From: Kevin Chadwick <ma1l1ists@yahoo.co.uk>
- Re: Dreamhost dumps Debian
  - From: Clint Byrum <spamaps@debian.org>

Prev by Date: Re: Dreamhost dumps Debian
Next by Date: Bug#721086: ITP: minetest-mod-plantlife -- Minetest mod - Plantlife
Previous by thread: Re: Dreamhost dumps Debian
Next by thread: Re: Dreamhost dumps Debian
Index(es):
- Date
- Thread