Re: beowulf with afs

To: debian-beowulf@lists.debian.org
Subject: Re: beowulf with afs
From: Adam C Powell IV <hazelsct@mit.edu>
Date: Tue, 29 Oct 2002 23:03:20 -0500
Message-id: <[🔎] 3DBF5A08.8030701@mit.edu>
References: <[🔎] Pine.LNX.4.44.0210152127510.23191-100000@latt.if.usp.br>

Jorge L. deLyra wrote:

of building a cluster with ~ 150 nodes and about 100-140 GB/node of
'free' hard drive space.

Man, that's a lot of disk. Just a wild idea I'm curious about: anybody
ever tried exporting disks from nodes via the nbd (network block device)
and assembling a huge raid0 (stripping) with the nbd's on the front end?
This might make a very fast disk even with IDE disks and traffic through
the network. This would be sort of a funny way to use a cluster...

That would be cool... especially if you could make it somewhatredundant, like RAID-4 or -5. Kinda like EMC-class storage on the cheap!

The latest Illuminator does something like this (0.4.1, sorry, not yetin unstable, I'm waiting for working mpich to go in with shared libs,the latest one doesn't build on ia-64 or hppa). Using PETSc distributedarrays, each node saves/loads the local part of the data to a separatefile, optionally compressed. If the filename is on a local disk, thenit's like a giant RAID-0.

Then you "playback" the data by doing distributed read from the localdisks, distributed triangulate (of contour surfaces in 3-D), and renderand rotate in Geomview on the head node. We've got 40 GB/node, which is20 GB/CPU, on our latest cluster, and plan to use it all for time seriesdata. (We're working on distributed rendering too, but that's still afew months off, maybe 0.6, or perhaps it'll be worthy of 0.9/beta by then.)

But you have to use PETSc distributed arrays for this to work. Also, itis RAID-0, if you lose one disk, you have an incomplete data set andit's pretty worthless. :-( Rewriting PETSc to be robust to node failurewould take just a few man-years...

Regarding an earlier post, I kind of like running our cluster "diskless"(NFS-root) with local disks for scratch storage, because it greatlysimplifies administration. In such a setup, using AFS/Coda/Intermezzofor, say, /home, /usr, /var, /tmp, /etc. (ha ha) might make sense,because the nodes would cache frequently-used files in the local scratchdisks, cutting down on network traffic and making distributed jobs startfaster than over NFS. Right?

I could even envision this as being useful for lots of clientworkstations, for which the centralized administration would let itscale to thousands of seats, and Coda/Intermezzo would make it somewhatrobust to network failure. Just get a boatload of these $199 Lindowsmachines, and netboot them all... But that's not a Beowulf, so it'soff-topic -- unless somebody harvests the machines' spare cycles...

I imagine Coda-root, etc. is a ways off; another approach might be tokeep an initrd for root, and mount everything else on that, in order torun "diskless" without NFS... if one really wanted to.


Okay, enough idle musing for an evening.

Zeen,
--

-Adam P.

GPG fingerprint: D54D 1AEE B11C CE9B A02B  C5DD 526F 01E8 564E E4B6

Welcome to the best software in the world today cafe!<http://lyre.mit.edu/%7Epowell/The_Best_Stuff_In_The_World_Today_Cafe.ogg>

Reply to:

Follow-Ups:
- Re: beowulf with afs
  - From: Carlos O'Donell <carlos@baldric.uwo.ca>

References:
- Re: beowulf with afs
  - From: "Jorge L. deLyra" <delyra@latt.if.usp.br>

Prev by Date: Re: beowulf with afs
Next by Date: Re: beowulf with afs
Previous by thread: Re: beowulf with afs
Next by thread: Re: beowulf with afs
Index(es):
- Date
- Thread