Re: Bug#97336: ITP: dsh -- dancer's shell, or a distributed shell.
On Monday 21 May 2001 17:15, Junichi Uekawa wrote:
> Do you mean, some architecture where a standard system-wide configuration
> file exists so that mpich and dsh can rip off the list of machines from one
> That would be good, and Debianlike.
Yeah, definitely :) Not just mpich and dsh, but any package that is a beowuf
program could use that. I have in mind a nice scripting environment not
unlike the menu system.
Athough a beowulf admin could certainly do all that by himself, some
convenience wouldn't hurt. We can also build cool administration utils on
top of that.
So, how would it look like? We should have an easy way of defining beowulf
Some standard location for beowulf configuration would be the first step.
|___ clusters -- names of clusters
Specifying the content of those configuration files would be an important
Then, some utility like `update-beowulf' could run package-specific filters.
We can also provide system tools in beowulf packages with wrappers that
automatically update their nodes list, etc. Inserting some fault tolerance
there wouldn't be difficult. We could also determine standard system services
such as logging, monitoring and testing for the beowulf clusters.
My idea is simply to store configuration in the beowulf "interface" node.
Next, we could define what a beowulf configuration is. What configuration and
programs beowulf "interface" and "compute" nodes will have. Those
configurations would be automated through some packages that oversee
tweaking of various parameters.
Now that fai is finally released we could also try to hook fai into all that.
Of course, debian-installer development is going on and we should see how
scriptable that is. We could certainly use a client/server version of
debian-installer to do "magic-floppy" or netboot installs. Installation is
a very important part of any beowulf setup. So are upgrades, configuration
changes etc. I remember how many times nodes have been re-installed
or scripts written just to change a small thing at our cluster.
Surely these are off the top of my head. :) Kind of a wishlist as a suffering
beowulf admin, and would be fun to implement.
Looking forward to your suggestions.