Re: debian package of hadoop

To: common-user@hadoop.apache.org
Cc: Debian Java List <debian-java@lists.debian.org>
Subject: Re: debian package of hadoop
From: Steve Loughran <stevel@apache.org>
Date: Mon, 04 Jan 2010 14:46:55 +0000
Message-id: <[🔎] 4B41FF5F.1020908@apache.org>
In-reply-to: <200912311128.55967.thomas@koch.ro>
References: <200912301953.47398.thomas@koch.ro> <20091230194348.GA9044@moriah> <200912311128.55967.thomas@koch.ro>

Thomas Koch wrote:

Hi Jordà,
The main issue that prevents the inclusion of the current Cloudera
package into Debian is that it depends on Sun's Java. I think it would
be interesting, at least for an official Debian package, to depend on
OpenJDK in order to make it possible to distribute it in "main" instead
of "contrib".
The build-depends line can easily be changed as long as hadoop will build withopenjdk. The binary will depend on java5-runtime-headless which is provided byany java runtime. So the user of the package is free to choose either Sun oropenjdk.

Java6+ only. It will build on openjdk or jrockit, the Hadoop teammerely chooses to ignore all bug reports that you can't recreate on theofficial JDKs. You are still free to fix them yourself. You must alsoknow that your JVM hasn't been tested at scale, unless you have thescale to compare with the big datacentres.


What use cases are you thinking of here?

1) developer coding against the hadoop Java and C APIs
2) Someone setting up a small 1-5 machine cluster
3) large production datacentre of hundreds of worker nodes
4) transient virtualised worker nodes

for (3) and (4) the challenge is getting the right configuration outthere, where configuration =

hadoop XML files
log4j settings
rack awareness scripts
and such like

For virtualised clusters you set up one node then ask the infrastructurefor 100 instances; for physical ones you just need to get the rightfiles out everywhere. Packaging them up and pushing it out as a .deb orRPM is one option -the cloudera one- and is better than trying to byhand -but it is only one option.

Reply to:

Follow-Ups:
- Re: debian package of hadoop
  - From: Isabel Drost <isabel@apache.org>

Prev by Date: Re: debian package of hadoop
Next by Date: Re: State of openjdk on hppa
Previous by thread: Re: debian package of hadoop
Next by thread: Re: debian package of hadoop
Index(es):
- Date
- Thread