I am a junior system administrator at one of the engineering schools. One of the professors received a donation of 45 servers (Dell PowerEdge 1690) from Yahoo. His requirements are as follows:
Hadoop (MapReduce) on Linux (which flavor of Linux and which Hadoop distribution?)
Pig on top of Hadoop
Dryad on top of Windows
MPI on Linux
Possibly other software, for example for cloud computing
I would like to build a cluster using VMware so that I can make the best use of the hardware. I am very new to virtualization. Can anyone suggest how I should go about it? I am really looking forward to working on this project, as it will give me good exposure and some hands-on experience.
This will be a lab that many students log in to simultaneously. I am planning to use LDAP to authenticate students against our Active Directory.
So how should I approach this? Which strategy would work best in this scenario? Any input is appreciated. Thank you.